Predicting chemical structure using reinforcement learning with a stack-augmented conditional variational autoencoder

Hwanhee Kim; Soohyun Ko; Byung Ju Kim; Sung Jin Ryu; Jaegyoon Ahn

doi:10.1186/s13321-022-00666-9

Journal of Cheminformatics (Dec 2022)

Predicting chemical structure using reinforcement learning with a stack-augmented conditional variational autoencoder

Hwanhee Kim,
Soohyun Ko,
Byung Ju Kim,
Sung Jin Ryu,
Jaegyoon Ahn

Affiliations

Hwanhee Kim: Department of Computer Science and Engineering, Incheon National University
Soohyun Ko: GenesisEgo
Byung Ju Kim: UBLBio Corporation
Sung Jin Ryu: UBLBio Corporation
Jaegyoon Ahn: Department of Computer Science and Engineering, Incheon National University

DOI: https://doi.org/10.1186/s13321-022-00666-9
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 12

Abstract

Read online

Abstract In this paper, a reinforcement learning model is proposed that can maximize the predicted binding affinity between a generated molecule and target proteins. The model used to generate molecules in the proposed model was the Stacked Conditional Variation AutoEncoder (Stack-CVAE), which acts as an agent in reinforcement learning so that the resulting chemical formulas have the desired chemical properties and show high binding affinity with specific target proteins. We generated 1000 chemical formulas using the chemical properties of sorafenib and the three target kinases of sorafenib. Then, we confirmed that Stack-CVAE generates more of the valid and unique chemical compounds that have the desired chemical properties and predicted binding affinity better than other generative models. More detailed analysis for 100 of the top scoring molecules show that they are novel ones not found in existing chemical databases. Moreover, they reveal significantly higher predicted binding affinity score for Raf kinases than for other kinases. Furthermore, they are highly druggable and synthesizable.

Published in Journal of Cheminformatics

ISSN: 1758-2946 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Science: Chemistry
Website: https://jcheminf.biomedcentral.com/

About the journal

Abstract

Keywords