Frontiers in Genetics (Aug 2021)

Improving de novo Molecule Generation by Embedding LSTM and Attention Mechanism in CycleGAN

  • Feng Wang,
  • Feng Wang,
  • Xiaochen Feng,
  • Xiao Guo,
  • Lei Xu,
  • Liangxu Xie,
  • Shan Chang

DOI
https://doi.org/10.3389/fgene.2021.709500
Journal volume & issue
Vol. 12

Abstract

Read online

The application of deep learning in the field of drug discovery brings the development and expansion of molecular generative models along with new challenges in this field. One of challenges in de novo molecular generation is how to produce new reasonable molecules with desired pharmacological, physical, and chemical properties. To improve the similarity between the generated molecule and the starting molecule, we propose a new molecule generation model by embedding Long Short-Term Memory (LSTM) and Attention mechanism in CycleGAN architecture, LA-CycleGAN. The network layer of the generator in CycleGAN is fused head and tail to improve the similarity of the generated structure. The embedded LSTM and Attention mechanism can overcome long-term dependency problems in treating the normally used SMILES input. From our quantitative evaluation, we present that LA-CycleGAN expands the chemical space of the molecules and improves the ability of structure conversion. The generated molecules are highly similar to the starting compound structures while obtaining expected molecular properties during cycle generative adversarial network learning, which comprehensively improves the performance of the generative model.

Keywords