Evaluation of reinforcement learning in transformer-based molecular design

Jiazhen He; Alessandro Tibo; Jon Paul Janet; Eva Nittinger; Christian Tyrchan; Werngard Czechtizky; Ola Engkvist

doi:10.1186/s13321-024-00887-0

Journal of Cheminformatics (Aug 2024)

Evaluation of reinforcement learning in transformer-based molecular design

Jiazhen He,
Alessandro Tibo,
Jon Paul Janet,
Eva Nittinger,
Christian Tyrchan,
Werngard Czechtizky,
Ola Engkvist

Affiliations

Jiazhen He: Molecular AI, Discovery Sciences, R&D, AstraZeneca
Alessandro Tibo: Molecular AI, Discovery Sciences, R&D, AstraZeneca
Jon Paul Janet: Molecular AI, Discovery Sciences, R&D, AstraZeneca
Eva Nittinger: Medicinal Chemistry, Research and Early Development, Respiratory and Immunology (R&I), BioPharmaceuticals R&D, AstraZeneca
Christian Tyrchan: Medicinal Chemistry, Research and Early Development, Respiratory and Immunology (R&I), BioPharmaceuticals R&D, AstraZeneca
Werngard Czechtizky: Medicinal Chemistry, Research and Early Development, Respiratory and Immunology (R&I), BioPharmaceuticals R&D, AstraZeneca
Ola Engkvist: Molecular AI, Discovery Sciences, R&D, AstraZeneca

DOI: https://doi.org/10.1186/s13321-024-00887-0
Journal volume & issue: Vol. 16, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Designing compounds with a range of desirable properties is a fundamental challenge in drug discovery. In pre-clinical early drug discovery, novel compounds are often designed based on an already existing promising starting compound through structural modifications for further property optimization. Recently, transformer-based deep learning models have been explored for the task of molecular optimization by training on pairs of similar molecules. This provides a starting point for generating similar molecules to a given input molecule, but has limited flexibility regarding user-defined property profiles. Here, we evaluate the effect of reinforcement learning on transformer-based molecular generative models. The generative model can be considered as a pre-trained model with knowledge of the chemical space close to an input compound, while reinforcement learning can be viewed as a tuning phase, steering the model towards chemical space with user-specific desirable properties. The evaluation of two distinct tasks—molecular optimization and scaffold discovery—suggest that reinforcement learning could guide the transformer-based generative model towards the generation of more compounds of interest. Additionally, the impact of pre-trained models, learning steps and learning rates are investigated. Scientific contribution Our study investigates the effect of reinforcement learning on a transformer-based generative model initially trained for generating molecules similar to starting molecules. The reinforcement learning framework is applied to facilitate multiparameter optimisation of starting molecules. This approach allows for more flexibility for optimizing user-specific property profiles and helps finding more ideas of interest.

Published in Journal of Cheminformatics

ISSN: 1758-2946 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Science: Chemistry
Website: https://jcheminf.biomedcentral.com/

About the journal

Abstract

Keywords