Nature Communications (Jul 2023)

Macrocyclization of linear molecules by deep learning to facilitate macrocyclic drug candidates discovery

  • Yanyan Diao,
  • Dandan Liu,
  • Huan Ge,
  • Rongrong Zhang,
  • Kexin Jiang,
  • Runhui Bao,
  • Xiaoqian Zhu,
  • Hongjie Bi,
  • Wenjie Liao,
  • Ziqi Chen,
  • Kai Zhang,
  • Rui Wang,
  • Lili Zhu,
  • Zhenjiang Zhao,
  • Qiaoyu Hu,
  • Honglin Li

DOI
https://doi.org/10.1038/s41467-023-40219-8
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Interest in macrocycles as potential therapeutic agents has increased rapidly. Macrocyclization of bioactive acyclic molecules provides a potential avenue to yield novel chemical scaffolds, which can contribute to the improvement of the biological activity and physicochemical properties of these molecules. In this study, we propose a computational macrocyclization method based on Transformer architecture (which we name Macformer). Leveraging deep learning, Macformer explores the vast chemical space of macrocyclic analogues of a given acyclic molecule by adding diverse linkers compatible with the acyclic molecule. Macformer can efficiently learn the implicit relationships between acyclic and macrocyclic structures represented as SMILES strings and generate plenty of macrocycles with chemical diversity and structural novelty. In data augmentation scenarios using both internal ChEMBL and external ZINC test datasets, Macformer display excellent performance and generalisability. We showcase the utility of Macformer when combined with molecular docking simulations and wet lab based experimental validation, by applying it to the prospective design of macrocyclic JAK2 inhibitors.