Nature Communications (Jun 2024)

Deep representation learning of chemical-induced transcriptional profile for phenotype-based drug discovery

  • Xiaochu Tong,
  • Ning Qu,
  • Xiangtai Kong,
  • Shengkun Ni,
  • Jingyi Zhou,
  • Kun Wang,
  • Lehan Zhang,
  • Yiming Wen,
  • Jiangshan Shi,
  • Sulin Zhang,
  • Xutong Li,
  • Mingyue Zheng

DOI
https://doi.org/10.1038/s41467-024-49620-3
Journal volume & issue
Vol. 15, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Artificial intelligence transforms drug discovery, with phenotype-based approaches emerging as a promising alternative to target-based methods, overcoming limitations like lack of well-defined targets. While chemical-induced transcriptional profiles offer a comprehensive view of drug mechanisms, inherent noise often obscures the true signal, hindering their potential for meaningful insights. Here, we highlight the development of TranSiGen, a deep generative model employing self-supervised representation learning. TranSiGen analyzes basal cell gene expression and molecular structures to reconstruct chemical-induced transcriptional profiles with high accuracy. By capturing both cellular and compound information, TranSiGen-derived representations demonstrate efficacy in diverse downstream tasks like ligand-based virtual screening, drug response prediction, and phenotype-based drug repurposing. Notably, in vitro validation of TranSiGen’s application in pancreatic cancer drug discovery highlights its potential for identifying effective compounds. We envisage that integrating TranSiGen into the drug discovery and mechanism research holds significant promise for advancing biomedicine.