A molecular video-derived foundation model for scientific drug discovery

Hongxin Xiang; Li Zeng; Linlin Hou; Kenli Li; Zhimin Fu; Yunguang Qiu; Ruth Nussinov; Jianying Hu; Michal Rosen-Zvi; Xiangxiang Zeng; Feixiong Cheng

doi:10.1038/s41467-024-53742-z

Nature Communications (Nov 2024)

A molecular video-derived foundation model for scientific drug discovery

Hongxin Xiang,
Li Zeng,
Linlin Hou,
Kenli Li,
Zhimin Fu,
Yunguang Qiu,
Ruth Nussinov,
Jianying Hu,
Michal Rosen-Zvi,
Xiangxiang Zeng,
Feixiong Cheng

Affiliations

Hongxin Xiang: College of Computer Science and Electronic Engineering, Hunan University
Li Zeng: College of Computer Science and Electronic Engineering, Hunan University
Linlin Hou: College of Computer Science and Electronic Engineering, Hunan University
Kenli Li: College of Computer Science and Electronic Engineering, Hunan University
Zhimin Fu: Department of Pharmacy, Cleveland Clinic Akron General, Cleveland Clinic
Yunguang Qiu: Cleveland Clinic Genome Center, Lerner Research Institute, Cleveland Clinic
Ruth Nussinov: Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Cancer Innovation Laboratory, National Cancer Institute
Jianying Hu: IBM T.J. Watson Research Center, Yorktown Heights
Michal Rosen-Zvi: AI for Accelerated Healthcare and Life Sciences Discovery, IBM Research Labs
Xiangxiang Zeng: College of Computer Science and Electronic Engineering, Hunan University
Feixiong Cheng: Cleveland Clinic Genome Center, Lerner Research Institute, Cleveland Clinic

DOI: https://doi.org/10.1038/s41467-024-53742-z
Journal volume & issue: Vol. 15, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Accurate molecular representation of compounds is a fundamental challenge for prediction of drug targets and molecular properties. In this study, we present a molecular video-based foundation model, named VideoMol, pretrained on 120 million frames of 2 million unlabeled drug-like and bioactive molecules. VideoMol renders each molecule as a video with 60-frame and designs three self-supervised learning strategies on molecular videos to capture molecular representation. We show high performance of VideoMol in predicting molecular targets and properties across 43 drug discovery benchmark datasets. VideoMol achieves high accuracy in identifying antiviral molecules against common diverse disease-specific drug targets (i.e., BACE1 and EP4). Drugs screened by VideoMol show better binding affinity than molecular docking, revealing the effectiveness in understanding the three-dimensional structure of molecules. We further illustrate interpretability of VideoMol using key chemical substructures.

Published in Nature Communications

ISSN: 2041-1723 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Science
Website: https://www.nature.com/ncomms/

About the journal