Drug discovery and development in the era of artificial intelligence: From machine learning to large language models

Shenghui Guan; Guanyu Wang

Artificial Intelligence Chemistry (Jun 2024)

Affiliations

Shenghui Guan: Futian Biomedical Innovation R&D Center, The Chinese University of Hong Kong, Shenzhen 518172, China; Laboratory of Biocomplexity and Engineering Biology, School of Medicine, The Chinese University of Hong Kong, Shenzhen 518172, China; Ciechanover Institute of Precision and Regenerative Medicine, School of Medicine, The Chinese University of Hong Kong, Shenzhen 518172, China
Guanyu Wang: Futian Biomedical Innovation R&D Center, The Chinese University of Hong Kong, Shenzhen 518172, China; Laboratory of Biocomplexity and Engineering Biology, School of Medicine, The Chinese University of Hong Kong, Shenzhen 518172, China; Ciechanover Institute of Precision and Regenerative Medicine, School of Medicine, The Chinese University of Hong Kong, Shenzhen 518172, China; Corresponding author at: Futian Biomedical Innovation R&D Center, The Chinese University of Hong Kong, Shenzhen 518172, China

Journal volume & issue: Vol. 2, no. 1
p. 100070

Abstract

Drug research and development (R&D) is a complex and difficult process that currently faces long timelines, high investment, and high failure rates. Machine learning, with its powerful ability to learn from big data and to characterize complex networks, is increasingly effective at improving the efficiency and success rate of drug R&D. Here we review recent examples of machine learning methods applied in the following areas: disease gene prediction, virtual screening, drug molecule generation, molecular attribute prediction, and prediction of drug combination synergism. We also discuss the advantages of integrative learning in multi-attribute prediction. Integrative models built from base learners trained on data of different dimensions both make full use of the information contained in those data and improve average prediction performance. Finally, we envision a new paradigm for drug discovery and development in which a large language model acts as a central hub that organizes public resources into a knowledge base and validates that knowledge with computational software, smaller predictive models, and high-throughput automated screening platforms based on organoid technologies. The aim is to speed up development and to narrow the gap in efficacy between disease models and humans, thereby improving drug success rates.

Published in Artificial Intelligence Chemistry

ISSN: 2949-7477 (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Science: Chemistry; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.sciencedirect.com/journal/artificial-intelligence-chemistry
