DrugReAlign: a multisource prompt framework for drug repurposing based on large language models

Jinhang Wei; Linlin Zhuo; Xiangzheng Fu; XiangXiang Zeng; Li Wang; Quan Zou; Dongsheng Cao

doi:10.1186/s12915-024-02028-3

BMC Biology (Oct 2024)

DrugReAlign: a multisource prompt framework for drug repurposing based on large language models

Jinhang Wei,
Linlin Zhuo,
Xiangzheng Fu,
XiangXiang Zeng,
Li Wang,
Quan Zou,
Dongsheng Cao

Affiliations

Jinhang Wei: School of Data Science and Artificial Intelligence, Wenzhou University of Technology
Linlin Zhuo: School of Data Science and Artificial Intelligence, Wenzhou University of Technology
Xiangzheng Fu: School of Chinese Medicine, Hong Kong Baptist University
XiangXiang Zeng: College of Computer Science and Electronic Engineering, Hunan University
Li Wang: Department of Computer Science, University of Tsukuba
Quan Zou: Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China
Dongsheng Cao: Central South University, Hunan University

DOI: https://doi.org/10.1186/s12915-024-02028-3
Journal volume & issue: Vol. 22, no. 1
pp. 1 – 16

Abstract

Read online

Abstract Drug repurposing is a promising approach in the field of drug discovery owing to its efficiency and cost-effectiveness. Most current drug repurposing models rely on specific datasets for training, which limits their predictive accuracy and scope. The number of both market-approved and experimental drugs is vast, forming an extensive molecular space. Due to limitations in parameter size and data volume, traditional drug-target interaction (DTI) prediction models struggle to generalize well within such a broad space. In contrast, large language models (LLMs), with their vast parameter sizes and extensive training data, demonstrate certain advantages in drug repurposing tasks. In our research, we introduce a novel drug repurposing framework, DrugReAlign, based on LLMs and multi-source prompt techniques, designed to fully exploit the potential of existing drugs efficiently. Leveraging LLMs, the DrugReAlign framework acquires general knowledge about targets and drugs from extensive human knowledge bases, overcoming the data availability limitations of traditional approaches. Furthermore, we collected target summaries and target-drug space interaction data from databases as multi-source prompts, substantially improving LLM performance in drug repurposing. We validated the efficiency and reliability of the proposed framework through molecular docking and DTI datasets. Significantly, our findings suggest a direct correlation between the accuracy of LLMs' target analysis and the quality of prediction outcomes. These findings signify that the proposed framework holds the promise of inaugurating a new paradigm in drug repurposing.

Published in BMC Biology

ISSN: 1741-7007 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Science: Biology (General)
Website: http://www.biomedcentral.com/bmcbiol/

About the journal

Abstract

Keywords