A method combining LDA and neural networks for antitumor drug efficacy prediction

Weiwei Zhu; Lei Zhang; Xiaodong Jiang; Peng Zhou; Xinping Xie; Hongqiang Wang

doi:10.1177/20552076241280103

Digital Health (Sep 2024)

A method combining LDA and neural networks for antitumor drug efficacy prediction

Weiwei Zhu,
Lei Zhang,
Xiaodong Jiang,
Peng Zhou,
Xinping Xie,
Hongqiang Wang

Affiliations

Weiwei Zhu: Institute of Intelligent Machines, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, Anhui, China
Lei Zhang: Department of Pharmacy, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, , Hefei, Anhui, China
Xiaodong Jiang: Medical Oncology Department, The First Affiliated Hospital of University of Science and Technology of China, Hefei, Anhui, China
Peng Zhou: School of Life Science, Hefei Normal University, Hefei, Anhui, China
Xinping Xie: School of Mathematics and Physics, Anhui Jianzhu University, Hefei, Anhui, China
Hongqiang Wang: Institute of Intelligent Machines, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, Anhui, China

DOI: https://doi.org/10.1177/20552076241280103
Journal volume & issue: Vol. 10

Abstract

Read online

Background Personalized medicine has gained more attention for cancer precision treatment due to patient genetic heterogeneity in recent years. However, predicting the efficacy of antitumor drugs in advance remains a significant challenge to achieve this task. Objective This study aims to predict the efficacy of antitumor drugs in individual cancer patients based on clinical data. Methods This paper proposes to predict personalized antitumor drug efficacy based on clinical data. Specifically, we encode the clinical text of cancer patients as a probability distribution vector in hidden topics space using the Latent Dirichlet Allocation (LDA) model, named LDA representation. Then, a neural network is designed, and the LDA representation is input into the neural network to predict drug response in cancer patients treated with platinum drugs. To evaluate the effectiveness of the proposed method, we gathered and organized clinical records of lung and bowel cancer patients who underwent platinum-based treatment. The prediction performance is assessed using the following metrics: Precision, Recall, F1-score, Accuracy, and Area Under the ROC Curve (AUC). Results The study analyzed a dataset of 958 patients with non-small cell cancer treated with antitumor drugs. The proposed method achieved a stratified 5-fold cross-validation average Precision of 0.81, Recall of 0.89, F1-score of 0.85, Accuracy of 0.77, and AUC of 0.81 for cisplatin efficacy prediction on the data, which most are better than those of previous methods. Of these, the AUC value is at least 4% higher than those of the previous. At the same time, the superior result over the previous method persisted on an independent dataset of 266 bowel cancer patients, showing the generalizability of the proposed method. These results demonstrate the potential value of precise tumor treatment in clinical practice. Conclusions Combining LDA and neural networks can help predict the efficacy of antitumor drugs based on clinical text. Our approach outperforms previous methods in predicting drug clinical efficacy.

Published in Digital Health

ISSN: 2055-2076 (Online)
Publisher: SAGE Publishing
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://journals.sagepub.com/home/dhj

About the journal