Personalized anti-tumor drug efficacy prediction based on clinical data
Xinping Xie,
Dandan Li,
Yangyang Pei,
Weiwei Zhu,
Xiaodong Du,
Xiaodong Jiang,
Lei Zhang,
Hong-Qiang Wang
Affiliations
Xinping Xie
School of Mathematics and Physics, Anhui Jianzhu University, Hefei, China
Dandan Li
School of Mathematics and Physics, Anhui Jianzhu University, Hefei, China
Yangyang Pei
School of Mathematics and Physics, Anhui Jianzhu University, Hefei, China
Weiwei Zhu
Institute of Intelligent Machines/Zhongqi AI Lab., Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, China
Xiaodong Du
Experimental Teaching Center, Hefei University, Hefei, China
Xiaodong Jiang
Medical Oncology Department, The First Affiliated Hospital of University of Science and Technology of China, Hefei, Anhui, 230001, China
Lei Zhang
Pharmacy Department, The First Affiliated Hospital of University of Science and Technology of China, Hefei, Anhui, 230001, China
Hong-Qiang Wang
Institute of Intelligent Machines/Zhongqi AI Lab., Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, China; Corresponding author.
Anti-tumor drug efficacy prediction poses an unprecedented challenge to realizing personalized medicine. This paper proposes to predict personalized anti-tumor drug efficacy based on clinical data. Specifically, we encode the clinical text as numeric vectors featured with hidden topics for patients using Latent Dirichlet Allocation model. Then, to classify patients into two classes, responsive or non-responsive to a drug, drug efficacy predictors are established by machine learning based on the Latent Dirichlet Allocation topic representation. To evaluate the proposed method, we collected and collated clinical records of lung and bowel cancer patients treated with platinum. Experimental results on the data sets show the efficacy and effectiveness of the proposed method, suggesting the potential value of clinical data in cancer precision medicine. We hope that it will promote the research of drug efficacy prediction based on clinical data.