Machine learning application in personalised lung cancer recurrence and survivability prediction

Yang Yang; Li Xu; Liangdong Sun; Peng Zhang; Suzanne S. Farid

Computational and Structural Biotechnology Journal (Jan 2022)

Affiliations

Yang Yang: Department of Biochemical Engineering, University College London, Gower Street, London WC1E 6BT, UK
Li Xu: Department of Thoracic Surgery, Shanghai Pulmonary Hospital, Tongji University School of Medicine, Shanghai 200043, China
Liangdong Sun: Department of Thoracic Surgery, Shanghai Pulmonary Hospital, Tongji University School of Medicine, Shanghai 200043, China
Peng Zhang: Department of Thoracic Surgery, Shanghai Pulmonary Hospital, Tongji University School of Medicine, Shanghai 200043, China; Corresponding author.
Suzanne S. Farid: Department of Biochemical Engineering, University College London, Gower Street, London WC1E 6BT, UK; Corresponding author.

Journal volume & issue: Vol. 20
pp. 1811 – 1820

Abstract

Machine learning is an important artificial intelligence technique that is widely applied in cancer diagnosis and detection. More recently, with the rise of personalised and precision medicine, there is a growing trend towards machine learning applications for prognosis prediction. However, to date, building reliable prediction models of cancer outcomes in everyday clinical practice is still a hurdle. In this work, we integrate genomic, clinical and demographic data of lung adenocarcinoma (LUAD) and squamous cell carcinoma (LUSC) patients from The Cancer Genome Atlas (TCGA) and introduce copy number variation (CNV) and mutation information of 15 selected genes to generate predictive models for recurrence and survivability. We compare the accuracy and benefits of three well-established machine learning algorithms: decision tree methods, neural networks and support vector machines. Although the accuracy of predictive models using the decision tree method has no significant advantage, the tree models reveal the most important predictors among genomic information (e.g. KRAS, EGFR, TP53), clinical status (e.g. TNM stage and radiotherapy) and demographics (e.g. age and gender) and how they influence the prediction of recurrence and survivability for both early stage LUAD and LUSC. The machine learning models have the potential to help clinicians to make personalised decisions on aspects such as follow-up timeline and to assist with personalised planning of future social care needs.
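
The abstract notes that tree models can reveal which predictors matter most. The sketch below illustrates one common way this happens, assuming a CART-style tree that scores candidate splits by Gini impurity reduction; the abstract does not state which tree algorithm or importance measure the authors used. The feature names (`kras_mutated`, `late_stage`, `age_over_65`) and the eight toy records are invented for illustration and are not TCGA data.

```python
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels (0.0 means a pure node)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def split_gain(rows, labels, feature):
    """Impurity reduction obtained by splitting on one binary (0/1) feature."""
    left = [y for row, y in zip(rows, labels) if row[feature] == 0]
    right = [y for row, y in zip(rows, labels) if row[feature] == 1]
    n = len(labels)
    weighted = (len(left) / n) * gini(left) + (len(right) / n) * gini(right)
    return gini(labels) - weighted


def rank_predictors(rows, labels):
    """Rank features by the impurity reduction of a single split, best first."""
    gains = {feature: split_gain(rows, labels, feature) for feature in rows[0]}
    return sorted(gains.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy records (assumption: binary-encoded predictors).
rows = [
    {"kras_mutated": 1, "late_stage": 1, "age_over_65": 0},
    {"kras_mutated": 1, "late_stage": 1, "age_over_65": 1},
    {"kras_mutated": 0, "late_stage": 1, "age_over_65": 0},
    {"kras_mutated": 1, "late_stage": 1, "age_over_65": 1},
    {"kras_mutated": 0, "late_stage": 0, "age_over_65": 0},
    {"kras_mutated": 0, "late_stage": 0, "age_over_65": 1},
    {"kras_mutated": 1, "late_stage": 0, "age_over_65": 0},
    {"kras_mutated": 0, "late_stage": 0, "age_over_65": 1},
]
labels = [1, 1, 1, 1, 0, 0, 0, 0]  # 1 = recurrence, 0 = no recurrence (made up)

ranking = rank_predictors(rows, labels)
for feature, gain in ranking:
    print(f"{feature}: impurity reduction = {gain:.3f}")
```

A full decision tree repeats this split selection recursively on each branch, so the features chosen near the root are the ones a clinician can read off as the strongest predictors. Neural networks and support vector machines do not expose such a ranking directly, which is consistent with the interpretability advantage the abstract attributes to tree models.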

Published in Computational and Structural Biotechnology Journal

ISSN: 2001-0370 (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Technology: Chemical technology: Biotechnology
Website: https://www.journals.elsevier.com/computational-and-structural-biotechnology-journal
