Prediction of Radiation Pneumonitis With Machine Learning in Stage III Lung Cancer: A Pilot Study

Melek Yakar MD; Durmus Etiz MD; Muzaffer Metintas MD; Guntulu Ak; Ozer Celik PhD

doi:10.1177/15330338211016373

Technology in Cancer Research & Treatment (May 2021)

Prediction of Radiation Pneumonitis With Machine Learning in Stage III Lung Cancer: A Pilot Study

Melek Yakar MD,
Durmus Etiz MD,
Muzaffer Metintas MD,
Guntulu Ak,
Ozer Celik PhD

Affiliations

Melek Yakar MD: Eskisehir Osmangazi University Center of Research and Application for Computer Aided Diagnosis and Treatment in Health, Eskisehir, Turkey
Durmus Etiz MD: Eskisehir Osmangazi University Center of Research and Application for Computer Aided Diagnosis and Treatment in Health, Eskisehir, Turkey
Muzaffer Metintas MD: Department of Chest Diseases, Medical Faculty of Osmangazi University, Eskişehir, Turkey
Guntulu Ak: Department of Chest Diseases, Medical Faculty of Osmangazi University, Eskişehir, Turkey
Ozer Celik PhD: Department of Mathematics-Computer, Eskisehir Osmangazi University, Eskişehir, Turkey

DOI: https://doi.org/10.1177/15330338211016373
Journal volume & issue: Vol. 20

Abstract

Read online

Background: Radiation pneumonitis (RP) is a dose-limiting toxicity in lung cancer radiotherapy (RT). As risk factors in the development of RP, patient and tumor characteristics, dosimetric parameters, and treatment features are intertwined, and it is not always possible to associate RP with a single parameter. This study aimed to determine the algorithm that most accurately predicted RP development with machine learning. Methods: Of the 197 cases diagnosed with stage III lung cancer and underwent RT and chemotherapy between 2014 and 2020, 193 were evaluated. The CTCAE 5.0 grading system was used for the RP evaluation. Synthetic minority oversampling technique was used to create a balanced data set. Logistic regression, artificial neural networks, eXtreme Gradient Boosting (XGB), Support Vector Machines, Random Forest, Gaussian Naive Bayes and Light Gradient Boosting Machine algorithms were used. After the correlation analysis, a permutation-based method was utilized for as a variable selection. Results: RP was seen in 51 of the 193 cases. Parameters affecting RP were determined as, total(t)V5, ipsilateral lung D max , contralateral lung D max , total lung D max , gross tumor volume, number of chemotherapy cycles before RT, tumor size, lymph node localization and asbestos exposure. LGBM was found to be the algorithm that best predicted RP at 85% accuracy (confidence interval: 0.73-0.96), 97% sensitivity, and 50% specificity. Conclusion: When the clinical and dosimetric parameters were evaluated together, the LGBM algorithm had the highest accuracy in predicting RP. However, in order to use this algorithm in clinical practice, it is necessary to increase data diversity and the number of patients by sharing data between centers.

Published in Technology in Cancer Research & Treatment

ISSN: 1533-0346 (Print); 1533-0338 (Online)
Publisher: SAGE Publishing
Country of publisher: United States
LCC subjects: Medicine: Internal medicine: Neoplasms. Tumors. Oncology. Including cancer and carcinogens
Website: https://journals.sagepub.com/home/tct

About the journal