Radiation Oncology (Jun 2024)

Development of a risk prediction model for radiation dermatitis following proton radiotherapy in head and neck cancer using ensemble machine learning

  • Tsair-Fwu Lee,
  • Yen-Hsien Liu,
  • Chu-Ho Chang,
  • Chien-Liang Chiu,
  • Chih-Hsueh Lin,
  • Jen-Chung Shao,
  • Yu-Cheng Yen,
  • Guang-Zhi Lin,
  • Jack Yang,
  • Chin-Dar Tseng,
  • Fu-Min Fang,
  • Pei-Ju Chao,
  • Shen-Hao Lee

DOI
https://doi.org/10.1186/s13014-024-02470-1
Journal volume & issue
Vol. 19, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Purpose This study aims to develop an ensemble machine learning-based (EML-based) risk prediction model for radiation dermatitis (RD) in patients with head and neck cancer undergoing proton radiotherapy, with the goal of achieving superior predictive performance compared to traditional models. Materials and methods Data from 57 head and neck cancer patients treated with intensity-modulated proton therapy at Kaohsiung Chang Gung Memorial Hospital were analyzed. The study incorporated 11 clinical and 9 dosimetric parameters. Pearson’s correlation was used to eliminate highly correlated variables, followed by feature selection via LASSO to focus on potential RD predictors. Model training involved traditional logistic regression (LR) and advanced ensemble methods such as Random Forest and XGBoost, which were optimized through hyperparameter tuning. Results Feature selection identified six key predictors, including smoking history and specific dosimetric parameters. Ensemble machine learning models, particularly XGBoost, demonstrated superior performance, achieving the highest AUC of 0.890. Feature importance was assessed using SHAP (SHapley Additive exPlanations) values, which underscored the relevance of various clinical and dosimetric factors in predicting RD. Conclusion The study confirms that EML methods, especially XGBoost with its boosting algorithm, provide superior predictive accuracy, enhanced feature selection, and improved data handling compared to traditional LR. While LR offers greater interpretability, the precision and broader applicability of EML make it more suitable for complex medical prediction tasks, such as predicting radiation dermatitis. Given these advantages, EML is highly recommended for further research and application in clinical settings.

Keywords