Cancer Management and Research (Sep 2024)

Machine Learning Prediction of Residual and Recurrent High-Grade CIN Post-LEEP

  • Zhai F,
  • Mu S,
  • Song Y,
  • Zhang M,
  • Zhang C,
  • Lv Z

Journal volume & issue
Vol. Volume 16
pp. 1175 – 1187

Abstract

Read online

Furui Zhai, Shanshan Mu, Yinghui Song, Min Zhang, Cui Zhang, Ze Lv Gynecological Clinic, Cangzhou Central Hospital, Cangzhou City, Hebei Province, People’s Republic of ChinaCorrespondence: Furui Zhai, Department of Gynecological Clinic, Cangzhou Central Hospital, 16 Xinhua West Road, Cangzhou City, Hebei Province, People’s Republic of China, Tel +86-0317-2075783, Email [email protected]: This study aims to develop a machine learning (ML) model to predict the risk of residual or recurrent high-grade cervical intraepithelial neoplasia (CIN) after loop electrosurgical excision procedure (LEEP), addressing a critical gap in personalized follow-up care.Methods: A retrospective analysis of 532 patients who underwent LEEP for high-grade CIN at Cangzhou Central Hospital (2016– 2020) was conducted. In the final analysis, 99 women (18.6%) were found to have residual or recurrent high-grade CIN (CIN2 or worse) within five years of follow-up. Four feature selection methods identified significant predictors of residual or recurrent CIN. Eight ML algorithms were evaluated using performance metrics such as AUROC, accuracy, sensitivity, specificity, PPV, NPV, F1 score, calibration curve, and decision curve analysis. Fivefold cross-validation optimized and validated the model, and SHAP analysis assessed feature importance.Results: The XGBoost algorithm demonstrated the highest predictive performance with the best AUROC. The optimized model included six key predictors: age, ThinPrep cytologic test (TCT) results, HPV classification, CIN severity, glandular involvement, and margin status. SHAP analysis identified CIN severity and margin status as the most influential predictors. An online prediction tool was developed for real-time risk assessment.Conclusion: This ML-based predictive model for post-LEEP high-grade CIN provides a significant advancement in gynecologic oncology, enhancing personalized patient care and facilitating early intervention and informed clinical decision-making.Keywords: cervical intraepithelial neoplasia, loop electrosurgical excision procedure, residual or recurrent, machine learning, predictive modeling

Keywords