Frontiers in Oncology (Jun 2024)

Development and validation of AI models using LR and LightGBM for predicting distant metastasis in breast cancer: a dual-center study

  • Wen-hai Zhang,
  • Yang Tan,
  • Zhen Huang,
  • Qi-xing Tan,
  • Yue-mei Zhang,
  • Bin-jie Chen,
  • Chang-yuan Wei

DOI
https://doi.org/10.3389/fonc.2024.1409273
Journal volume & issue
Vol. 14

Abstract

Read online

ObjectiveThis study aims to develop an artificial intelligence model utilizing clinical blood markers, ultrasound data, and breast biopsy pathological information to predict the distant metastasis in breast cancer patients.MethodsData from two medical centers were utilized, Clinical blood markers, ultrasound data, and breast biopsy pathological information were separately extracted and selected. Feature dimensionality reduction was performed using Spearman correlation and LASSO regression. Predictive models were constructed using LR and LightGBM machine learning algorithms and validated on internal and external validation sets. Feature correlation analysis was conducted for both models.ResultsThe LR model achieved AUC values of 0.892, 0.816, and 0.817 for the training, internal validation, and external validation cohorts, respectively. The LightGBM model achieved AUC values of 0.971, 0.861, and 0.890 for the same cohorts, respectively. Clinical decision curve analysis showed a superior net benefit of the LightGBM model over the LR model in predicting distant metastasis in breast cancer. Key features identified included creatine kinase isoenzyme (CK-MB) and alpha-hydroxybutyrate dehydrogenase.ConclusionThis study developed an artificial intelligence model using clinical blood markers, ultrasound data, and pathological information to identify distant metastasis in breast cancer patients. The LightGBM model demonstrated superior predictive accuracy and clinical applicability, suggesting it as a promising tool for early diagnosis of distant metastasis in breast cancer.

Keywords