BMC Medical Informatics and Decision Making (Sep 2024)
Predictive model of prognosis index for invasive micropapillary carcinoma of the breast based on machine learning: a SEER population-based study
Abstract
Abstract Background Invasive micropapillary carcinoma (IMPC) is a rare subtype of breast cancer. Its epidemiological features, treatment principles, and prognostic factors remain controversial. Objective This study aimed to develop an improved machine learning-based model to predict the prognosis of patients with invasive micropapillary carcinoma. Methods A total of 1123 patients diagnosed with IMPC after surgery between 1998 and 2019 were identified from the Surveillance, Epidemiology, and End Results (SEER) database for survival analysis. Univariate and multivariate analyses were performed to explore independent prognostic factors for the overall and disease-specific survival of patients with IMPC. Five machine learning algorithms were developed to predict the 5-year survival of these patients. Results Cox regression analysis indicated that patients aged > 65 years had a significantly worse prognosis than those younger in age, while unmarried patients had a better prognosis than married patients. Patients diagnosed between 2001 and 2005 had a significant risk reduction of mortality compared with other periods. The XGBoost model outperformed the other models with a precision of 0.818 and an area under the curve of 0.863. Conclusions A machine learning model for IMPC in patients with breast cancer was developed to estimate the 5-year OS. The XGBoost model had a promising performance and can help clinicians determine the early prognosis of patients with IMPC; therefore, the model can improve clinical outcomes by influencing management strategies and patient health care decisions.
Keywords