Bioengineering (Sep 2024)

Machine Learning Algorithm for Predicting Distant Metastasis of T1 and T2 Gallbladder Cancer Based on SEER Database

  • Zhentian Guo,
  • Zongming Zhang,
  • Limin Liu,
  • Yue Zhao,
  • Zhuo Liu,
  • Chong Zhang,
  • Hui Qi,
  • Jinqiu Feng,
  • Peijie Yao,
  • Haiming Yuan

DOI
https://doi.org/10.3390/bioengineering11090927
Journal volume & issue
Vol. 11, no. 9
p. 927

Abstract

Read online

(1) Background: This study seeks to employ a machine learning (ML) algorithm to forecast the risk of distant metastasis (DM) in patients with T1 and T2 gallbladder cancer (GBC); (2) Methods: Data of patients diagnosed with T1 and T2 GBC was obtained from SEER, encompassing the period from 2004 to 2015, were utilized to apply seven ML algorithms. These algorithms were appraised by the area under the receiver operating characteristic curve (AUC) and other metrics; (3) Results: This study involved 4371 patients in total. Out of these patients, 764 (17.4%) cases progressed to develop DM. Utilizing a logistic regression (LR) model to identify independent risk factors for DM of gallbladder cancer (GBC). A nomogram has been developed to forecast DM in early T-stage gallbladder cancer patients. Through the evaluation of different models using relevant indicators, it was discovered that Random Forest (RF) exhibited the most outstanding predictive performance; (4) Conclusions: RF has demonstrated high accuracy in predicting DM in gallbladder cancer patients, assisting clinical physicians in enhancing the accuracy of diagnosis. This can be particularly valuable for improving patient outcomes and optimizing treatment strategies. We employ the RF algorithm to construct the corresponding web calculator.

Keywords