Machine learning for predicting the risk stratification of 1–5 cm gastric gastrointestinal stromal tumors based on CT

Cui Zhang; Jian Wang; Yang Yang; Bailing Dai; Zhihua Xu; Fangmei Zhu; Huajun Yu

doi:10.1186/s12880-023-01053-y

BMC Medical Imaging (Jul 2023)

Machine learning for predicting the risk stratification of 1–5 cm gastric gastrointestinal stromal tumors based on CT

Cui Zhang,
Jian Wang,
Yang Yang,
Bailing Dai,
Zhihua Xu,
Fangmei Zhu,
Huajun Yu

Affiliations

Cui Zhang: Department of Radiology, TongDe Hospital of Zhejiang Province
Jian Wang: Department of Radiology, TongDe Hospital of Zhejiang Province
Yang Yang: Department of Radiology, The First Affiliated Hospital of Bengbu Medical College
Bailing Dai: Department of Radiology, TongDe Hospital of Zhejiang Province
Zhihua Xu: Department of Radiology, TongDe Hospital of Zhejiang Province
Fangmei Zhu: Department of Radiology, TongDe Hospital of Zhejiang Province
Huajun Yu: Department of Radiology, Zhejiang Hospital

DOI: https://doi.org/10.1186/s12880-023-01053-y
Journal volume & issue: Vol. 23, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Backgroud To predict the malignancy of 1–5 cm gastric gastrointestinal stromal tumors (GISTs) by machine learning (ML) on CT images using three models - Logistic Regression (LR), Decision Tree (DT) and Gradient Boosting Decision Tree (GBDT). Methods 231 patients from Center 1 were randomly assigned into the training cohort (n = 161) and the internal validation cohort (n = 70) in a 7:3 ratio. The other 78 patients from Center 2 served as the external test cohort. Scikit-learn software was used to build three classifiers. The performance of the three models were evaluated by sensitivity, specificity, accuracy, positive predictive value (PPV), negative predictive value (NPV) and area under the curve (AUC). Diagnostic differences between ML models and radiologists were compared in the external test cohort. Important features of LR and GBDT were analyzed and compared. Results GBDT outperformed LR and DT with the largest AUC values (0.981 and 0.815) in the training and internal validation cohorts and the greatest accuracy (0.923, 0.833 and 0.844) across all three cohorts. However, LR was found to have the largest AUC value (0.910) in the external test cohort. DT yielded the worst accuracy (0.790 and 0.727) and AUC values (0.803 and 0.700) in both the internal validation cohort and the external test cohort. GBDT and LR performed better than radiologists. Long diameter was demonstrated to be the same and most important CT feature for GBDT and LR. Conclusions ML classifiers, especially GBDT and LR with high accuracy and strong robustness, were considered to be promising in risk classification of 1–5 cm gastric GISTs based on CT. Long diameter was found the most important feature for risk stratification.

Published in BMC Medical Imaging

ISSN: 1471-2342 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Medical technology
Website: http://bmcmedimaging.biomedcentral.com

About the journal

Abstract

Keywords