Bioengineering (Sep 2024)
Application of Machine Learning in the Diagnosis of Early Gastric Cancer Using the Kyoto Classification Score and Clinical Features Collected from Medical Consultations
Abstract
The early detection accuracy of early gastric cancer (EGC) determines the choice of the optimal treatment strategy and the related medical expenses. We aimed to develop a simple, affordable, and time-saving diagnostic model using six machine learning (ML) algorithms for the diagnosis of EGC. It is based on the endoscopy-based Kyoto classification score obtained after the completion of endoscopy and other clinical features obtained after medical consultation. We retrospectively evaluated 1999 patients who underwent gastrointestinal endoscopy at the China Beijing Hospital. Of these, 203 subjects were diagnosed with EGC. The data were randomly divided into training and test sets (ratio 4:1). We constructed six ML models, and the developed models were evaluated on the testing set. This procedure was repeated five times. The Kolmogorov–Arnold Networks (KANs) model achieved the best performance (mean AUC value: 0.76; mean balanced accuracy: 70.96%; mean precision: 58.91%; mean recall: 70.96%; mean false positive rate: 26.11%; mean false negative rate: 31.96%; and mean F1 score value: 58.46). The endoscopy-based Kyoto classification score was the most important feature with the highest feature importance score. The results suggest that the KAN model, the optimal ML model in this study, has the potential to identify EGC patients, which may result in a reduction in both the time cost and medical expenses in clinical practice.
Keywords