Cancer Medicine (Apr 2024)

Construction and validation of artificial intelligence pathomics models for predicting pathological staging in colorectal cancer: Using multimodal data and clinical variables

  • Yang Tan,
  • Run Liu,
  • Jia‐wen Xue,
  • Zhenbo Feng

DOI
https://doi.org/10.1002/cam4.6947
Journal volume & issue
Vol. 13, no. 7
pp. n/a – n/a

Abstract

Read online

Abstract Objective This retrospective observational study aims to develop and validate artificial intelligence (AI) pathomics models based on pathological Hematoxylin–Eosin (HE) slides and pathological immunohistochemistry (Ki67) slides for predicting the pathological staging of colorectal cancer. The goal is to enable AI‐assisted accurate pathological staging, supporting healthcare professionals in making efficient and precise staging assessments. Methods This study included a total of 267 colorectal cancer patients (training cohort: n = 213; testing cohort: n = 54). Logistic regression algorithms were used to construct the models. The HE image features were used to build the HE model, the Ki67 image features were used for the Ki67 model, and the combined model included features from both the HE and Ki67 images, as well as tumor markers (CEA, CA724, CA125, and CA242). The predictive results of the HE model, Ki67 model, and tumor markers were visualized through a nomogram. The models were evaluated using ROC curve analysis, and their clinical value was estimated using decision curve analysis (DCA). Results A total of 260 deep learning features were extracted from HE or Ki67 images. The AUC for the HE model and Ki67 model in the training cohort was 0.885 and 0.890, and in the testing cohort, it was 0.703 and 0.767, respectively. The combined model and nomogram in the training cohort had AUC values of 0.907 and 0.926, and in the testing cohort, they had AUC values of 0.814 and 0.817. In clinical DCA, the net benefit of the Ki67 model was superior to the HE model. The combined model and nomogram showed significantly higher net benefits compared to the individual HE model or Ki67 model. Conclusion The combined model and nomogram, which integrate pathomics multi‐modal data and clinical‐pathological variables, demonstrated superior performance in distinguishing between Stage I–II and Stage III colorectal cancer. This provides valuable support for clinical decision‐making and may improve treatment strategies and patient prognosis. Furthermore, the use of immunohistochemistry (Ki67) slides for pathomics modeling outperformed HE slide, offering new insights for future pathomics research.

Keywords