Clinical actionability of triaging DNA mismatch repair deficient colorectal cancer from biopsy samples using deep learning
Wu Jiang,
Wei-Jian Mei,
Shuo-Yu Xu,
Yi-Hong Ling,
Wei-Rong Li,
Jin-Bo Kuang,
Hao-Sen Li,
Hui Hui,
Ji-Bin Li,
Mu-Yan Cai,
Zhi-Zhong Pan,
Hui-Zhong Zhang,
Li Li,
Pei-Rong Ding
Affiliations
Wu Jiang
Sun Yat-sen University Cancer Center; State Key Laboratory of Oncology in South China; Collaborative Innovation Center for Cancer Medicine, Guangzhou, 510060, PR China; Department of Colorectal Surgery, Sun Yat-sen University Cancer Center, Guangzhou, PR China
Wei-Jian Mei
Sun Yat-sen University Cancer Center; State Key Laboratory of Oncology in South China; Collaborative Innovation Center for Cancer Medicine, Guangzhou, 510060, PR China; Department of Colorectal Surgery, Sun Yat-sen University Cancer Center, Guangzhou, PR China
Shuo-Yu Xu
Bio-totem Pte Ltd, Foshan, PR China; Department of General Surgery, Nanfang Hospital, Southern Medical University, Guangzhou, PR China
Yi-Hong Ling
Sun Yat-sen University Cancer Center; State Key Laboratory of Oncology in South China; Collaborative Innovation Center for Cancer Medicine, Guangzhou, 510060, PR China; Department of Pathology, Sun Yat-sen University Cancer Center, Guangzhou, PR China
Wei-Rong Li
Department of General Surgery, Guangzhou First People's Hospital, Guangzhou, PR China
Jin-Bo Kuang
Bio-totem Pte Ltd, Foshan, PR China
Hao-Sen Li
Bio-totem Pte Ltd, Foshan, PR China
Hui Hui
Bio-totem Pte Ltd, Foshan, PR China
Ji-Bin Li
Sun Yat-sen University Cancer Center; State Key Laboratory of Oncology in South China; Collaborative Innovation Center for Cancer Medicine, Guangzhou, 510060, PR China; Department of Clinical Research, Sun Yat-sen University Cancer Center, Guangzhou, PR China
Mu-Yan Cai
Sun Yat-sen University Cancer Center; State Key Laboratory of Oncology in South China; Collaborative Innovation Center for Cancer Medicine, Guangzhou, 510060, PR China; Department of Pathology, Sun Yat-sen University Cancer Center, Guangzhou, PR China
Zhi-Zhong Pan
Sun Yat-sen University Cancer Center; State Key Laboratory of Oncology in South China; Collaborative Innovation Center for Cancer Medicine, Guangzhou, 510060, PR China; Department of Colorectal Surgery, Sun Yat-sen University Cancer Center, Guangzhou, PR China
Hui-Zhong Zhang
Sun Yat-sen University Cancer Center; State Key Laboratory of Oncology in South China; Collaborative Innovation Center for Cancer Medicine, Guangzhou, 510060, PR China; Department of Pathology, Sun Yat-sen University Cancer Center, Guangzhou, PR China; Corresponding author at: Department of Pathology, Sun Yat-sen University Cancer Center, 651 Dongfeng East Road, Guangzhou 510060, PR China.
Li Li
Sun Yat-sen University Cancer Center; State Key Laboratory of Oncology in South China; Collaborative Innovation Center for Cancer Medicine, Guangzhou, 510060, PR China; Department of Medical Imaging, Sun Yat-sen University Cancer Center, Guangzhou, PR China; Corresponding author at: Department of Medical Imaging, Sun Yat-sen University Cancer Center, 651 Dongfeng East Road, Guangzhou 510060, PR China.
Pei-Rong Ding
Sun Yat-sen University Cancer Center; State Key Laboratory of Oncology in South China; Collaborative Innovation Center for Cancer Medicine, Guangzhou, 510060, PR China; Department of Colorectal Surgery, Sun Yat-sen University Cancer Center, Guangzhou, PR China; Corresponding author at: Department of Colorectal Surgery, Sun Yat-sen University Cancer Center, 651 Dongfeng East Road, Guangzhou 510060, PR China.
Summary: Background: We aimed to develop a deep learning (DL) model to predict DNA mismatch repair (MMR) status in colorectal cancers (CRC) based on hematoxylin and eosin-stained whole-slide images (WSIs) and assess its clinical applicability. Methods: The DL model was developed and validated through three-fold cross validation using 441 WSIs from the Cancer Genome Atlas (TCGA) and externally validated using 78 WSIs from the Pathology AI Platform (PAIP), and 355 WSIs from surgical specimens and 341 WSIs from biopsy specimens of the Sun Yet-sun University Cancer Center (SYSUCC). Domain adaption and multiple instance learning (MIL) techniques were adopted for model development. The performance of the models was evaluated using the area under the receiver operating characteristic curve (AUROC). A dual-threshold strategy was also built from the surgical cohorts and validated in the biopsy cohort. Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), F1-score, and the percentage of patients avoiding IHC testing were evaluated. Findings: The MIL model achieved an AUROC of 0·8888±0·0357 in the TCGA-validation cohort, 0·8806±0·0232 in the PAIP cohort, 0·8457±0·0233 in the SYSUCC-surgical cohort, and 0·7679±0·0342 in the SYSUCC-biopsy cohort. A dual-threshold triage strategy was used to rule-in and rule-out dMMR patients with remaining uncertain patients recommended for further IHC testing, which kept sensitivity higher than 90% and specificity higher than 95% on deficient MMR patient triage from both the surgical and biopsy specimens, result in more than half of patients avoiding IHC based MMR testing. Interpretation: A DL-based method that could directly predict CRC MMR status from WSIs was successfully developed, and a dual-threshold triage strategy was established to minimize the number of patients for further IHC testing. Funding: The study was funded by the National Natural Science Foundation of China (82073159, 81871971 and 81700576), the Natural Science Foundation of Guangdong Province (No. 2021A1515011792 and No.2022A1515012403) and Medical Scientific Research Foundation of Guangdong Province of China (No. A2020392).