Cancer Management and Research (Jun 2022)

Machine Learning-Based Gray-Level Co-Occurrence Matrix (GLCM) Models for Predicting the Depth of Myometrial Invasion in Patients with Stage I Endometrial Cancer

  • Qin L,
  • Lai L,
  • Wang H,
  • Zhang Y,
  • Qian X,
  • He D

Journal volume & issue
Vol. Volume 14
pp. 2143 – 2154

Abstract

Read online

Li Qin,1 Lin Lai,2 Hongli Wang,3 Yukun Zhang,2 Xiaoyuan Qian4 *, Du He2 𪇞partment of Obstetrics and Gynecology, the Central Hospital of Enshi Tujia and Miao Autonomous Prefecture, Enshi, Hubei, 445000, People’s Republic of China; 2Department of Oncology, the Central Hospital of Enshi Tujia and Miao Autonomous Prefecture, Enshi, Hubei, 445000, People’s Republic of China; 3Department of Pathology, the Central Hospital of Enshi Tujia and Miao Autonomous Prefecture, Enshi, Hubei, 445000, People’s Republic of China; 4Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, 430079, People’s Republic of China*These authors contributed equally to this workCorrespondence: Du He, Department of Oncology, the Central Hospital of Enshi Tujia and Miao Autonomous Prefecture, Enshi, Hubei, 445000, People’s Republic of China, Email [email protected]: Deep myometrial invasion (DMI) is an independent high-risk factor for lymph node metastasis and a prognostic risk factor in early-stage endometrial cancer (EC-I) patients. Thus, we developed a machine learning (ML) assistant model, which can accurately help define the surgical area.Methods: 348 consecutive EC-I patients with the pathological diagnosis were recruited in the tertiary medical centre between January 1, 2012, and October 31, 2021. Five ML-assisted models were developed using two-step estimation methods from the candidate gray level co-occurrence matrix (GLCM). Receiver operating characteristic curve (ROC), decision curve analysis (DCA), and clinical impact curve (CIC) were prepared to evaluate the robustness and clinical practicality of each model.Results: Our analysis identified several significant differences between the stage IA and IB groups. The top seven-candidate factors included correlation all direction offset1, correlation angle0 offset1, correlation angle45 offset1, correlation angle90 offset1, ID moment all direction offset1, ID moment angle0 offset1, and ID moment angle45 offset1. The areas under the ROC curve (AUCs) of the random forest classifier (RFC) model, support vector machine (SVM), eXtreme gradient boosting (XGBoost), artificial neural network (ANN), and decision tree (DT) ranged from 0.765 to 0.877 in the training set and from 0.716 to 0.862 in the testing set, respectively. Among the five machine algorithms, RFC obtained the optimal prediction efficiency using correlation angle0 offset1, correlation angle45 offset1, correlation angle90 offset1, correlation all direction offset1, ID moment angle0 offset1, and ID moment angle45 offset1, and ID moment angle90 offset1, respectively.Conclusion: Our ML-based prediction model combined with GLCM parameters assessed the risk of DMI in EC-I patients, especially RFC, which helped distinguish stage IA and IB EC patients. This new predictive model based on supervised learning can be used to establish personalized treatment strategies.Keywords: endometrial cancer, myometrial invasion, gray level co-occurrence matrix, machine learning, prediction

Keywords