Automatic Detection and Classification of Modic Changes in MRI Images Using Deep Learning: Intelligent Assisted Diagnosis System

Gang Liu; Lei Wang; Sheng‐nan You; Zhi Wang; Shan Zhu; Chao Chen; Xin‐long Ma; Lei Yang; Shuai Zhang; Qiang Yang

doi:10.1111/os.13894

Orthopaedic Surgery (Jan 2024)

Automatic Detection and Classification of Modic Changes in MRI Images Using Deep Learning: Intelligent Assisted Diagnosis System

Gang Liu,
Lei Wang,
Sheng‐nan You,
Zhi Wang,
Shan Zhu,
Chao Chen,
Xin‐long Ma,
Lei Yang,
Shuai Zhang,
Qiang Yang

Affiliations

Gang Liu: Clinical School/College of Orthopaedics Tianjin Medical University Tianjin China
Lei Wang: Department of Spine Surgery, Tianjin Hospital Tianjin University Tianjin China
Sheng‐nan You: State Key Laboratory of Reliability and Intelligence of Electrical Equipment, School of Health Sciences & Biomedical Engineering Hebei University of Technology Tianjin China
Zhi Wang: Department of Spine Surgery, Tianjin Hospital Tianjin University Tianjin China
Shan Zhu: Department of Spine Surgery, Tianjin Hospital Tianjin University Tianjin China
Chao Chen: Department of Spine Surgery, Tianjin Hospital Tianjin University Tianjin China
Xin‐long Ma: Department of Spine Surgery, Tianjin Hospital Tianjin University Tianjin China
Lei Yang: State Key Laboratory of Reliability and Intelligence of Electrical Equipment, School of Health Sciences & Biomedical Engineering Hebei University of Technology Tianjin China
Shuai Zhang: State Key Laboratory of Reliability and Intelligence of Electrical Equipment, School of Health Sciences & Biomedical Engineering Hebei University of Technology Tianjin China
Qiang Yang: Department of Spine Surgery, Tianjin Hospital Tianjin University Tianjin China

DOI: https://doi.org/10.1111/os.13894
Journal volume & issue: Vol. 16, no. 1
pp. 196 – 206

Abstract

Read online

Objective Modic changes (MCs) are the most prevalent classification system for describing intravertebral MRI signal intensity changes. However, interpreting these intricate MRI images is a complex and time‐consuming process. This study investigates the performance of single shot multibox detector (SSD) and ResNet18 network‐based automatic detection and classification of MCs. Additionally, it compares the inter‐observer agreement and observer‐classifier agreement in MCs diagnosis to validate the feasibility of deep learning network‐assisted detection of classified MCs. Method A retrospective analysis of 140 patients with MCs who underwent MRI diagnosis and met the inclusion and exclusion criteria in Tianjin Hospital from June 2020 to June 2021 was used as the internal dataset. This group consisted of 55 males and 85 females, aged 25 to 89 years, with a mean age of (59.0 ± 13.7) years. An external test dataset of 28 patients, who met the same criteria and were assessed using different MRI equipment at Tianjin Hospital, was also gathered, including 11 males and 17 females, aged 31 to 84 years, with a mean age of 62.7 ± 10.9 years. After Physician 1 (with 15 years of experience) annotated all MRI images, the internal dataset was imported into the deep learning model for training. The model comprises an SSD network for lesion localization and a ResNet18 network for lesion classification. Performance metrics, including accuracy, recall, precision, F1 score, confusion matrix, and inter‐observer agreement parameter Kappa value, were used to evaluate the model's performance on the internal and external datasets. Physician 2 (with 1 year of experience) re‐labeled the internal and external test datasets to compare the inter‐observer agreement and observer‐classifier agreement. Results In the internal dataset, when models were utilized for the detection and classification of MCs, the accuracy, recall, precision and F1 score reached 86.25%, 87.77%, 84.92% and 85.60%, respectively. The Kappa value of the inter‐observer agreement was 0.768 (95% CI: 0.656, 0.847),while observer‐classifier agreement was 0.717 (95% CI: 0.589, 0.809).In the external test dataset, the model's the accuracy, recall, precision and F1 scores for diagnosing MCs reached 75%, 77.08%, 77.80% and 74.97%, respectively. The inter‐observer agreement was 0.681 (95% CI: 0.512, 0.677), and observer‐classifier agreement was 0.519 (95% CI: 0.290, 0.690). Conclusion The model demonstrated strong performance in detecting and classifying MCs, achieving high agreement with physicians in MCs diagnosis. These results suggest that deep learning models have the potential to facilitate the application of intelligent assisted diagnosis techniques in the field of spine research.

Published in Orthopaedic Surgery

ISSN: 1757-7853 (Print); 1757-7861 (Online)
Publisher: Wiley
Country of publisher: Australia
LCC subjects: Medicine: Surgery: Orthopedic surgery
Website: https://onlinelibrary.wiley.com/journal/17577861

About the journal

Abstract

Keywords