PLoS ONE (Jan 2022)

Improved diagnostic performance of plain radiography for cervical ossification of the posterior longitudinal ligament using deep learning.

  • Hee-Dong Chae,
  • Sung Hwan Hong,
  • Hyun Jung Yeoh,
  • Yeo Ryang Kang,
  • Su Min Lee,
  • Minyoung Kim,
  • Seok Young Koh,
  • Yongeun Lee,
  • Moo Sung Park,
  • Ja-Young Choi,
  • Hye Jin Yoo

DOI
https://doi.org/10.1371/journal.pone.0267643
Journal volume & issue
Vol. 17, no. 4
p. e0267643

Abstract

Read online

BackgroundA high false-negative rate has been reported for the diagnosis of ossification of the posterior longitudinal ligament (OPLL) using plain radiography. We investigated whether deep learning (DL) can improve the diagnostic performance of radiologists for cervical OPLL using plain radiographs.Materials and methodsThe training set consisted of 915 radiographs from 207 patients diagnosed with OPLL. For the test set, we used 200 lateral cervical radiographs from 100 patients with cervical OPLL and 100 patients without OPLL. An observer performance study was conducted over two reading sessions. In the first session, we compared the diagnostic performance of the DL-model and the six observers. The diagnostic performance was evaluated using the area under the receiver operating characteristic curve (AUC) at the vertebra and patient level. The sensitivity and specificity of the DL model and average observers were calculated in per-patient analysis. Subgroup analysis was performed according to the morphologic classification of OPLL. In the second session, observers evaluated the radiographs by referring to the results of the DL-model.ResultsIn the vertebra-level analysis, the DL-model showed an AUC of 0.854, which was higher than the average AUC of observers (0.826), but the difference was not significant (p = 0.292). In the patient-level analysis, the performance of the DL-model had an AUC of 0.851, and the average AUC of observers was 0.841 (p = 0.739). The patient-level sensitivity and specificity were 91% and 69% in the DL model, and 83% and 68% for the average observers, respectively. Both the DL-model and observers showed decreases in overall performance in the segmental and circumscribed types. With knowledge of the results of the DL-model, the average AUC of observers increased to 0.893 (p = 0.001) at the vertebra level and 0.911 (p ConclusionsThe DL-based OPLL detection model can significantly improve the diagnostic performance of radiologists on cervical radiographs.