Bone & Joint Open (Nov 2022)

Artificial intelligence-generated hip radiological measurements are fast and adequate for reliable assessment of hip dysplasia <subtitle>an external validation study</subtitle>

  • Holden Archer,
  • Seth Reine,
  • Ahmed Alshaikhsalama,
  • Joel Wells,
  • Ajay Kohli,
  • Louis Vazquez,
  • Allan Hummer,
  • Matthew D. DiFranco,
  • Richard Ljuhar,
  • Yin Xi,
  • Avneesh Chhabra

DOI
https://doi.org/10.1302/2633-1462.311.BJO-2022-0125.R1
Journal volume & issue
Vol. 3, no. 11
pp. 877 – 884

Abstract

Read online

AimsHip dysplasia (HD) leads to premature osteoarthritis. Timely detection and correction of HD has been shown to improve pain, functional status, and hip longevity. Several time-consuming radiological measurements are currently used to confirm HD. An artificial intelligence (AI) software named HIPPO automatically locates anatomical landmarks on anteroposterior pelvis radiographs and performs the needed measurements. The primary aim of this study was to assess the reliability of this tool as compared to multi-reader evaluation in clinically proven cases of adult HD. The secondary aims were to assess the time savings achieved and evaluate inter-reader assessment.MethodsA consecutive preoperative sample of 130 HD patients (256 hips) was used. This cohort included 82.3% females (n = 107) and 17.7% males (n = 23) with median patient age of 28.6 years (interquartile range (IQR) 22.5 to 37.2). Three trained readers’ measurements were compared to AI outputs of lateral centre-edge angle (LCEA), caput-collum-diaphyseal (CCD) angle, pelvic obliquity, Tönnis angle, Sharp’s angle, and femoral head coverage. Intraclass correlation coefficients (ICC) and Bland-Altman analyses were obtained.ResultsAmong 256 hips with AI outputs, all six hip AI measurements were successfully obtained. The AI-reader correlations were generally good (ICC 0.60 to 0.74) to excellent (ICC > 0.75). There was lower agreement for CCD angle measurement. Most widely used measurements for HD diagnosis (LCEA and Tönnis angle) demonstrated good to excellent inter-method reliability (ICC 0.71 to 0.86 and 0.82 to 0.90, respectively). The median reading time for the three readers and AI was 212 (IQR 197 to 230), 131 (IQR 126 to 147), 734 (IQR 690 to 786), and 41 (IQR 38 to 44) seconds, respectively.ConclusionThis study showed that AI-based software demonstrated reliable radiological assessment of patients with HD with significant interpretation-related time savings.Cite this article: Bone Jt Open 2022;3(11):877–884.

Keywords