Indian Journal of Ophthalmology (Jul 2024)

Evaluation of an AI algorithm trained on an ethnically diverse dataset to screen a previously unseen population for diabetic retinopathy

  • Divya P Rao,
  • Florian M Savoy,
  • Anand Sivaraman,
  • Sreetama Dutt,
  • Marianne Shahsuvaryan,
  • Nairuhi Jrbashyan,
  • Narine Hambardzumyan,
  • Nune Yeghiazaryan,
  • Taraprasad Das

DOI
https://doi.org/10.4103/IJO.IJO_2151_23
Journal volume & issue
Vol. 72, no. 8
pp. 1162 – 1167

Abstract

Read online

Purpose: This study aimed to determine the generalizability of an artificial intelligence (AI) algorithm trained on an ethnically diverse dataset to screen for referable diabetic retinopathy (RDR) in the Armenian population unseen during AI development. Methods: This study comprised 550 patients with diabetes mellitus visiting the polyclinics of Armenia over 10 months requiring diabetic retinopathy (DR) screening. The Medios AI-DR algorithm was developed using a robust, diverse, ethnically balanced dataset with no inherent bias and deployed offline on a smartphone-based fundus camera. The algorithm here analyzed the retinal images captured using the target device for the presence of RDR (i.e., moderate non-proliferative diabetic retinopathy (NPDR) and/or clinically significant diabetic macular edema (CSDME) or more severe disease) and sight-threatening DR (STDR, i.e., severe NPDR and/or CSDME or more severe disease). The results compared the AI output to a consensus or majority image grading of three expert graders according to the International Clinical Diabetic Retinopathy severity scale. Results: On 478 subjects included in the analysis, the algorithm achieved a high classification sensitivity of 95.30% (95% CI: 91.9%–98.7%) and a specificity of 83.89% (95% CI: 79.9%–87.9%) for the detection of RDR. The sensitivity for STDR detection was 100%. Conclusion: The study proved that Medios AI-DR algorithm yields good accuracy in screening for RDR in the Armenian population. In our literature search, this is the only smartphone-based, offline AI model validated in different populations.

Keywords