The Lancet: Digital Health (Aug 2019)

Detection of glaucomatous optic neuropathy with spectral-domain optical coherence tomography: a retrospective training and validation deep-learning analysis

  • An Ran Ran, MD,
  • Carol Y Cheung, PhD,
  • Xi Wang, ME,
  • Hao Chen, PhD,
  • Lu-yang Luo, BE,
  • Poemen P Chan, FRCS,
  • Mandy O M Wong, FRCS,
  • Robert T Chang, MD,
  • Suria S Mannil, MD,
  • Alvin L Young, ProfFRCS,
  • Hon-wah Yung, FRCS,
  • Chi Pui Pang, ProfDPhil,
  • Pheng-Ann Heng, ProfPhD,
  • Clement C Tham, ProfFCOphthHK

Journal volume & issue
Vol. 1, no. 4
pp. e172 – e182

Abstract

Read online

Summary: Background: Spectral-domain optical coherence tomography (SDOCT) can be used to detect glaucomatous optic neuropathy, but human expertise in interpretation of SDOCT is limited. We aimed to develop and validate a three-dimensional (3D) deep-learning system using SDOCT volumes to detect glaucomatous optic neuropathy. Methods: We retrospectively collected a dataset including 4877 SDOCT volumes of optic disc cube for training (60%), testing (20%), and primary validation (20%) from electronic medical and research records at the Chinese University of Hong Kong Eye Centre (Hong Kong, China) and the Hong Kong Eye Hospital (Hong Kong, China). Residual network was used to build the 3D deep-learning system. Three independent datasets (two from Hong Kong and one from Stanford, CA, USA), including 546, 267, and 1231 SDOCT volumes, respectively, were used for external validation of the deep-learning system. Volumes were labelled as having or not having glaucomatous optic neuropathy according to the criteria of retinal nerve fibre layer thinning on reliable SDOCT images with position-correlated visual field defect. Heatmaps were generated for qualitative assessments. Findings: 6921 SDOCT volumes from 1 384 200 two-dimensional cross-sectional scans were studied. The 3D deep-learning system had an area under the receiver operation characteristics curve (AUROC) of 0·969 (95% CI 0·960–0·976), sensitivity of 89% (95% CI 83–93), specificity of 96% (92–99), and accuracy of 91% (89–93) in the primary validation, outperforming a two-dimensional deep-learning system that was trained on en face fundus images (AUROC 0·921 [0·905–0·937]; p<0·0001). The 3D deep-learning system performed similarly in the external validation datasets, with AUROCs of 0·893–0·897, sensitivities of 78–90%, specificities of 79–86%, and accuracies of 80–86%. The heatmaps of glaucomatous optic neuropathy showed that the learned features by the 3D deep-learning system used for detection of glaucomatous optic neuropathy were similar to those used by clinicians. Interpretation: The proposed 3D deep-learning system performed well in detection of glaucomatous optic neuropathy in both primary and external validations. Further prospective studies are needed to estimate the incremental cost-effectiveness of incorporation of an artificial intelligence-based model for glaucoma screening. Funding: Hong Kong Research Grants Council.