Technical and imaging factors influencing performance of deep learning systems for diabetic retinopathy

Michelle Y. T. Yip; Gilbert Lim; Zhan Wei Lim; Quang D. Nguyen; Crystal C. Y. Chong; Marco Yu; Valentina Bellemo; Yuchen Xie; Xin Qi Lee; Haslina Hamzah; Jinyi Ho; Tien-En Tan; Charumathi Sabanayagam; Andrzej Grzybowski; Gavin S. W. Tan; Wynne Hsu; Mong Li Lee; Tien Yin Wong; Daniel S. W. Ting

doi:10.1038/s41746-020-0247-1

npj Digital Medicine (Mar 2020)

Technical and imaging factors influencing performance of deep learning systems for diabetic retinopathy

Michelle Y. T. Yip,
Gilbert Lim,
Zhan Wei Lim,
Quang D. Nguyen,
Crystal C. Y. Chong,
Marco Yu,
Valentina Bellemo,
Yuchen Xie,
Xin Qi Lee,
Haslina Hamzah,
Jinyi Ho,
Tien-En Tan,
Charumathi Sabanayagam,
Andrzej Grzybowski,
Gavin S. W. Tan,
Wynne Hsu,
Mong Li Lee,
Tien Yin Wong,
Daniel S. W. Ting

Affiliations

Michelle Y. T. Yip: Singapore Eye Research Institute, Singapore National Eye Center
Gilbert Lim: Singapore Eye Research Institute, Singapore National Eye Center
Zhan Wei Lim: School of Computing, National University of Singapore
Quang D. Nguyen: Singapore Eye Research Institute, Singapore National Eye Center
Crystal C. Y. Chong: Singapore Eye Research Institute, Singapore National Eye Center
Marco Yu: Singapore Eye Research Institute, Singapore National Eye Center
Valentina Bellemo: Singapore Eye Research Institute, Singapore National Eye Center
Yuchen Xie: Singapore Eye Research Institute, Singapore National Eye Center
Xin Qi Lee: Singapore Eye Research Institute, Singapore National Eye Center
Haslina Hamzah: Singapore Eye Research Institute, Singapore National Eye Center
Jinyi Ho: Singapore Eye Research Institute, Singapore National Eye Center
Tien-En Tan: Singapore Eye Research Institute, Singapore National Eye Center
Charumathi Sabanayagam: Singapore Eye Research Institute, Singapore National Eye Center
Andrzej Grzybowski: Department of Ophthalmology, University of Warmia and Mazury
Gavin S. W. Tan: Singapore Eye Research Institute, Singapore National Eye Center
Wynne Hsu: School of Computing, National University of Singapore
Mong Li Lee: School of Computing, National University of Singapore
Tien Yin Wong: Singapore Eye Research Institute, Singapore National Eye Center
Daniel S. W. Ting: Singapore Eye Research Institute, Singapore National Eye Center

DOI: https://doi.org/10.1038/s41746-020-0247-1
Journal volume & issue: Vol. 3, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Deep learning (DL) has been shown to be effective in developing diabetic retinopathy (DR) algorithms, possibly tackling financial and manpower challenges hindering implementation of DR screening. However, our systematic review of the literature reveals few studies studied the impact of different factors on these DL algorithms, that are important for clinical deployment in real-world settings. Using 455,491 retinal images, we evaluated two technical and three image-related factors in detection of referable DR. For technical factors, the performances of four DL models (VGGNet, ResNet, DenseNet, Ensemble) and two computational frameworks (Caffe, TensorFlow) were evaluated while for image-related factors, we evaluated image compression levels (reducing image size, 350, 300, 250, 200, 150 KB), number of fields (7-field, 2-field, 1-field) and media clarity (pseudophakic vs phakic). In detection of referable DR, four DL models showed comparable diagnostic performance (AUC 0.936-0.944). To develop the VGGNet model, two computational frameworks had similar AUC (0.936). The DL performance dropped when image size decreased below 250 KB (AUC 0.936, 0.900, p < 0.001). The DL performance performed better when there were increased number of fields (dataset 1: 2-field vs 1-field—AUC 0.936 vs 0.908, p < 0.001; dataset 2: 7-field vs 2-field vs 1-field, AUC 0.949 vs 0.911 vs 0.895). DL performed better in the pseudophakic than phakic eyes (AUC 0.918 vs 0.833, p < 0.001). Various image-related factors play more significant roles than technical factors in determining the diagnostic performance, suggesting the importance of having robust training and testing datasets for DL training and deployment in the real-world settings.

Published in npj Digital Medicine

ISSN: 2398-6352 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.nature.com/npjdigitalmed/

About the journal