Contrastive learning-based pretraining improves representation and transferability of diabetic retinopathy classification models

Minhaj Nur Alam; Rikiya Yamashita; Vignav Ramesh; Tejas Prabhune; Jennifer I. Lim; R. V. P. Chan; Joelle Hallak; Theodore Leng; Daniel Rubin

doi:10.1038/s41598-023-33365-y

Scientific Reports (Apr 2023)

Contrastive learning-based pretraining improves representation and transferability of diabetic retinopathy classification models

Minhaj Nur Alam,
Rikiya Yamashita,
Vignav Ramesh,
Tejas Prabhune,
Jennifer I. Lim,
R. V. P. Chan,
Joelle Hallak,
Theodore Leng,
Daniel Rubin

Affiliations

Minhaj Nur Alam: Department of Biomedical Data Science, Stanford University School of Medicine
Rikiya Yamashita: Department of Biomedical Data Science, Stanford University School of Medicine
Vignav Ramesh: Department of Biomedical Data Science, Stanford University School of Medicine
Tejas Prabhune: Department of Biomedical Data Science, Stanford University School of Medicine
Jennifer I. Lim: Department of Ophthalmology and Visual Sciences, University of Illinois at Chicago
R. V. P. Chan: Department of Ophthalmology and Visual Sciences, University of Illinois at Chicago
Joelle Hallak: Department of Ophthalmology and Visual Sciences, University of Illinois at Chicago
Theodore Leng: Department of Ophthalmology, Stanford University School of Medicine
Daniel Rubin: Department of Biomedical Data Science, Stanford University School of Medicine

DOI: https://doi.org/10.1038/s41598-023-33365-y
Journal volume & issue: Vol. 13, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Diabetic retinopathy (DR) is a major cause of vision impairment in diabetic patients worldwide. Due to its prevalence, early clinical diagnosis is essential to improve treatment management of DR patients. Despite recent demonstration of successful machine learning (ML) models for automated DR detection, there is a significant clinical need for robust models that can be trained with smaller cohorts of dataset and still perform with high diagnostic accuracy in independent clinical datasets (i.e., high model generalizability). Towards this need, we have developed a self-supervised contrastive learning (CL) based pipeline for classification of referable vs non-referable DR. Self-supervised CL based pretraining allows enhanced data representation, therefore, the development of robust and generalized deep learning (DL) models, even with small, labeled datasets. We have integrated a neural style transfer (NST) augmentation in the CL pipeline to produce models with better representations and initializations for the detection of DR in color fundus images. We compare our CL pretrained model performance with two state of the art baseline models pretrained with Imagenet weights. We further investigate the model performance with reduced labeled training data (down to 10 percent) to test the robustness of the model when trained with small, labeled datasets. The model is trained and validated on the EyePACS dataset and tested independently on clinical datasets from the University of Illinois, Chicago (UIC). Compared to baseline models, our CL pretrained FundusNet model had higher area under the receiver operating characteristics (ROC) curve (AUC) (CI) values (0.91 (0.898 to 0.930) vs 0.80 (0.783 to 0.820) and 0.83 (0.801 to 0.853) on UIC data). At 10 percent labeled training data, the FundusNet AUC was 0.81 (0.78 to 0.84) vs 0.58 (0.56 to 0.64) and 0.63 (0.60 to 0.66) in baseline models, when tested on the UIC dataset. CL based pretraining with NST significantly improves DL classification performance, helps the model generalize well (transferable from EyePACS to UIC data), and allows training with small, annotated datasets, therefore reducing ground truth annotation burden of the clinicians.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal