Heliyon (Jul 2024)
NCME-Net: Nuclear cataract mask encoder network for intelligent grading using self-supervised learning from anterior segment photographs
Abstract
Cataracts are a leading cause of blindness worldwide, making accurate diagnosis and effective surgical planning critical. However, grading the severity of the lens nucleus is challenging because deep learning (DL) models pretrained using ImageNet perform poorly when applied directly to medical data due to the limited availability of labeled medical images and high interclass similarity. Self-supervised pretraining offers a solution by circumventing the need for cost-intensive data annotations and bridging domain disparities. In this study, to address the challenges of intelligent grading, we proposed a hybrid model called nuclear cataract mask encoder network (NCME-Net), which utilizes self-supervised pretraining for the four-class analysis of nuclear cataract severity. A total of 792 images of nuclear cataracts were categorized into the training set (533 images), the validation set (139 images), and the test set (100 images). NCME-Net achieved a diagnostic accuracy of 91.0 % on the test set, a 5.0 % improvement over the best-performing DL model (ResNet50). Experimental results demonstrate NCME-Net's ability to distinguish between cataract severities, particularly in scenarios with limited samples, making it a valuable tool for intelligently diagnosing cataracts. In addition, the effect of different self-supervised tasks on the model's ability to capture the intrinsic structure of the data was studied. Findings indicate that image restoration tasks significantly enhance semantic information extraction.