Multi-label classification of retinal disease via a novel vision transformer model

Dong Wang; Jian Lian; Wanzhen Jiao

doi:10.3389/fnins.2023.1290803

Frontiers in Neuroscience (Jan 2024)

Affiliations

Dong Wang: School of Information Science and Electrical Engineering, Shandong Jiaotong University, Jinan, China
Jian Lian: School of Intelligence Engineering, Shandong Management University, Jinan, China
Wanzhen Jiao: Department of Ophthalmology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, China

DOI: https://doi.org/10.3389/fnins.2023.1290803
Journal volume & issue: Vol. 17

Abstract

Introduction: Precise identification of retinal disorders is essential for preventing both temporary and permanent visual impairment. Prior research has yielded encouraging results in classifying retinal images for a single retinal condition. In clinical practice, however, it is not uncommon for a single patient to present with multiple retinal disorders concurrently. Classifying retinal images into multiple labels therefore remains a significant obstacle for existing methods, yet solving it would provide insight into several conditions simultaneously.

Methods: This study presents a novel vision transformer architecture, called retinal ViT, which brings the self-attention mechanism to medical image analysis. Because the study aims to show that transformer-based models can achieve performance competitive with CNN-based models, the convolutional modules have been eliminated from the proposed model. The model concludes with a multi-label classifier built as a two-layer feed-forward network with a sigmoid activation function.

Results and discussion: On the publicly available ODIR-2019 dataset, the proposed model outperforms state-of-the-art approaches such as ResNet, VGG, DenseNet, and MobileNet in terms of Kappa, F1 score, AUC, and AVG.
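The two-layer feed-forward multi-label classifier with sigmoid activation mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `MultiLabelHead`, the feature and hidden dimensions, the ReLU between the two layers, the random weights, the label count of 8, and the 0.5 decision threshold are all assumptions chosen for illustration. The point it shows is that a sigmoid scores each label independently, so several disease labels can be active for one image.

```python
import math
import random


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def linear(weights, bias, inputs):
    """Dense layer: one output per weight row."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, bias)]


class MultiLabelHead:
    """Hypothetical two-layer feed-forward head with sigmoid outputs."""

    def __init__(self, in_dim, hidden_dim, num_labels, seed=0):
        rng = random.Random(seed)

        def init(rows, cols):
            # Small uniform random weights (assumed initialization).
            scale = 1.0 / math.sqrt(cols)
            return [[rng.uniform(-scale, scale) for _ in range(cols)]
                    for _ in range(rows)]

        self.w1, self.b1 = init(hidden_dim, in_dim), [0.0] * hidden_dim
        self.w2, self.b2 = init(num_labels, hidden_dim), [0.0] * num_labels

    def forward(self, features):
        # Layer 1 followed by ReLU (the hidden activation is an assumption).
        hidden = [max(0.0, h) for h in linear(self.w1, self.b1, features)]
        # Layer 2 produces one logit per label; sigmoid maps each to (0, 1).
        return [sigmoid(z) for z in linear(self.w2, self.b2, hidden)]


def predict_labels(probs, threshold=0.5):
    """Turn per-label probabilities into a binary label vector."""
    return [1 if p >= threshold else 0 for p in probs]


# Stand-in for the feature vector produced by the transformer encoder.
features = [0.1 * i for i in range(16)]
head = MultiLabelHead(in_dim=16, hidden_dim=8, num_labels=8)
probs = head.forward(features)
labels = predict_labels(probs)
```

Unlike a softmax, the per-label probabilities in `probs` are not constrained to sum to 1, which is what allows more than one entry of `labels` to be 1 for the same image.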

Published in Frontiers in Neuroscience

ISSN: 1662-4548 (Print); 1662-453X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: http://www.frontiersin.org/neuroscience
