A vision transformer architecture for the automated segmentation of retinal lesions in spectral domain optical coherence tomography images

Daniel Philippi; Kai Rothaus; Mauro Castelli

doi:10.1038/s41598-023-27616-1

Scientific Reports (Jan 2023)

A vision transformer architecture for the automated segmentation of retinal lesions in spectral domain optical coherence tomography images

Daniel Philippi,
Kai Rothaus,
Mauro Castelli

Affiliations

Daniel Philippi: NOVA Information Management School (NOVA IMS), Universidade Nova de Lisboa
Kai Rothaus: Department of Ophthalmology, St. Franziskus Hospital
Mauro Castelli: NOVA Information Management School (NOVA IMS), Universidade Nova de Lisboa

DOI: https://doi.org/10.1038/s41598-023-27616-1
Journal volume & issue: Vol. 13, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Neovascular age-related macular degeneration (nAMD) is one of the major causes of irreversible blindness and is characterized by accumulations of different lesions inside the retina. AMD biomarkers enable experts to grade the AMD and could be used for therapy prognosis and individualized treatment decisions. In particular, intra-retinal fluid (IRF), sub-retinal fluid (SRF), and pigment epithelium detachment (PED) are prominent biomarkers for grading neovascular AMD. Spectral-domain optical coherence tomography (SD-OCT) revolutionized nAMD early diagnosis by providing cross-sectional images of the retina. Automatic segmentation and quantification of IRF, SRF, and PED in SD-OCT images can be extremely useful for clinical decision-making. Despite the excellent performance of convolutional neural network (CNN)-based methods, the task still presents some challenges due to relevant variations in the location, size, shape, and texture of the lesions. This work adopts a transformer-based method to automatically segment retinal lesion from SD-OCT images and qualitatively and quantitatively evaluate its performance against CNN-based methods. The method combines the efficient long-range feature extraction and aggregation capabilities of Vision Transformers with data-efficient training of CNNs. The proposed method was tested on a private dataset containing 3842 2-dimensional SD-OCT retina images, manually labeled by experts of the Franziskus Eye-Center, Muenster. While one of the competitors presents a better performance in terms of Dice score, the proposed method is significantly less computationally expensive. Thus, future research will focus on the proposed network’s architecture to increase its segmentation performance while maintaining its computational efficiency.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal