Scientific Reports (Sep 2024)

Prior-guided attention fusion transformer for multi-lesion segmentation of diabetic retinopathy

  • Chenfangqian Xu,
  • Xiaoxin Guo,
  • Guangqi Yang,
  • Yihao Cui,
  • Longchen Su,
  • Hongliang Dong,
  • Xiaoying Hu,
  • Songtian Che

DOI
https://doi.org/10.1038/s41598-024-71650-6
Journal volume & issue
Vol. 14, no. 1
pp. 1–15

Abstract

To improve the diagnostic accuracy of diabetic retinopathy (DR) screening and reduce the workload of ophthalmologists, we propose a prior-guided attention fusion Transformer for multi-lesion segmentation of DR. An attention fusion module improves the key generator by integrating self-attention and cross-attention while limiting the introduction of noise. The self-attention focuses on the lesions themselves, capturing their correlations at a global scale, while the cross-attention, using pre-trained vessel masks as prior knowledge, exploits the correlation between lesions and vessels to reduce the ambiguity in lesion detection caused by complex fundus structures. A shift block further expands the association areas between lesions and vessels and enhances the model's sensitivity to small-scale structures. To dynamically adjust the model's perception of features at different scales, we propose a scale-adaptive attention that adaptively learns fusion weights for feature maps at different scales in the decoder, capturing features and details more effectively. Experimental results on two public datasets (DDR and IDRiD) demonstrate that our model outperforms other state-of-the-art models for multi-lesion segmentation.
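The fusion of self-attention and vessel-guided cross-attention described in the abstract can be sketched as below. This is an illustrative single-head NumPy sketch, not the paper's actual architecture: the token shapes, the scalar fusion weight `alpha`, and the function names are assumptions for exposition (in the paper the fusion is learned inside the key generator of the Transformer).

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(q, k, v):
    # standard scaled dot-product attention: softmax(QK^T / sqrt(d)) V
    d = q.shape[-1]
    return softmax(q @ k.T / np.sqrt(d)) @ v

def fused_attention(lesion_tokens, vessel_tokens, alpha=0.5):
    # self-attention: lesion tokens attend to each other,
    # capturing lesion correlations at a global scale
    self_out = attention(lesion_tokens, lesion_tokens, lesion_tokens)
    # cross-attention: lesion queries attend to keys/values derived
    # from the pre-trained vessel mask (the prior knowledge)
    cross_out = attention(lesion_tokens, vessel_tokens, vessel_tokens)
    # fusion: weighted combination (alpha fixed here; learned in the model)
    return alpha * self_out + (1 - alpha) * cross_out

rng = np.random.default_rng(0)
lesion = rng.standard_normal((16, 32))   # 16 tokens, embedding dim 32
vessel = rng.standard_normal((16, 32))   # vessel-prior features, same shape
out = fused_attention(lesion, vessel)
print(out.shape)  # (16, 32)
```

The point of the cross-attention branch is that lesion queries can score against vessel-prior keys, so regions whose appearance is ambiguous against the fundus background are disambiguated by their spatial relationship to vessels.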

Keywords