GFRNet: Rethinking the global contexts extraction in medical images segmentation through matrix factorization and self‐attention

Lifang Chen; Shanglai Wang; Li Wan; Jianghu Su; Shunfeng Wang

doi:10.1049/cvi2.12243

IET Computer Vision (Mar 2024)

Affiliations

Lifang Chen: School of Artificial Intelligence and Computer Science Jiangnan University Wuxi China
Shanglai Wang: School of Artificial Intelligence and Computer Science Jiangnan University Wuxi China
Li Wan: School of Artificial Intelligence and Computer Science Jiangnan University Wuxi China
Jianghu Su: School of Artificial Intelligence and Computer Science Jiangnan University Wuxi China
Shunfeng Wang: School of Artificial Intelligence and Computer Science Jiangnan University Wuxi China

DOI: https://doi.org/10.1049/cvi2.12243
Journal volume & issue: Vol. 18, no. 2, pp. 260–272

Abstract

Because lesion boundaries fluctuate widely and lesion regions vary internally in medical image segmentation, current methods may have difficulty capturing sufficient global contexts to deal with these inherent challenges. This can yield fragmented (discrete) segmentation masks that undermine segmentation performance. Although self‐attention can capture long‐distance dependencies between pixels, it is computationally expensive, and the global contexts it extracts are still insufficient. To this end, the authors propose GFRNet, which draws on low‐rank matrix factorization, forming global contexts locally, to obtain global contexts that are entirely different from those extracted by self‐attention. The authors integrate the different global contexts extracted by self‐attention and by low‐rank matrix factorization to obtain versatile global contexts. In addition, to recover the spatial contexts lost during matrix factorization and to enhance boundary contexts, the authors propose the Modified Matrix Decomposition module, which employs depth‐wise separable convolution and spatial augmentation in the low‐rank matrix factorization process. Comprehensive experiments on four benchmark datasets show that GFRNet performs better than the relevant CNN‐based and transformer‐based methods.
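
The contrast the abstract draws between self‐attention and low‐rank matrix factorization can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the use of non‐negative matrix factorization with multiplicative updates, the rank, the step count, and the final summation are all assumptions chosen for illustration, and the Modified Matrix Decomposition module (depth‐wise separable convolution, spatial augmentation) is not modelled here.

```python
import numpy as np

def self_attention_context(X):
    """Hypothetical helper: plain dot-product self-attention with Q = K = V = X.

    X is a flattened feature map of shape (n, c), where n = H * W pixels.
    The (n, n) affinity matrix is what makes the cost quadratic in n.
    """
    n, c = X.shape
    scores = X @ X.T / np.sqrt(c)                  # (n, n) pairwise affinities
    scores -= scores.max(axis=1, keepdims=True)    # numerical stability for softmax
    weights = np.exp(scores)
    weights /= weights.sum(axis=1, keepdims=True)  # each row sums to 1
    return weights @ X                             # (n, c) attended features

def low_rank_context(X, rank=4, steps=50, eps=1e-8, seed=0):
    """Hypothetical helper: low-rank reconstruction X ~ C @ D via NMF.

    Uses multiplicative updates (an assumption; the abstract does not name
    the factorization). X must be non-negative, e.g. post-ReLU features.
    Each step costs O(n * c * rank), i.e. linear in the number of pixels.
    """
    rng = np.random.default_rng(seed)
    n, c = X.shape
    C = rng.random((n, rank))   # per-pixel coefficients
    D = rng.random((rank, c))   # shared dictionary of `rank` bases
    for _ in range(steps):
        C *= (X @ D.T) / (C @ D @ D.T + eps)
        D *= (C.T @ X) / (C.T @ C @ D + eps)
    return C @ D                # (n, c) matrix of rank at most `rank`

# Toy non-negative feature map: 8 x 8 pixels, 16 channels (illustrative sizes).
rng = np.random.default_rng(0)
H, W, c = 8, 8, 16
X = rng.random((H, W, c)).reshape(H * W, c)

attn_ctx = self_attention_context(X)
lr_ctx = low_rank_context(X, rank=4)

# Placeholder fusion: a plain sum. The abstract does not specify how the
# two kinds of global context are integrated.
fused = attn_ctx + lr_ctx
```

In this sketch the attention branch builds an explicit pixel‐by‐pixel affinity matrix, whereas the factorization branch summarises all pixels with a small shared dictionary, which is one way to read "forming global contexts locally".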

Published in IET Computer Vision

ISSN: 1751-9632 (Print); 1751-9640 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519640
