IET Image Processing (Jan 2024)
Hybrid attention mechanism of feature fusion for medical image segmentation
Abstract
Traditional convolutional neural networks (CNNs) have achieved good performance in multi‐organ segmentation of medical images. However, CNNs lack the ability to model long‐range dependencies and correlations between image pixels, and they usually ignore information along the channel dimension. To further improve multi‐organ segmentation performance, a hybrid attention mechanism model is proposed. First, a CNN was used to extract multi‐scale feature maps, which were fed into a Channel Attention Enhancement Module (CAEM) to selectively attend to the target organs in medical images, while a Transformer encoded tokenized image patches from the CNN feature maps as an input sequence to model long‐range dependencies. Second, the decoder upsampled the Transformer output and fused it with the CAEM features at multiple scales through skip connections. Finally, a Refinement Module (RM) was introduced after the decoder to strengthen feature correlations within the same organ and feature discriminability between different organs. The model achieved superior Dice coefficient (%) and HD95 results on both the Synapse multi‐organ segmentation and cardiac diagnosis challenge datasets. The hybrid attention mechanism exhibited high efficiency and high segmentation accuracy on medical images.
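The sketch below illustrates, under stated assumptions, the pipeline the abstract describes: CNN features pass through a channel attention module (standing in for CAEM), tokenized features pass through a Transformer encoder for long‐range context, and the two branches are fused before decoding and refinement. The module designs, layer sizes, and class names here are illustrative assumptions, not the authors' published implementation.

```python
# Minimal sketch of the hybrid attention pipeline described in the abstract.
# CAEM is approximated by squeeze-and-excitation-style channel attention; the
# exact CAEM/RM designs and Transformer configuration are assumptions.
import torch
import torch.nn as nn


class ChannelAttention(nn.Module):
    """Hypothetical stand-in for CAEM: reweights channels of a CNN feature map."""
    def __init__(self, channels: int, reduction: int = 8):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        w = self.fc(x.mean(dim=(2, 3)))       # global average pool -> channel weights
        return x * w.view(b, c, 1, 1)         # emphasise channels relevant to target organs


class HybridSegmenter(nn.Module):
    """CNN encoder -> Transformer over tokenized features -> fusion -> decoder -> refinement."""
    def __init__(self, in_ch: int = 1, feat_ch: int = 64, num_classes: int = 9):
        super().__init__()
        self.cnn = nn.Sequential(
            nn.Conv2d(in_ch, feat_ch, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(feat_ch, feat_ch, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
        self.caem = ChannelAttention(feat_ch)
        enc_layer = nn.TransformerEncoderLayer(d_model=feat_ch, nhead=4, batch_first=True)
        self.transformer = nn.TransformerEncoder(enc_layer, num_layers=2)
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(feat_ch * 2, feat_ch, 2, stride=2), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(feat_ch, feat_ch, 2, stride=2), nn.ReLU(inplace=True),
        )
        # Refinement stand-in: a light conv block sharpening per-organ feature discriminability.
        self.refine = nn.Sequential(nn.Conv2d(feat_ch, feat_ch, 3, padding=1), nn.ReLU(inplace=True))
        self.head = nn.Conv2d(feat_ch, num_classes, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        f = self.cnn(x)                         # CNN feature map (single scale in this sketch)
        attn = self.caem(f)                     # channel-attention-enhanced features
        b, c, h, w = f.shape
        tokens = f.flatten(2).transpose(1, 2)   # tokenize feature map: (B, H*W, C)
        ctx = self.transformer(tokens)          # model long-range dependencies
        ctx = ctx.transpose(1, 2).view(b, c, h, w)
        fused = torch.cat([ctx, attn], dim=1)   # skip-connection-style fusion of both branches
        out = self.refine(self.decoder(fused))
        return self.head(out)


if __name__ == "__main__":
    model = HybridSegmenter()
    logits = model(torch.randn(2, 1, 64, 64))   # -> (2, num_classes, 64, 64)
    print(logits.shape)
```

In this sketch the Transformer branch supplies global context while the channel-attention branch preserves organ-selective local features; concatenation before the decoder is one simple way to realise the multi-scale fusion the abstract mentions.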
Keywords