Swin Transformer Assisted Prior Attention Network for Medical Image Segmentation

Zhihao Liao; Neng Fan; Kai Xu

doi:10.3390/app12094735

Applied Sciences (May 2022)

Swin Transformer Assisted Prior Attention Network for Medical Image Segmentation

Zhihao Liao,
Neng Fan,
Kai Xu

Affiliations

Zhihao Liao: School of Information and Engineering, Nanchang University, Nanchang 330031, China
Neng Fan: School of Mathematics and Computer Sciences, Nanchang University, Nanchang 330031, China
Kai Xu: School of Mathematics and Computer Sciences, Nanchang University, Nanchang 330031, China

DOI: https://doi.org/10.3390/app12094735
Journal volume & issue: Vol. 12, no. 9
p. 4735

Abstract

Read online

Transformer complements convolutional neural network (CNN) has achieved better performance than improved CNN-based methods. Specially, Transformer is utilized to be combined with U-shaped structure, skip-connections, encoder, and even them all together. However, the intermediate supervision network based on the coarse-to-fine strategy has not been combined with Transformer to improve the generalization of CNN-based methods. In this paper, we propose Swin-PANet, which is applying a window-based self-attention mechanism by Swin Transformer in the intermediate supervision network, called prior attention network. A new enhanced attention block based on CCA is also proposed to aggregate the features from skip-connections and prior attention network, and further refine details of boundaries. Swin-PANet can address the dilemma that traditional Transformer network has poor interpretability in the process of attention calculation and Swin-PANet can insert its attention predictions into prior attention network for intermediate supervision learning which is humanly interpretable and controllable. Hence, the intermediate supervision network assisted by Swin Transformer provides better attention learning and interpretability in network for accurate and automatic medical image segmentation. The experimental results evaluate the effectiveness of Swin-PANet which outperforms state-of-the-art methods in some famous medical segmentation tasks including cell and skin lesion segmentation.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords