LGNet: Local and global representation learning for fast biomedical image segmentation

Guoping Xu; Xuan Zhang; Wentao Liao; Shangbin Chen; Xinglong Wu

doi:10.1142/S1793545822430015

Journal of Innovative Optical Health Sciences (Jul 2023)

LGNet: Local and global representation learning for fast biomedical image segmentation

Guoping Xu,
Xuan Zhang,
Wentao Liao,
Shangbin Chen,
Xinglong Wu

Affiliations

Guoping Xu: School of Computer Science & Engineering, Hubei Key Laboratory of Intelligent Robot, Wuhan Institute of Technology, Wuhan, Hubei 430205, P. R. China
Xuan Zhang: School of Computer Science & Engineering, Hubei Key Laboratory of Intelligent Robot, Wuhan Institute of Technology, Wuhan, Hubei 430205, P. R. China
Wentao Liao: School of Computer Science & Engineering, Hubei Key Laboratory of Intelligent Robot, Wuhan Institute of Technology, Wuhan, Hubei 430205, P. R. China
Shangbin Chen: Britton Chance Center for Biomedical Photonics, Wuhan National Laboratory for Optoelectronics-Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
Xinglong Wu: School of Computer Science & Engineering, Hubei Key Laboratory of Intelligent Robot, Wuhan Institute of Technology, Wuhan, Hubei 430205, P. R. China

DOI: https://doi.org/10.1142/S1793545822430015
Journal volume & issue: Vol. 16, no. 04

Abstract

Read online

Medical image segmentation plays a crucial role in clinical diagnosis and therapy systems, yet still faces many challenges. Building on convolutional neural networks (CNNs), medical image segmentation has achieved tremendous progress. However, owing to the locality of convolution operations, CNNs have the inherent limitation in learning global context. To address the limitation in building global context relationship from CNNs, we propose LGNet, a semantic segmentation network aiming to learn local and global features for fast and accurate medical image segmentation in this paper. Specifically, we employ a two-branch architecture consisting of convolution layers in one branch to learn local features and transformer layers in the other branch to learn global features. LGNet has two key insights: (1) We bridge two-branch to learn local and global features in an interactive way; (2) we present a novel multi-feature fusion model (MSFFM) to leverage the global contexture information from transformer and the local representational features from convolutions. Our method achieves state-of-the-art trade-off in terms of accuracy and efficiency on several medical image segmentation benchmarks including Synapse, ACDC and MOST. Specifically, LGNet achieves the state-of-the-art performance with Dice’s indexes of 80.15% on Synapse, of 91.70% on ACDC, and of 95.56% on MOST. Meanwhile, the inference speed attains at 172 frames per second with [Formula: see text] input resolution. The extensive experiments demonstrate the effectiveness of the proposed LGNet for fast and accurate for medical image segmentation.

Published in Journal of Innovative Optical Health Sciences

ISSN: 1793-5458 (Print); 1793-7205 (Online)
Publisher: World Scientific Publishing
Country of publisher: Singapore
LCC subjects: Technology; Science: Physics: Optics. Light
Website: http://www.worldscientific.com/worldscinet/jiohs

About the journal

Abstract

Keywords