FCAU-Net for the Semantic Segmentation of Fine-Resolution Remotely Sensed Images

Xuerui Niu; Qiaolin Zeng; Xiaobo Luo; Liangfu Chen

doi:10.3390/rs14010215

Remote Sensing (Jan 2022)

FCAU-Net for the Semantic Segmentation of Fine-Resolution Remotely Sensed Images

Xuerui Niu,
Qiaolin Zeng,
Xiaobo Luo,
Liangfu Chen

Affiliations

Xuerui Niu: The College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
Qiaolin Zeng: The College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
Xiaobo Luo: The College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
Liangfu Chen: The Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China

DOI: https://doi.org/10.3390/rs14010215
Journal volume & issue: Vol. 14, no. 1
p. 215

Abstract

Read online

The semantic segmentation of fine-resolution remotely sensed images is an urgent issue in satellite image processing. Solving this problem can help overcome various obstacles in urban planning, land cover classification, and environmental protection, paving the way for scene-level landscape pattern analysis and decision making. Encoder-decoder structures based on attention mechanisms have been frequently used for fine-resolution image segmentation. In this paper, we incorporate a coordinate attention (CA) mechanism, adopt an asymmetric convolution block (ACB), and design a refinement fusion block (RFB), forming a network named the fusion coordinate and asymmetry-based U-Net (FCAU-Net). Furthermore, we propose novel convolutional neural network (CNN) architecture to fully capture long-term dependencies and fine-grained details in fine-resolution remotely sensed imagery. This approach has the following advantages: (1) the CA mechanism embeds position information into a channel attention mechanism to enhance the feature representations produced by the network while effectively capturing position information and channel relationships; (2) the ACB enhances the feature representation ability of the standard convolution layer and captures and refines the feature information in each layer of the encoder; and (3) the RFB effectively integrates low-level spatial information and high-level abstract features to eliminate background noise when extracting feature information, reduces the fitting residuals of the fused features, and improves the ability of the network to capture information flows. Extensive experiments conducted on two public datasets (ZY-3 and DeepGlobe) demonstrate the effectiveness of the FCAU-Net. The proposed FCAU-Net transcends U-Net, Attention U-Net, the pyramid scene parsing network (PSPNet), DeepLab v3+, the multistage attention residual U-Net (MAResU-Net), MACU-Net, and the Transformer U-Net (TransUNet). Specifically, the FCAU-Net achieves a 97.97% (95.05%) pixel accuracy (PA), a 98.53% (91.27%) mean PA (mPA), a 95.17% (85.54%) mean intersection over union (mIoU), and a 96.07% (90.74%) frequency-weighted IoU (FWIoU) on the ZY-3 (DeepGlobe) dataset.

Published in Remote Sensing

ISSN: 2072-4292 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science
Website: http://www.mdpi.com/journal/remotesensing/

About the journal

Abstract

Keywords