A cross-stage features fusion network for building extraction from remote sensing images

Xiaolong Zuo; Zhenfeng Shao; Jiaming Wang; Xiao Huang; Yu Wang

doi:10.1080/10095020.2024.2307922

Geo-spatial Information Science (Apr 2024)

A cross-stage features fusion network for building extraction from remote sensing images

Xiaolong Zuo,
Zhenfeng Shao,
Jiaming Wang,
Xiao Huang,
Yu Wang

Affiliations

Xiaolong Zuo: State Key Laboratory of Information Engineering in Surveying Mapping and Remote Sensing, Wuhan University, Wuhan, China
Zhenfeng Shao: State Key Laboratory of Information Engineering in Surveying Mapping and Remote Sensing, Wuhan University, Wuhan, China
Jiaming Wang: Hubei Key Laboratory of Intelligent Robot, Wuhan Institute of Technology, Wuhan, China
Xiao Huang: Department of Environmental Sciences, Emory University, Atlanta, GA, USA
Yu Wang: State Key Laboratory of Information Engineering in Surveying Mapping and Remote Sensing, Wuhan University, Wuhan, China

DOI: https://doi.org/10.1080/10095020.2024.2307922

Abstract

Read online

The deep learning-based building extraction methods produce different feature maps at different stages of the network, which contain different information features. The detailed information of the feature maps decreases along the depth of the network, and insufficiently detailed information results in limited accuracy. However, existing methods are incapable of making full use of low-level feature maps with rich details. To overcome these shortcomings, we proposed a Cross-stage Features Fusion Network (CFF-Net) for building extraction from remote sensing images. In the CFF-Net, we innovatively proposed a Cross-stage Features Fusion (CFF) module that fuses different features generated at different stages. And we used the attention mechanism to make the network more focused on important information at different scales. To further improve the accuracy of building extraction, we designed the Prediction Enhancement (PE) module, where the last convolutional layer and the feature map generated in the intermediate stage are used for prediction at the same time to enhance the final result. To evaluate the effectiveness of the proposed network, we conduct quantitative and qualitative experiments on the two publicly available datasets, i.e. the Inria dataset and the WHU datasets. CFF-Net outperformed other state-of-the-art algorithms on the two datasets in IoU and F1 metrics. The efficiency analysis reveals that the proposed CFF-Net achieves a great balance between building extraction performance and complexity/efficiency, with faster convergence and higher robustness.

Published in Geo-spatial Information Science

ISSN: 1009-5020 (Print); 1993-5153 (Online)
Publisher: Taylor & Francis Group
Country of publisher: United Kingdom
LCC subjects: Geography. Anthropology. Recreation: Mathematical geography. Cartography; Science: Astronomy: Geodesy
Website: https://www.tandfonline.com/journals/tgsi

About the journal

Abstract

Keywords