Multi-Scale Attention Network for Building Extraction from High-Resolution Remote Sensing Images

Jing Chang; Xiaohui He; Panle Li; Ting Tian; Xijie Cheng; Mengjia Qiao; Tao Zhou; Beibei Zhang; Ziqian Chang; Tingwei Fan

doi:10.3390/s24031010

Sensors (Feb 2024)

Multi-Scale Attention Network for Building Extraction from High-Resolution Remote Sensing Images

Jing Chang,
Xiaohui He,
Panle Li,
Ting Tian,
Xijie Cheng,
Mengjia Qiao,
Tao Zhou,
Beibei Zhang,
Ziqian Chang,
Tingwei Fan

Affiliations

Jing Chang: School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
Xiaohui He: School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China
Panle Li: School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China
Ting Tian: School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
Xijie Cheng: School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China
Mengjia Qiao: School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China
Tao Zhou: School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
Beibei Zhang: School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China
Ziqian Chang: School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China
Tingwei Fan: School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China

DOI: https://doi.org/10.3390/s24031010
Journal volume & issue: Vol. 24, no. 3
p. 1010

Abstract

Read online

The precise building extraction from high-resolution remote sensing images holds significant application for urban planning, resource management, and environmental conservation. In recent years, deep neural networks (DNNs) have garnered substantial attention for their adeptness in learning and extracting features, becoming integral to building extraction methodologies and yielding noteworthy performance outcomes. Nonetheless, prevailing DNN-based models for building extraction often overlook spatial information during the feature extraction phase. Additionally, many existing models employ a simplistic and direct approach in the feature fusion stage, potentially leading to spurious target detection and the amplification of internal noise. To address these concerns, we present a multi-scale attention network (MSANet) tailored for building extraction from high-resolution remote sensing images. In our approach, we initially extracted multi-scale building feature information, leveraging the multi-scale channel attention mechanism and multi-scale spatial attention mechanism. Subsequently, we employed adaptive hierarchical weighting processes on the extracted building features. Concurrently, we introduced a gating mechanism to facilitate the effective fusion of multi-scale features. The efficacy of the proposed MSANet was evaluated using the WHU aerial image dataset and the WHU satellite image dataset. The experimental results demonstrate compelling performance metrics, with the F1 scores registering at 93.76% and 77.64% on the WHU aerial imagery dataset and WHU satellite dataset II, respectively. Furthermore, the intersection over union (IoU) values stood at 88.25% and 63.46%, surpassing benchmarks set by DeepLabV3 and GSMC.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords