CaSaFormer: A cross- and self-attention based lightweight network for large-scale building semantic segmentation

Jiayi Li; Yuping Hu; Xin Huang

doi:10.1016/j.jag.2024.103942

International Journal of Applied Earth Observations and Geoinformation (Jun 2024)

Affiliations

Jiayi Li: School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, PR China; Hubei Luojia Laboratory, Wuhan University, Wuhan 430079, PR China
Yuping Hu: School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, PR China
Xin Huang (corresponding author): School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, PR China

Journal volume & issue: Vol. 130, p. 103942

Abstract

Buildings play a crucial role in geographic information systems, and advances in the resolution of remote sensing imagery have made it possible to extract them at a larger scale. However, this progress has also raised the requirements on methods for efficiency and generalization performance. To address this, we propose a lightweight building semantic segmentation network named CaSaFormer. Specifically, we propose an efficient module composed of Cross-attention and Self-attention Blocks connected in series (CaSa Block) to extract valuable semantic information from the feature pyramid. Furthermore, we develop a novel Cross-Attention Gate Fusion (CAGF) module to effectively integrate complementary components from global semantic features and local spatial features. Experimental results demonstrate that CaSaFormer outperforms state-of-the-art (SOTA) lightweight methods with the best trade-off between accuracy and efficiency, showing a 1.92% improvement in IoU at 16% of the computational complexity. Compared with non-lightweight methods under equivalent computational resources, it also achieves a 1.69% IoU gain with only 1.7% of the computational complexity. The code is available at: https://github.com/YpingHu/CaSaFormer.
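For readers unfamiliar with the IoU (intersection over union) metric in which the gains above are reported, the following is a minimal sketch of the standard definition for a binary building mask. It is not taken from the authors' repository; the function name `binary_iou`, the nested-list mask format, and the convention for two empty masks are assumptions made purely for illustration.

```python
def binary_iou(pred, target):
    """Intersection over union of two binary masks (nested lists of 0/1).

    Hypothetical helper for illustration; not the paper's evaluation code.
    """
    intersection = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            # A pixel is in the intersection if both masks mark it as building.
            intersection += 1 if (p and t) else 0
            # A pixel is in the union if either mask marks it as building.
            union += 1 if (p or t) else 0
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if union == 0 else intersection / union


# Toy 2x3 masks: 2 pixels overlap, 4 pixels are marked in at least one mask.
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
iou = binary_iou(pred, target)  # 2 / 4
```

On the toy masks above the intersection covers 2 pixels and the union 4, giving an IoU of 0.5. The abstract does not say here whether its percentages are computed per image or over a whole dataset.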

Published in International Journal of Applied Earth Observations and Geoinformation

ISSN: 1569-8432 (Print); 1872-826X (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Geography. Anthropology. Recreation: Physical geography; Geography. Anthropology. Recreation: Environmental sciences
Website: https://www.journals.elsevier.com/international-journal-of-applied-earth-observation-and-geoinformation
