IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2023)

SPANet: Spatial Adaptive Convolution Based Content-Aware Network for Aerial Image Semantic Segmentation

  • Jianlong Hou,
  • Zhi Guo,
  • Yingchao Feng,
  • Youming Wu,
  • Wenhui Diao

DOI
https://doi.org/10.1109/JSTARS.2023.3244207
Journal volume & issue
Vol. 16
pp. 2192 – 2204

Abstract

Read online

Semantic segmentation of remote sensing images encounters four significant difficulties: 1) complex backgrounds, 2) large-scale differences, 3) numerous small objects, and 4) extreme foreground–background imbalance. However, the existing generic semantic segmentation models mainly focus on the modeling context information and rarely focus on these four issues. This article presents an enhanced remote sensing image semantic segmentation framework to solve these problems through the hierarchical atrous pyramid (HASP) module and spatial-adaptive convolution-based FPN decoder framework. On the one hand, HASP solved the problem of complex backgrounds and large-scale differences by further enlarging the receptive field of the network through the cascade of atrous convolution with various rates. On the other hand, spatial adaptive convolution is embedded in FPN decoder framework step by step to solve the problems of numerous small objects and extreme foreground–background imbalance. Besides, a boundary-based loss function is constructed to help the network optimize the relevant segmentation results. Extensive experiments over iSAID and ISPRS Vaihingen datasets reflect the superiority of the presented structure to conventional the state-of-the-art semantic segmentation approaches.

Keywords