IEEE Access (Jan 2020)

Attention Guided Encoder-Decoder Network With Multi-Scale Context Aggregation for Land Cover Segmentation

  • Shuyang Wang,
  • Xiaodong Mu,
  • Dongfang Yang,
  • Hao He,
  • Peng Zhao

DOI
https://doi.org/10.1109/ACCESS.2020.3040862
Journal volume & issue
Vol. 8
pp. 215299 – 215309

Abstract

Read online

Land cover segmentation is an important and challenging task in the field of remote sensing. Even though convolutional neural networks (CNNs) provide great support for semantic segmentation, standard models are still difficult to capture global information and long-range dependencies in remote sensing images. To overcome these limitations, we proposed an attention guided encoder-decoder network with multi-scale context aggregation to achieve more accurate segmentation of land cover. Based on the structure of the encoder-decoder network, we introduce a multi-scale feature fusion module with two attention modules to the top of the encoder. The multi-scale feature fusion module is employed to aggregate multi-scale features and capture global correlations. The attention modules are used to exploit the long-range dependencies and the interdependence between channels from the perspective of space and channel respectively. The experimental results on the GF-2 images show that our proposed method achieves state-of-the-art performance, with an OA of 84.1% and the mIoU of 62.3%. Compared with the baseline network, our method improves the OA by 3.3% and the mIoU by 4.4%. The comparative experiments also demonstrate that the proposed approach can significantly improve the accuracy of land cover segmentation than other compared methods.

Keywords