Applied Sciences (May 2024)

An Efficient Semantic Segmentation Method for Remote-Sensing Imagery Using Improved Coordinate Attention

  • Yan Huo,
  • Shuang Gang,
  • Liang Dong,
  • Chao Guan

DOI
https://doi.org/10.3390/app14104075
Journal volume & issue
Vol. 14, no. 10
p. 4075

Abstract

Read online

Semantic segmentation stands as a prominent domain within remote sensing that is currently garnering significant attention. This paper introduces a pioneering semantic segmentation model based on TransUNet architecture with improved coordinate attention for remote-sensing imagery. It is composed of an encoding stage and a decoding stage. Notably, an enhanced and improved coordinate attention module is employed by integrating two pooling methods to generate weights. Subsequently, the feature map undergoes reweighting to accentuate foreground information and suppress background information. To address the issue of time complexity, this paper introduces an improvement to the transformer model by sparsifying the attention matrix. This reduces the computing expense of calculating attention, making the model more efficient. Additionally, the paper uses a combined loss function that is designed to enhance the training performance of the model. The experimental results conducted on three public datasets manifest the efficiency of the proposed method. The results indicate that it excels in delivering outstanding performance for semantic segmentation tasks pertaining to remote-sensing images.

Keywords