Memory-Augmented Transformer for Remote Sensing Image Semantic Segmentation

Xin Zhao; Jiayi Guo; Yueting Zhang; Yirong Wu

doi:10.3390/rs13224518

Remote Sensing (Nov 2021)

Memory-Augmented Transformer for Remote Sensing Image Semantic Segmentation

Xin Zhao,
Jiayi Guo,
Yueting Zhang,
Yirong Wu

Affiliations

Xin Zhao: Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China
Jiayi Guo: Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China
Yueting Zhang: Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China
Yirong Wu: Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China

DOI: https://doi.org/10.3390/rs13224518
Journal volume & issue: Vol. 13, no. 22
p. 4518

Abstract

Read online

The semantic segmentation of remote sensing images requires distinguishing local regions of different classes and exploiting a uniform global representation of the same-class instances. Such requirements make it necessary for the segmentation methods to extract discriminative local features between different classes and to explore representative features for all instances of a given class. While common deep convolutional neural networks (DCNNs) can effectively focus on local features, they are limited by their receptive field to obtain consistent global information. In this paper, we propose a memory-augmented transformer (MAT) to effectively model both the local and global information. The feature extraction pipeline of the MAT is split into a memory-based global relationship guidance module and a local feature extraction module. The local feature extraction module mainly consists of a transformer, which is used to extract features from the input images. The global relationship guidance module maintains a memory bank for the consistent encoding of the global information. Global guidance is performed by memory interaction. Bidirectional information flow between the global and local branches is conducted by a memory-query module, as well as a memory-update module, respectively. Experiment results on the ISPRS Potsdam and ISPRS Vaihingen datasets demonstrated that our method can perform competitively with state-of-the-art methods.

Published in Remote Sensing

ISSN: 2072-4292 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science
Website: http://www.mdpi.com/journal/remotesensing/

About the journal

Abstract

Keywords