IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2022)

STransUNet: A Siamese TransUNet-Based Remote Sensing Image Change Detection Network

  • Jian Yuan,
  • Liejun Wang,
  • Shuli Cheng

DOI
https://doi.org/10.1109/JSTARS.2022.3217038
Journal volume & issue
Vol. 15
pp. 9241 – 9253

Abstract

Read online

In modern remote sensing image change detection (CD), convolution neural network (CNN), especially U-shaped structure (UNet), has achieved great success due to their powerful discriminative ability. However, UNet-based CNN networks usually have limitations in modeling global dependencies due to the intrinsic locality of convolution operations. Transformer has recently emerged as an alternative architecture for dense prediction tasks due to the global self-attention mechanism. However, due to the limitation of hardware resources, pure Transformer methods generally lack the ability to capture global information at a low level. Based on these existing problems, we propose STransUNet, which combines Transformer and UNet architecture. STransUNet can not only capture shallow detail features at an early stage but also model global context in high-level feature. In addition, we design an efficient feature fusion module named cross-enhanced adaptive fusion (CEAF). Our model mainly consists of three parts: encoder, fusion module, and decoder. The decoder is a CNN-Transformer hybrid structure. CNN extracts multilevel feature information. Transformer encodes tokenized sequence to capture global context. CEAF module cross-enhances and adaptively fuses bitemporal features to enhance feature representation. In the decoding stage, we introduce a cascaded upsampling decoder (CUP). CUP progressively aggregates low-level CNN features and high-level Transformer features to full resolution. On four public CD datasets, our STransUNet achieves better CD results than six state-of-the-art algorithms.

Keywords