IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2024)

Multiscale Fusion CNN-Transformer Network for High-Resolution Remote Sensing Image Change Detection

  • Ming Jiang,
  • Yimin Chen,
  • Zhe Dong,
  • Xiaoping Liu,
  • Xinchang Zhang,
  • Honghui Zhang

DOI
https://doi.org/10.1109/JSTARS.2024.3361507
Journal volume & issue
Vol. 17
pp. 5280 – 5293

Abstract

Read online

Accurate change detection using remote sensing data is crucial for understanding surface dynamics. Despite the impressive success of current convolutional neural network (CNN)-based techniques, their feature extraction and representation capabilities are limited, leading to pseudochanges and omissions. To address this issue, we propose a multiscale fusion CNN-transformer network (MSFCTNet), which incorporates the strengths of CNN and transformer to improve change detection capability. The network utilizes a Siamese CNN to extract features from the bi-temporal image pairs and then combines multiscale features using a CNN-transformer hybrid structure to extract global and local features. In the decode stage, a gated attention module is used to filter the extracted features layer by layer. Moreover, before outputting the prediction results, a feature refinement head is employed to further refine the features, suppress background noise, and improve detection capability. LEVIR-CD, SVCD, and SYSU-CD datasets are used to evaluate the proposed network. Experimental results indicate that MSFCTNet outperforms other state-of-the-art change detection methods, proving the potential of MSFCTNet for change detection tasks.

Keywords