IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2024)

Detail Enhanced Change Detection in VHR Images Using a Self-Supervised Multiscale Hybrid Network

  • Dalong Zheng,
  • Zebin Wu,
  • Jia Liu,
  • Chih-Cheng Hung,
  • Zhihui Wei

DOI
https://doi.org/10.1109/JSTARS.2023.3348630
Journal volume & issue
Vol. 17
pp. 3181 – 3196

Abstract

Read online

The integration of the transformer and convolutional neural network (CNN) has become a useful method for change detection in remote sensing images. The main function of the transformer is to capture the global features, while the CNN is more for obtaining the local features. However, such an integration is not efficient for change detection in the very-high-resolution (VHR) remote sensing images with fine surface detail information. Hence, to improve this traditional construction of the transformer and CNN, we propose a dense Swin-Transformer-V2 (DST) and VGG16, coined as DST-VGG, for extracting the discriminatory features for change detection. The difference between our proposed network and other networks is that the output of the VGG16 encoders will be used in the DST in which more Swin-V2 blocks are added for fine feature extraction. The learning model in the VGG16 encoders employs a self-supervised method, which is guided through the change in details. Our network not only inherits the advantages of the integration of the transformer and CNN, but also captures the features of change relationship through the DST and catches the primitive features in both prechanged and postchanged regions through the VGG16. In addition, we design a mixed feature pyramid within the DST, which provides interlayer interaction information and intralayer multiscale information for a more complete feature learning within the new network. Furthermore, we impose a self-supervised strategy to guide the VGG16 provide the semantic change information from the output features of the encoder. We compared our experimental results with those of the state-of-the-art methods on four commonly used public VHR remote sensing datasets. It shows that our network performs better, in terms of F1, IoU, and OA, than those of the existing networks for change detection.

Keywords