Remote Sensing (Sep 2023)

SDRSwin: A Residual Swin Transformer Network with Saliency Detection for Infrared and Visible Image Fusion

  • Shengshi Li,
  • Guanjun Wang,
  • Hui Zhang,
  • Yonghua Zou

DOI
https://doi.org/10.3390/rs15184467
Journal volume & issue
Vol. 15, no. 18
p. 4467

Abstract

Read online

Infrared and visible image fusion is a solution that generates an information-rich individual image with different modal information by fusing images obtained from various sensors. Salient detection can better emphasize the targets of concern. We propose a residual Swin Transformer fusion network based on saliency detection, termed SDRSwin, aiming to highlight the salient thermal targets in the infrared image while maintaining the texture details in the visible image. The SDRSwin network is trained with a two-stage training approach. In the first stage, we train an encoder–decoder network based on residual Swin Transformers to achieve powerful feature extraction and reconstruction capabilities. In the second stage, we develop a novel salient loss function to guide the network to fuse the salient targets in the infrared image and the background detail regions in the visible image. The extensive results indicate that our method has abundant texture details with clear bright infrared targets and achieves a better performance than the twenty-one state-of-the-art methods in both subjective and objective evaluation.

Keywords