Scientific Reports (Sep 2024)

AerialIRGAN: unpaired aerial visible-to-infrared image translation with dual-encoder structure

  • Decao Ma,
  • Juan Su,
  • Shaopeng Li,
  • Yong Xian

DOI
https://doi.org/10.1038/s41598-024-73381-0
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 14

Abstract

Due to the high cost of equipment and the constraints of shooting conditions, obtaining aerial infrared images of specific targets is very challenging. Most methods that use Generative Adversarial Networks to translate visible images into infrared images depend heavily on registered (paired) data and struggle to handle the diversity and complexity of aerial scenes. This paper proposes a one-sided, end-to-end unpaired aerial visible-to-infrared image translation algorithm, termed AerialIRGAN. AerialIRGAN introduces a dual-encoder structure: one encoder, designed around the Segment Anything Model, extracts deep semantic features from visible images, while the other, designed around UniRepLKNet, captures small-scale and sparse patterns. AerialIRGAN then constructs a bridging module to deeply integrate the features of both encoders and their corresponding decoders. Finally, AerialIRGAN proposes a structural-appearance consistency loss that guides the synthetic infrared images to preserve the structure of the source image while exhibiting distinct infrared characteristics. Experimental results show that, compared with existing typical infrared image generation algorithms, the proposed method generates higher-quality infrared images and achieves better performance in both subjective visual evaluation and objective metric evaluation.
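
The sketch below is only meant to illustrate the data flow the abstract describes (two encoder branches, a bridging module that fuses their features, a decoder producing the infrared image, and a structure-preserving loss). The layer choices, module names, and the gradient-based loss are assumptions for illustration; the paper's actual SAM-based and UniRepLKNet-based encoders are replaced by small convolutional stand-ins.

# Minimal sketch, assuming PyTorch; all architectural details are placeholders.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ConvEncoder(nn.Module):
    """Stand-in for either encoder branch (SAM-based or UniRepLKNet-based)."""
    def __init__(self, in_ch=3, feat_ch=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, feat_ch, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(feat_ch, feat_ch, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
    def forward(self, x):
        return self.net(x)

class BridgingModule(nn.Module):
    """Fuses the two encoder feature maps before decoding (hypothetical design)."""
    def __init__(self, feat_ch=64):
        super().__init__()
        self.fuse = nn.Conv2d(2 * feat_ch, feat_ch, 1)
    def forward(self, f_semantic, f_texture):
        return self.fuse(torch.cat([f_semantic, f_texture], dim=1))

class DualEncoderGenerator(nn.Module):
    """Visible-to-infrared generator: two encoders, a bridge, one decoder."""
    def __init__(self):
        super().__init__()
        self.enc_semantic = ConvEncoder()  # placeholder for the SAM-based encoder
        self.enc_texture = ConvEncoder()   # placeholder for the UniRepLKNet-based encoder
        self.bridge = BridgingModule()
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(32, 1, 4, stride=2, padding=1), nn.Tanh(),  # 1-channel IR output
        )
    def forward(self, visible):
        f_sem = self.enc_semantic(visible)
        f_tex = self.enc_texture(visible)
        return self.decoder(self.bridge(f_sem, f_tex))

def structural_appearance_consistency(visible, fake_ir):
    """Toy proxy for the structural-appearance consistency loss: match image
    gradients of the grayscale source and the synthetic IR image, so structure
    is preserved while the appearance (thermal radiometry) may change."""
    gray = visible.mean(dim=1, keepdim=True)
    def grads(img):
        return img[..., :, 1:] - img[..., :, :-1], img[..., 1:, :] - img[..., :-1, :]
    gx_v, gy_v = grads(gray)
    gx_f, gy_f = grads(fake_ir)
    return F.l1_loss(gx_f, gx_v) + F.l1_loss(gy_f, gy_v)

if __name__ == "__main__":
    g = DualEncoderGenerator()
    visible = torch.rand(2, 3, 128, 128)   # batch of visible aerial images
    fake_ir = g(visible)                   # synthetic infrared output
    loss = structural_appearance_consistency(visible, fake_ir)
    print(fake_ir.shape, loss.item())

In an adversarial training setup this structure term would be added to the usual GAN losses; the weighting and the exact form of the consistency term in AerialIRGAN are described in the paper itself, not here.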

Keywords