ResiDualGAN: Resize-Residual DualGAN for Cross-Domain Remote Sensing Images Semantic Segmentation

Yang Zhao; Peng Guo; Zihao Sun; Xiuwan Chen; Han Gao

doi:10.3390/rs15051428

Remote Sensing (Mar 2023)

ResiDualGAN: Resize-Residual DualGAN for Cross-Domain Remote Sensing Images Semantic Segmentation

Yang Zhao,
Peng Guo,
Zihao Sun,
Xiuwan Chen,
Han Gao

Affiliations

Yang Zhao: Institute of Remote Sensing and Geographic Information System, Peking University, Beijing 100871, China
Peng Guo: Institute of Remote Sensing and Geographic Information System, Peking University, Beijing 100871, China
Zihao Sun: Institute of Remote Sensing and Geographic Information System, Peking University, Beijing 100871, China
Xiuwan Chen: Institute of Remote Sensing and Geographic Information System, Peking University, Beijing 100871, China
Han Gao: Institute of Remote Sensing and Geographic Information System, Peking University, Beijing 100871, China

DOI: https://doi.org/10.3390/rs15051428
Journal volume & issue: Vol. 15, no. 5
p. 1428

Abstract

Read online

The performance of a semantic segmentation model for remote sensing (RS) images pre-trained on an annotated dataset greatly decreases when testing on another unannotated dataset because of the domain gap. Adversarial generative methods, e.g., DualGAN, are utilized for unpaired image-to-image translation to minimize the pixel-level domain gap, which is one of the common approaches for unsupervised domain adaptation (UDA). However, the existing image translation methods face two problems when performing RS image translation: (1) ignoring the scale discrepancy between two RS datasets, which greatly affects the accuracy performance of scale-invariant objects; (2) ignoring the characteristic of real-to-real translation of RS images, which brings an unstable factor for the training of the models. In this paper, ResiDualGAN is proposed for RS image translation, where an in-network resizer module is used for addressing the scale discrepancy of RS datasets and a residual connection is used for strengthening the stability of real-to-real images translation and improving the performance in cross-domain semantic segmentation tasks. Combined with an output space adaptation method, the proposed method greatly improves the accuracy performance on common benchmarks, which demonstrates the superiority and reliability of ResiDualGAN. At the end of the paper, a thorough discussion is conducted to provide a reasonable explanation for the improvement of ResiDualGAN. Our source code is also available.

Published in Remote Sensing

ISSN: 2072-4292 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science
Website: http://www.mdpi.com/journal/remotesensing/

About the journal

Abstract

Keywords