International Journal of Applied Earth Observation and Geoinformation (Dec 2022)

MCRN: A Multi-source Cross-modal Retrieval Network for remote sensing

  • Zhiqiang Yuan,
  • Wenkai Zhang,
  • Changyuan Tian,
  • Yongqiang Mao,
  • Ruixue Zhou,
  • Hongqi Wang,
  • Kun Fu,
  • Xian Sun

Journal volume & issue
Vol. 115
p. 103071

Abstract


Remote sensing cross-modal retrieval (RSCR) is increasingly important because it enables valuable data to be retrieved quickly and flexibly from enormous collections of remote sensing (RS) images. However, traditional RSCR methods tend to focus on retrieval between two modalities; as the number of modalities increases, the contradiction between the growing semantic gap and the small amount of paired data causes the model to fail to learn a superior modal representation. In this paper, inspired by the visual-based modal center in RS, we construct a multi-source cross-modal retrieval network (MCRN) that unifies RS retrieval tasks across multiple retrieval sources. To address the data heterogeneity caused by multiple data sources, we propose a shared pattern transfer module (SPTM) based on pattern memory and combine it with generative adversarial theory to obtain semantic representations that are unbound from modality. Simultaneously, to cope with the lack of annotated data in RS scenarios, multiple unimodal self-supervised frameworks are unified to obtain robust pre-training parameters for the designed MCRN by combining domain alignment and contrastive learning. Finally, we propose the multi-source triplet loss, the unimodal contrastive loss, and the semantic consistency loss, which effectively enable MCRN to achieve competitive results through multitask learning for semantic alignment. We construct the multimodal datasets M-RSICD and M-RSITMD, conduct extensive experiments, and provide a complete benchmark to facilitate the development of RS multi-source cross-modal retrieval. The code of the MCRN method and the proposed datasets are openly available at [Link].
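For illustration only, the sketch below shows one way a multi-source triplet loss, a unimodal contrastive loss, and a semantic consistency loss could be combined into a single multitask objective as the abstract describes. It is not the authors' implementation: the function name, loss formulations, weights, and hyperparameters are assumptions made for this example.

```python
import torch
import torch.nn.functional as F

def multitask_loss(anchor, positive, negative,        # cross-modal embeddings
                   aug_view_a, aug_view_b,            # two augmented unimodal views
                   pred_logits, target_logits,        # semantic predictions
                   margin=0.2, temperature=0.07,
                   w_triplet=1.0, w_contrast=1.0, w_consist=1.0):
    """Hypothetical combination of triplet, contrastive, and consistency losses."""
    # Triplet term: pull the anchor toward a positive sample from another
    # source and push it away from a negative one (cosine distance).
    d_pos = 1.0 - F.cosine_similarity(anchor, positive)
    d_neg = 1.0 - F.cosine_similarity(anchor, negative)
    l_triplet = F.relu(d_pos - d_neg + margin).mean()

    # Unimodal contrastive term (InfoNCE) between two augmented views of
    # the same unimodal batch; diagonal entries are the matching pairs.
    za = F.normalize(aug_view_a, dim=-1)
    zb = F.normalize(aug_view_b, dim=-1)
    logits = za @ zb.t() / temperature
    labels = torch.arange(za.size(0), device=za.device)
    l_contrast = F.cross_entropy(logits, labels)

    # Semantic consistency term: KL divergence between the predicted
    # semantic distribution and a reference distribution.
    l_consist = F.kl_div(F.log_softmax(pred_logits, dim=-1),
                         F.softmax(target_logits, dim=-1),
                         reduction='batchmean')

    # Weighted sum for multitask optimization.
    return w_triplet * l_triplet + w_contrast * l_contrast + w_consist * l_consist
```

The weighted-sum form mirrors common practice in multitask semantic alignment; the actual MCRN losses and their weighting are defined in the paper and released code.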

Keywords