Sensors (Feb 2022)
Exhaustive Search of Correspondences between Multimodal Remote Sensing Images Using Convolutional Neural Network
Abstract
Finding putative correspondences between a pair of images is an important prerequisite for image registration. In complex cases such as multimodal registration, a true match can be less plausible than a false match within a search zone. Under these conditions, it is important to detect all plausible matches. This can be achieved by an exhaustive search using a handcrafted similarity measure (SM, e.g., mutual information). It is promising to replace handcrafted SMs with learned ones, which offer better performance. However, the latter are not designed for an exhaustive search of all matches but for finding the most plausible one. In this paper, we propose a deep-learning-based solution for exhaustive multiple-match search between two images within a predefined search area. We design a computationally efficient convolutional neural network (CNN) that takes as input a template fragment from one image and a search fragment from another image, and produces an SM map covering the entire search area in the spatial dimensions. This SM map reveals multiple plausible matches, locates each match with subpixel accuracy and provides a covariance matrix of localization errors for each match. The proposed CNN is trained with a specially designed loss function that enforces the translation and rotation invariance of the SM map and enables the detection of matches that have no associated ground truth data (e.g., multiple matches for repetitive textures). We validate the approach on multimodal remote sensing images and show that the proposed “area” SM performs better than a “point” SM.
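To make the workflow concrete, the following is a minimal sketch of the two ingredients the abstract describes: a dense SM map evaluated at every offset of a template within a search area, and subpixel refinement of each peak. The learned CNN similarity is mocked here with zero-mean normalized cross-correlation, and the function names (`sm_map`, `subpixel_peak`) are illustrative placeholders, not the paper's implementation.

```python
import numpy as np

def sm_map(template, search):
    """Dense similarity-measure (SM) map: zero-mean NCC of `template`
    at every valid offset inside `search` (handcrafted stand-in for
    the learned CNN similarity)."""
    th, tw = template.shape
    sh, sw = search.shape
    t = template - template.mean()
    tn = np.sqrt((t ** 2).sum())
    out = np.empty((sh - th + 1, sw - tw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            w = search[i:i + th, j:j + tw]
            w = w - w.mean()
            wn = np.sqrt((w ** 2).sum())
            out[i, j] = (t * w).sum() / (tn * wn + 1e-12)
    return out

def subpixel_peak(sm, i, j):
    """Refine an integer SM-map peak (i, j) to subpixel accuracy by
    fitting a 1-D parabola through the three samples around the peak
    along each axis."""
    di = dj = 0.0
    if 0 < i < sm.shape[0] - 1:
        a, b, c = sm[i - 1, j], sm[i, j], sm[i + 1, j]
        denom = a - 2 * b + c
        if denom != 0:
            di = 0.5 * (a - c) / denom
    if 0 < j < sm.shape[1] - 1:
        a, b, c = sm[i, j - 1], sm[i, j], sm[i, j + 1]
        denom = a - 2 * b + c
        if denom != 0:
            dj = 0.5 * (a - c) / denom
    return i + di, j + dj
```

Because the map covers the whole search area, every local maximum above a threshold can be kept as a plausible match, rather than only the global one; the paper's contribution is replacing the NCC above with a CNN-produced map that also outputs a localization-error covariance per match.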
Keywords