IMAGE-TO-IMAGE TRANSLATION FOR ENHANCED FEATURE MATCHING, IMAGE RETRIEVAL AND VISUAL LOCALIZATION

M. S. Mueller; T. Sattler; M. Pollefeys; B. Jutzi

doi:10.5194/isprs-annals-IV-2-W7-111-2019

ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences (Sep 2019)

Affiliations

M. S. Mueller: Institute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology, Germany
T. Sattler: Department of Electrical Engineering, Chalmers University of Technology, Sweden
M. Pollefeys: Department of Computer Science, ETH Zurich, Switzerland; Microsoft
B. Jutzi: Institute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology, Germany

Journal volume & issue: Vol. IV-2-W7
pp. 111–119

Abstract

The performance of machine learning and deep learning algorithms for image analysis depends significantly on the quantity and quality of the training data. Generating annotated training data is often costly, time-consuming and laborious. Data augmentation is a powerful option to overcome these drawbacks. We therefore augment training data by rendering images with arbitrary poses from 3D models to increase the quantity of training images. These rendered images usually show artifacts and are of limited use for advanced image analysis. To address this, we propose to use image-to-image translation to transform images from a rendered domain to a captured domain. We show that translated images in the captured domain are of higher quality than the rendered images. Moreover, we demonstrate that image-to-image translation based on rendered 3D models enhances the performance of common computer vision tasks, namely feature matching, image retrieval and visual localization. The experimental results clearly show an improvement of translated images over rendered images for all investigated tasks. In addition, we present the advantages of using translated images over exclusively captured images for visual localization.

Published in ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences

ISSN: 2194-9042 (Print); 2194-9050 (Online)
Publisher: Copernicus Publications
Country of publisher: Germany
LCC subjects: Technology: Engineering (General). Civil engineering (General): Applied optics. Photonics
Website: http://www.isprs.org/publications/annals.aspx
