A COMPARISON OF TWO STRATEGIES FOR AVOIDING NEGATIVE TRANSFER IN DOMAIN ADAPTATION BASED ON LOGISTIC REGRESSION

A. Paul; K. Vogt; F. Rottensteiner; J. Ostermann; C. Heipke

doi:10.5194/isprs-archives-XLII-2-845-2018

The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences (May 2018)

A COMPARISON OF TWO STRATEGIES FOR AVOIDING NEGATIVE TRANSFER IN DOMAIN ADAPTATION BASED ON LOGISTIC REGRESSION

A. Paul,
K. Vogt,
F. Rottensteiner,
J. Ostermann,
C. Heipke

Affiliations

A. Paul: Institute of Photogrammetry and GeoInformation, Leibniz Universität Hannover, Germany
K. Vogt: Institut für Informationsverarbeitung, Leibniz Universität Hannover, Germany
F. Rottensteiner: Institute of Photogrammetry and GeoInformation, Leibniz Universität Hannover, Germany
J. Ostermann: Institut für Informationsverarbeitung, Leibniz Universität Hannover, Germany
C. Heipke: Institute of Photogrammetry and GeoInformation, Leibniz Universität Hannover, Germany

DOI: https://doi.org/10.5194/isprs-archives-XLII-2-845-2018
Journal volume & issue: Vol. XLII-2
pp. 845 – 852

Abstract

Read online

In this paper we deal with the problem of measuring the similarity between training and tests datasets in the context of transfer learning (TL) for image classification. TL tries to transfer knowledge from a source domain, where labelled training samples are abundant but the data may follow a different distribution, to a target domain, where labelled training samples are scarce or even unavailable, assuming that the domains are related. Thus, the requirements w.r.t. the availability of labelled training samples in the target domain are reduced. In particular, if no labelled target data are available, it is inherently difficult to find a robust measure of relatedness between the source and target domains. This is of crucial importance for the performance of TL, because the knowledge transfer between unrelated data may lead to negative transfer, i.e. to a decrease of classification performance after transfer. We address the problem of measuring the relatedness between source and target datasets and investigate three different strategies to predict and, consequently, to avoid negative transfer in this paper. The first strategy is based on circular validation. The second strategy relies on the Maximum Mean Discrepancy (MMD) similarity metric, whereas the third one is an extension of MMD which incorporates the knowledge about the class labels in the source domain. Our method is evaluated using two different benchmark datasets. The experiments highlight the strengths and weaknesses of the investigated methods. We also show that it is possible to reduce the amount of negative transfer using these strategies for a TL method and to generate a consistent performance improvement over the whole dataset.

Published in The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences

ISSN: 1682-1750 (Print); 2194-9034 (Online)
Publisher: Copernicus Publications
Country of publisher: Germany
LCC subjects: Technology: Engineering (General). Civil engineering (General): Applied optics. Photonics
Website: http://www.isprs.org/publications/archives.aspx

About the journal