F1000Research (Oct 2016)

Understanding covariate shift in model performance [version 3; referees: 2 approved]

  • Georgia McGaughey,
  • W. Patrick Walters,
  • Brian Goldman

DOI
https://doi.org/10.12688/f1000research.8317.3
Journal volume & issue
Vol. 5

Abstract

Three methods (logistic regression, covariate shift, and k-NN) were applied to five internal datasets and one external, publicly available dataset in which covariate shift existed. In all cases, k-NN performed worse than both logistic regression and covariate shift. Surprisingly, using covariate shift to reweight the training data offered no obvious advantage in the examined datasets.

Keywords