Applied Artificial Intelligence (Nov 2017)

A Meta-analysis on Classification Model Performance in Real-World Datasets: An Exploratory View

  • David Gómez Guillén,
  • Alfonso Rojas Espinosa

DOI
https://doi.org/10.1080/08839514.2018.1430993
Journal volume & issue
Vol. 31, no. 9-10
pp. 715 – 732

Abstract

Read online

The No Free Lunch (NFL) Theorem imposes a theoretical restriction on optimization algorithms and their equal average performance on different problems, under some particular assumptions. Nevertheless, when brought into practice, a perceived “ranking” on the performance is usually perceived by engineers developing machine learning applications. Questions that naturally arise are what kinds of biases the real world has and in which ways can we take advantage from them. Using exploratory data analysis (EDA) on classification examples, we gather insight on some traits that set apart algorithms, datasets and evaluation measures and to what extent the NFL theorem, a theoretical result, applies under typical real-world constraints.