IEEE Access (Jan 2019)

A Novel Hyperparameter-Free Approach to Decision Tree Construction That Avoids Overfitting by Design

  • Rafael Garcia Leiva,
  • Antonio Fernandez Anta,
  • Vincenzo Mancuso,
  • Paolo Casari

DOI
https://doi.org/10.1109/ACCESS.2019.2930235
Journal volume & issue
Vol. 7
pp. 99978 – 99987

Abstract

Read online

Decision trees are an extremely popular machine learning technique. Unfortunately, overfitting in decision trees still remains an open issue that sometimes prevents achieving good performance. In this paper, we present a novel approach for the construction of decision trees that avoids the overfitting by design, without losing accuracy. A distinctive feature of our algorithm is that it requires neither the optimization of any hyperparameters, nor the use of regularization techniques, thus significantly reducing the decision tree training time. Moreover, our algorithm produces much smaller and shallower trees than traditional algorithms, facilitating the interpretability of the resulting models. For reproducibility, we provide an open source version of the algorithm.

Keywords