International Journal of Computational Intelligence Systems (Dec 2015)

Improving Meta-learning for Algorithm Selection by Using Multi-label Classification: A Case of Study with Educational Data Sets

  • Juan Luis Olmo,
  • Cristóbal Romero,
  • Eva Gibaja,
  • Sebastián Ventura

DOI
https://doi.org/10.1080/18756891.2015.1113748
Journal volume & issue
Vol. 8, no. 6

Abstract

Read online

Recommending classification algorithms is an open research problem the solution to which is of tremendous value for practitioners and non-experts data mining users such as educators. This paper proposes a new meta-learning framework for educational domains based on the use of multi-label learning for selecting the best classification algorithms in order to predict students’ performance. In short, the framework considers an offline phase where statistical tests are performed to find the subset of algorithms that achieves the best performance over the repository of educational data sets. The subset of algorithms along with the meta-features extracted from the training data are used to generate a multi-label data set. A multi-label classifier is then trained and, in an online phase, this model is used to recommend the most suitable classification algorithms to be applied to new unseen data sets. This new multi-label meta-learning approach has been applied to a repository of educational data sets generated from Moodle usage data. The results obtained show significant improvement compared with a previous nearest neighbor proposal, demonstrating the suitability of the new framework.

Keywords