Knowledge (Oct 2024)

Exploiting the Regularized Greedy Forest Algorithm Through Active Learning for Predicting Student Grades: A Case Study

  • Maria Tsiakmaki,
  • Georgios Kostopoulos,
  • Sotiris Kotsiantis

DOI
https://doi.org/10.3390/knowledge4040028
Journal volume & issue
Vol. 4, no. 4
pp. 543 – 556

Abstract

Read online

Student performance prediction is a critical research challenge in the field of educational data mining. To address this issue, various machine learning methods have been employed with significant success, including instance-based algorithms, decision trees, neural networks, and ensemble methods, among others. In this study, we introduce an innovative approach that leverages the Regularized Greedy Forest (RGF) algorithm within an active learning framework to enhance student performance prediction. Active learning is a powerful paradigm that utilizes both labeled and unlabeled data, while RGF serves as an effective decision forest learning algorithm acting as the base learner. This synergy aims to improve the predictive performance of the model while minimizing the labeling effort, making the approach both efficient and scalable. Moreover, applying the active learning framework for predicting student performance focuses on the early and accurate identification of students at risk of failure. This enables targeted interventions and personalized learning strategies to support low-performing students and improve their outcomes. The experimental results demonstrate the potential of our proposed approach as it outperforms well-established supervised methods using a limited pool of labeled examples, achieving an accuracy of 81.60%.

Keywords