Je-LKS: Journal of E-Learning and Knowledge Society (Aug 2022)
Educational data mining, student academic performance prediction, prediction methods, algorithms and tools: an overview of reviews
Abstract
This overview study set out to compare and synthesise the findings of review studies conducted on predicting student academic performance (SAP) in higher education using educational data mining (EDM) methods, EDM algorithms, and EDM tools from 2013 to September 2021. It conducted multiple searches for suitable and relevant peer-reviewed articles on two online search engines, on nine online databases, and on two online academic social networks. It, then, selected 33 eligible articles from 2,500 articles. Some of the findings of this overview study are worth mentioning. First, only 3 studies explicitly stated their precise sample sizes, and only 5 studies explicitly mentioned their subject areas with maths and science, and computer science and engineering as the four most mentioned subject areas. Second, 20 review studies had purposes related to either EDM techniques, EDM methods, EDM models, or EDM algorithms employed to predict SAP and student success in the higher education sector. Third, there are six commonly used typologies of input variables reported by 33 review studies, of which student demographics was the most commonly utilised variable for predicting SAP. Fourth and last, seven common EDM algorithms employed for predicting SAP were identified, of which Decision Tree emerged both as the most used algorithm and as the algorithm with the highest prediction accuracy rate for predicting SAP.
Keywords