Performance Prediction of Students in Higher Education Using Multi-Model Ensemble Approach

Naveed Anwer Butt; Zafar Mahmood; Khawar Shakeel; Sultan Alfarhood; Mejdl Safran; Imran Ashraf

doi:10.1109/ACCESS.2023.3336987

IEEE Access (Jan 2023)

Performance Prediction of Students in Higher Education Using Multi-Model Ensemble Approach

Naveed Anwer Butt,
Zafar Mahmood,
Khawar Shakeel,
Sultan Alfarhood,
Mejdl Safran,
Imran Ashraf

Affiliations

Naveed Anwer Butt: ORCiD; Department of Computer Science, University of Gujrat, Gujrat, Pakistan
Zafar Mahmood: Department of Computer Science, University of Gujrat, Gujrat, Pakistan
Khawar Shakeel: Department of Computer Science, University of Gujrat, Gujrat, Pakistan
Sultan Alfarhood: ORCiD; Department of Computer Science, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia
Mejdl Safran: ORCiD; Department of Computer Science, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia
Imran Ashraf: ORCiD; Department of Information and Communication Engineering, Yeungnam University, Gyeongsan, Republic of Korea

DOI: https://doi.org/10.1109/ACCESS.2023.3336987
Journal volume & issue: Vol. 11
pp. 136091 – 136108

Abstract

Read online

Many stakeholders including students, teachers, and educational institutions, benefit from accurately predicting student performance and facilitating data-driven policies. In this field, providing users with accurate and understandable predictions is challenging, but equally important. The goals of this study are multifaceted: to identify students at-risk; to identify differences in assessment across different environments; methods for assessing students; and to determine the relationship between teacher employment status and student achievement. This study performs an empirical comparison of the performance and efficiency of ensemble classification methods based on bagging, boosting, stacking, and voting for successful predictions. An ensemble model is developed and validated using double, triple, and quadruple combinations of classification algorithms using Naive Bayes, J48 decision trees, Adaboost, logistics, and multilayer perceptron. This study uses primary quantitative data from the learning management system of a university in Pakistan to analyze the performance of these models. The boosted tree detection method outperforms bagged trees when the standard deviation is higher and the data size is large, while stacking is best for smaller datasets. Based on behavioral analysis results of students, academic advice can be given for selected case studies. These will help educational administrators and policymakers working in education to introduce new policies and curricula accordingly.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords