Sistemas de Informação (Jun 2022)
Applying Data Mining for School Dropout Prediction in Higher Education at a Federal Institute for Education
Abstract
Data Mining is a process that seeks to extract useful knowledge and uncover patterns from data. School dropout is still one of the challenges to be tackled in the Higher Education environment. This paper presents a tool that uses a model that aims to predict a potential dropout of undergraduate students of a higher education institution using Machine Learning algorithms. In order to perform the predictions, we used the Decision Tree and Neural Network techniques, where the former achieved the best performance, with 84% precision and 87% accuracy in detecting dropout, while the second achieved 82% accuracy with 78% precision. Besides, given the data obtained from the institution, the most important features that help prediction school dropout are the average number of classes skipped in previous semester and the student’s age.