Stats (Aug 2021)

Learning Time Acceleration in Support Vector Regression: A Case Study in Educational Data Mining

  • Jonatha Sousa Pimentel,
  • Raydonal Ospina,
  • Anderson Ara

DOI
https://doi.org/10.3390/stats4030041
Journal volume & issue
Vol. 4, no. 3
pp. 682 – 700

Abstract

Read online

The development of a country involves directly investing in the education of its citizens. Learning analytics/educational data mining (LA/EDM) allows access to big observational structured/unstructured data captured from educational settings and relies mostly on machine learning algorithms to extract useful information. Support vector regression (SVR) is a supervised statistical learning approach that allows modelling and predicts the performance tendency of students to direct strategic plans for the development of high-quality education. In Brazil, performance can be evaluated at the national level using the average grades of a student on their National High School Exams (ENEMs) based on their socioeconomic information and school records. In this paper, we focus on increasing the computational efficiency of SVR applied to ENEM for online requisitions. The results are based on an analysis of a massive data set composed of more than five million observations, and they also indicate computational learning time savings of more than 90%, as well as providing a prediction of performance that is compatible with traditional modeling.

Keywords