Revista Facultad de Ingeniería (Dec 2022)

Applying Predictive Data Mining to Discover Factors Associated to the Language Skill Performance from Elementary School Students

  • Ricardo Timarán-Pereira,
  • Javier Caicedo-Zambrano,
  • Andrea Timarán-Buchely

DOI
https://doi.org/10.19053/01211129.v31.n62.2022.14814
Journal volume & issue
Vol. 31, no. 62

Abstract

Read online

In this paper, predictive data mining techniques are applied to determine the academic performance from fifth grade students in the Saber 5° tests Language skill at Colombian elementary schools in 2017. We employed the CRISP-DM methodology. Socioeconomic, academic, and institutional information was available at the ICFES databases. A minable dataset was obtained using data cleaning and transformation techniques. A decision tree was built with the Weka tool J48 algorithm. Some of the predictors of the discovered patterns are the nature and location of the school, whether or not students failed a school year, the age group, the mother's educational attainment, and the rates of ICTs and household appliances. The findings of this research serve as quality information for the decision-making at the Ministry of National Education (MEN) and the secretaries of education, and for the directors of elementary educational institutions to define improvement plans that result in the quality of elementary school education in Colombia.

Keywords