PeerJ Computer Science (Oct 2023)

Early prediction of student performance in CS1 programming courses

  • Jose Llanos,
  • Víctor A. Bucheli,
  • Felipe Restrepo-Calle

DOI
https://doi.org/10.7717/peerj-cs.1655
Journal volume & issue
Vol. 9
p. e1655

Abstract

Read online Read online

There is a high failure rate and low academic performance observed in programming courses. To address these issues, it is crucial to predict student performance at an early stage. This allows teachers to provide timely support and interventions to help students achieve their learning objectives. The prediction of student performance has gained significant attention, with researchers focusing on machine learning features and algorithms to improve predictions. This article proposes a model for predicting student performance in a 16-week CS1 programming course, specifically in weeks 3, 5, and 7. The model utilizes three key factors: grades, delivery time, and the number of attempts made by students in programming labs and an exam. Eight classification algorithms were employed to train and evaluate the model, with performance assessed using metrics such as accuracy, recall, F1 score, and AUC. In week 3, the gradient boosting classifier (GBC) achieved the best results with an F1 score of 86%, followed closely by the random forest classifier (RFC) with 83%. These findings demonstrate the potential of the proposed model in accurately predicting student performance.

Keywords