PeerJ Computer Science (Dec 2023)

Predicting the final grade using a machine learning regression model: insights from fifty percent of total course grades in CS1 courses

  • Carlos Giovanny Hidalgo Suarez,
  • Jose Llanos,
  • Víctor A. Bucheli

DOI
https://doi.org/10.7717/peerj-cs.1689
Journal volume & issue
Vol. 9
p. e1689

Abstract

Read online Read online

This article introduces a model for accurately predicting students’ final grades in the CS1 course by utilizing their grades from the first half of the course. The methodology includes three phases: training, testing, and validation, employing four regression algorithms: AdaBoost, Random Forest, Support Vector Regression (SVR), and XGBoost. Notably, the SVR algorithm outperformed the others, achieving an impressive R-squared (R2) value ranging from 72% to 91%. The discussion section focuses on four crucial aspects: the selection of data features and the percentage of course grades used for training, the comparison between predicted and actual values to demonstrate reliability, and the model’s performance compared to existing literature models, highlighting its effectiveness.

Keywords