Comparative Study of Predictive Classification Models on Data with Severely Imbalanced Predictors

Embay Rohaeti; Ani Andriyati

doi:10.30595/juita.v12i1.21491

Jurnal Informatika (May 2024)

Affiliations

Embay Rohaeti: Department of Mathematics, Pakuan University
Ani Andriyati: Department of Mathematics, Pakuan University

DOI: https://doi.org/10.30595/juita.v12i1.21491
Journal volume & issue: Vol. 12, no. 1
Pages: 121–129

Abstract

Analysing pre-COVID-19 unemployment in West Java is vital for understanding and addressing Indonesia’s economic challenges, both because the region has a high unemployment rate and because understanding pre-pandemic unemployment patterns has become more relevant during the country’s post-pandemic recovery. This study evaluates four machine learning models (Random Forest, Linear SVM, RBF SVM, and Polynomial SVM) for classifying employment status from demographic and job-related variables. The objective is to identify the most suitable model given the imbalanced nature of the case-study data. The data come from the August 2019 National Labor Force Survey (SAKERNAS) and comprise 54,429 respondents across districts in West Java. The four models are evaluated using stratified holdout validation with a 70:30 split, repeated 100 times. Results indicate that the random forest model outperforms the others in balanced accuracy, F1-score, and computational time. The random forest model also highlights the importance of gender and age in classifying employment status in West Java, suggesting a need for targeted interventions, especially for women and individuals of productive age.
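
The evaluation protocol described in the abstract (a stratified 70:30 holdout repeated 100 times, scored with balanced accuracy and F1) can be sketched in a few lines. This is a minimal, model-agnostic illustration using only the Python standard library, not the authors' code. The labels are synthetic rather than SAKERNAS data, the positive class is assumed to be the minority class, and `majority_baseline` is a hypothetical stand-in for the Random Forest and SVM models. All function names are assumptions made for this example.

```python
import random
from collections import Counter, defaultdict


def stratified_holdout(labels, test_fraction=0.3, rng=random):
    """Split indices into train/test sets, preserving each class's share."""
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)
    train_idx, test_idx = [], []
    for indices in by_class.values():
        shuffled = indices[:]
        rng.shuffle(shuffled)
        n_test = round(len(shuffled) * test_fraction)
        test_idx.extend(shuffled[:n_test])
        train_idx.extend(shuffled[n_test:])
    return train_idx, test_idx


def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) for a binary problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    return tp, fp, fn, tn


def balanced_accuracy(y_true, y_pred, positive=1):
    """Mean of sensitivity and specificity."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    return (sensitivity + specificity) / 2


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    tp, fp, fn, _ = confusion_counts(y_true, y_pred, positive)
    denominator = 2 * tp + fp + fn
    return 2 * tp / denominator if denominator else 0.0


def majority_baseline(train_labels, n_test):
    """Hypothetical stand-in model: always predict the most common training label."""
    most_common = Counter(train_labels).most_common(1)[0][0]
    return [most_common] * n_test


def repeated_holdout(labels, repeats=100, seed=0):
    """Repeat the stratified 70:30 holdout and collect (balanced accuracy, F1)."""
    rng = random.Random(seed)
    scores = []
    for _ in range(repeats):
        train_idx, test_idx = stratified_holdout(labels, 0.3, rng)
        y_train = [labels[i] for i in train_idx]
        y_test = [labels[i] for i in test_idx]
        y_pred = majority_baseline(y_train, len(y_test))
        scores.append((balanced_accuracy(y_test, y_pred), f1_score(y_test, y_pred)))
    return scores


# Synthetic labels (NOT the SAKERNAS data): 950 of class 0, 50 of class 1.
labels = [0] * 950 + [1] * 50
scores = repeated_holdout(labels)
mean_bal_acc = sum(s[0] for s in scores) / len(scores)
mean_f1 = sum(s[1] for s in scores) / len(scores)
print(mean_bal_acc, mean_f1)  # prints "0.5 0.0"
```

On these synthetic labels the baseline is right on 95% of test cases yet scores 0.5 in balanced accuracy and 0.0 in F1, which illustrates why those two metrics, rather than plain accuracy, are the natural choice for comparing models on imbalanced data.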

Keywords: unemployment, random forest, linear SVM, RBF SVM, polynomial SVM

Published in Jurnal Informatika

ISSN: 2086-9398 (Print); 2579-8901 (Online)
Publisher: Universitas Muhammadiyah Purwokerto
Country of publisher: Indonesia
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://jurnalnasional.ump.ac.id/index.php/JUITA/
