Kmeans-SMOTE Integration for Handling Imbalance Data in Classifying Financial Distress Companies using SVM and Naïve Bayes

Didit Johar Maulana; Siti Saadah; Prasti Eko Yunanto

doi:10.29207/resti.v8i1.5140

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) (Feb 2024)

Kmeans-SMOTE Integration for Handling Imbalance Data in Classifying Financial Distress Companies using SVM and Naïve Bayes

Didit Johar Maulana,
Siti Saadah,
Prasti Eko Yunanto

Affiliations

Didit Johar Maulana: Telkom University
Siti Saadah: Telkom University
Prasti Eko Yunanto: Telkom University

DOI: https://doi.org/10.29207/resti.v8i1.5140
Journal volume & issue: Vol. 8, no. 1
pp. 54 – 61

Abstract

Read online

Imbalanced data presents significant challenges in machine learning, leading to biased classification outcomes that favor the majority class. This issue is especially pronounced in the classification of financial distress, where data imbalance is common due to the scarcity of such instances in real-world datasets. This study aims to mitigate data imbalance in financial distress companies using the Kmeans-SMOTE method by combining Kmeans clustering and the synthetic minority oversampling technique (SMOTE). Various classification approaches, including Nave Bayes and support vector machine (SVM), are implemented on a Kaggle financial distress data set to evaluate the effectiveness of Kmeans-SMOTE. Experimental results show that SVM outperforms Nave Bayes with impressive accuracy (99.1%), f1-score (99.1%), area under precision recall (AUPRC) (99.1%), and geometric mean (Gmean) (98.1%). On the basis of these results, Kmeans-SMOTE can balance the data effectively, leading to a quite significant improvement in performance.

Published in Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)

ISSN: 2580-0760 (Online)
Publisher: Ikatan Ahli Informatika Indonesia
Country of publisher: Indonesia
LCC subjects: Technology: Engineering (General). Civil engineering (General): Systems engineering; Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: http://jurnal.iaii.or.id/index.php/RESTI

About the journal

Abstract

Keywords