Jambura Journal of Mathematics (Aug 2023)

Comparison of the KNN, Naive Bayes, and Binomial Logistic Regression Methods in Classifying the Economic Status of Countries

  • N. K. Kutha Ardana,
  • Ruhiyat Ruhiyat,
  • Nurfatimah Amany,
  • Teofilus Kevin Irawan,
  • Raymond Raymond,
  • Rizalius Karunia,
  • Syifa Fauzia

DOI
https://doi.org/10.34312/jjom.v5i2.21103
Journal volume & issue
Vol. 5, no. 2

Abstract

Classifying a country's economic status as developed or developing often involves factors such as life expectancy and its underlying variables. This research compares the performance of three machine learning algorithms, namely KNN (K-Nearest Neighbors), naive Bayes, and binomial logistic regression, on this classification task. The dataset used is "Life Expectancy (WHO) Fixed," obtained from the Kaggle website. Principal Component Analysis (PCA) was first applied to 16 predictor variables, yielding three principal components that explain 71.41% of the variance. These components were then used as inputs to the KNN, naive Bayes, and binomial logistic regression methods, which produced F1-scores of 100%, 98.19%, and 97.36%, respectively.
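
Since the three methods are compared by F1-score, a minimal sketch of how that metric is computed for a binary developed/developing problem may help. The labels below are hypothetical toy values, not taken from the study's dataset, and treating "Developed" as the positive class is an assumption; the abstract does not state which class the authors used or how the score was averaged.

```python
def f1_score(y_true, y_pred, positive="Developed"):
    """F1 for one class: the harmonic mean of precision and recall."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)  # true positives
    fp = sum(1 for t, p in pairs if t != positive and p == positive)  # false positives
    fn = sum(1 for t, p in pairs if t == positive and p != positive)  # false negatives
    if tp == 0:
        return 0.0  # no correct positive predictions, so F1 is taken as 0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy labels (NOT the study's data).
y_true = ["Developed", "Developed", "Developing", "Developing", "Developing", "Developed"]
y_pred = ["Developed", "Developing", "Developing", "Developing", "Developed", "Developed"]

score = f1_score(y_true, y_pred)
print(f"F1 = {score:.4f}")
```

In this toy case there are two true positives, one false positive, and one false negative, so precision and recall are both 2/3 and the F1-score is also 2/3. An F1-score of 100% requires that a classifier make no false positive and no false negative predictions on the evaluated data.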

Keywords