PeerJ Computer Science (Mar 2024)

Exploiting nearest neighbor data and fuzzy membership function to address missing values in classification

  • Kurnia Muludi,
  • Revita Setianingsih,
  • Ridho Sholehurrohman,
  • Akmal Junaidi

DOI
https://doi.org/10.7717/peerj-cs.1968
Journal volume & issue
Vol. 10
p. e1968

Abstract

Read online Read online

The accuracy of most classification methods is significantly affected by missing values. Therefore, this study aimed to propose a data imputation method to handle missing values through the application of nearest neighbor data and fuzzy membership function as well as to compare the results with standard methods. A total of five datasets related to classification problems obtained from the UCI Machine Learning Repository were used. The results showed that the proposed method had higher accuracy than standard imputation methods. Moreover, triangular method performed better than Gaussian fuzzy membership function. This showed that the combination of nearest neighbor data and fuzzy membership function was more effective in handling missing values and improving classification accuracy.

Keywords