ICTACT Journal on Soft Computing (Jul 2021)

A COMPARISON OF MISSING DATA HANDLING TECHNIQUES

  • S David Samuel Azariya,
  • V Mohanraj,
  • J Jeba Emilyn,
  • G Jothi

DOI
https://doi.org/10.21917/ijsc.2021.0347
Journal volume & issue
Vol. 11, no. 4
pp. 2433 – 2437

Abstract

Read online

Missing data is a regular concern on data that professionals have to deal with. Efficient analysis techniques have to be followed to find interesting patterns. In this study, we are comparing 16 different imputation methods namely Linear, Index, Values, Nearest, Zero, slinear, Quadratic, Cubic, Barycentric, Krogh, Polynomial, Spline, Piecewise Polynomial, From derivatives, Pchip and Akima. These techniques are performed on real time UCI dataset and are under Missing Completely at a Random (MCAR) assumption, our result suggests the nearest, zero, quadratic and polynomial imputation methods which provides above 96% of accuracy when compared to the other techniques.

Keywords