Entropy (May 2022)

A Fast kNN Algorithm Using Multiple Space-Filling Curves

  • Konstantin Barkalov,
  • Anton Shtanyuk,
  • Alexander Sysoyev

DOI
https://doi.org/10.3390/e24060767
Journal volume & issue
Vol. 24, no. 6
p. 767

Abstract

The paper considers a time-efficient implementation of the k nearest neighbours (kNN) algorithm. A well-known approach for accelerating the kNN algorithm is to use dimensionality reduction based on space-filling curves. In this paper, we take this approach further and propose an algorithm that employs multiple space-filling curves and is faster, at comparable quality, than the kNN algorithm that uses kd-trees to determine the nearest neighbours. A specific method for constructing multiple Peano curves is outlined, and statements are given about how object proximity information is preserved under the dimensionality reduction. The algorithm was compared experimentally with known kd-tree-based kNN implementations on test and real-life data.
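
The general idea of searching for neighbours along several space-filling curves can be illustrated with a minimal Python sketch. It is not the authors' algorithm. It substitutes the simpler Z-order (Morton) curve for the Peano curves of the paper, and it obtains multiple curves by randomly shifting the data, which is an assumption made here for illustration only. The names morton_key and MultiCurveIndex, and the parameters bits, n_curves and window, are hypothetical. Points are assumed to lie in the unit hypercube [0, 1)^d.

```python
import bisect
import random


def morton_key(point, bits=10):
    """Map a point in [0, 1)^d to a 1-D key by interleaving coordinate bits."""
    top = (1 << bits) - 1
    cells = [min(int(x * (1 << bits)), top) for x in point]
    key = 0
    for b in range(bits - 1, -1, -1):      # most significant bit first
        for c in cells:
            key = (key << 1) | ((c >> b) & 1)
    return key


class MultiCurveIndex:
    """Approximate kNN index built from several shifted Z-order curves."""

    def __init__(self, points, n_curves=4, bits=10, seed=0):
        rng = random.Random(seed)
        self.points = points
        self.bits = bits
        dim = len(points[0])
        # Illustrative assumption: each curve sees the data under its own
        # random shift in [0, 0.5), so cell boundaries fall in different places.
        self.shifts = [[rng.random() * 0.5 for _ in range(dim)]
                       for _ in range(n_curves)]
        # One list per curve, holding (key, point index) pairs sorted by key.
        self.orders = [sorted((self._key(p, s), i) for i, p in enumerate(points))
                       for s in self.shifts]

    def _key(self, p, shift):
        # Halve the coordinates before shifting so the result stays in [0, 1).
        return morton_key([0.5 * x + s for x, s in zip(p, shift)], self.bits)

    def query(self, q, k=3, window=8):
        """Return indices of up to k candidates nearest to q (approximate)."""
        candidates = set()
        for shift, keyed in zip(self.shifts, self.orders):
            pos = bisect.bisect_left(keyed, (self._key(q, shift), -1))
            # Collect the neighbours of q along this curve.
            for _, i in keyed[max(0, pos - window):pos + window]:
                candidates.add(i)

        def dist2(i):
            return sum((a - b) ** 2 for a, b in zip(self.points[i], q))

        # Rank the merged candidates by true squared Euclidean distance.
        return sorted(candidates, key=dist2)[:k]
```

Each query costs one binary search per curve plus exact distance computations for at most 2 * window * n_curves candidates. Points that are adjacent in space but far apart on one curve have a chance of being close on another, which is the motivation for using several curves instead of one.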

Keywords