LK-Index: A Learned Index for KNN Queries

Yongxin Peng

doi:10.1109/ACCESS.2024.3433524

IEEE Access (Jan 2024)

LK-Index: A Learned Index for KNN Queries

Yongxin Peng

Affiliations

Yongxin Peng: ORCiD; School of Mathematics and Computer Application, Shangluo University, Shangluo, China

DOI: https://doi.org/10.1109/ACCESS.2024.3433524
Journal volume & issue: Vol. 12
pp. 103096 – 103103

Abstract

Read online

The k-Nearest Neighbor (kNN) search is a crucial problem in database and data mining, especially in high-dimensional space. However, traditional kNN algorithms based on distance metrics and brute-force search often have low search efficiency and accuracy, and high computational complexity when dealing with large-scale high-dimensional datasets. These limitations have made them a bottleneck in practical applications. Inspired by the recently learned index that replaces B-tree with machine learning models, I propose a method for kNN search based on a learned index, named LK-index. Initially, a conventional tree-based index is created to process queries. The tree index is then utilized to find the k-nearest neighbor points, and the neural network is trained to act as a learned index. Finally, the actual k-nearest neighbors are obtained by computing the potential k-nearest neighbor points. Experiments conducted on synthetic and real-world datasets demonstrate that the LK-index yields certain advantages regarding search accuracy and time consumption.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords