Journal of Big Data (May 2024)

Skyline query under multidimensional incomplete data based on classification tree

  • Dengke Yuan,
  • Liping Zhang,
  • Song Li,
  • Guanglu Sun

DOI
https://doi.org/10.1186/s40537-024-00923-8
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 24

Abstract

Read online

Abstract A method for skyline query of multidimensional incomplete data based on a classification tree has been proposed to address the problem of a large amount of useless data in existing skyline queries with multidimensional incomplete data, which leads to low query efficiency and algorithm performance. This method consists of two main parts. The first part is the proposed incomplete data weighted classification tree algorithm. In the first part, an incomplete data weighted classification tree is proposed, and the incomplete data set is classified using this tree. The data classified in the first part serves as the basis for the second step of the query. The second part proposes a skyline query algorithm for multidimensional incomplete data. The concept of optimal virtual points has been recently introduced, effectively reducing the number of comparisons of a large amount of data, thereby improving the query efficiency for incomplete data. Theoretical research and experimental analysis have shown that the proposed method can perform skyline queries for multidimensional incomplete data well, with high query efficiency and accuracy of the algorithm.

Keywords