Tehnički Vjesnik (Jan 2018)

Keyword Search in Large-Scale Databases with Topic Cluster Units

  • Yingqi Wang,
  • Nianbin Wang,
  • Lianke Zhou

DOI
https://doi.org/10.17559/TV-20160419053402
Journal volume & issue
Vol. 25, no. 3
pp. 748 – 758

Abstract

Read online

To solve the inefficiency of the existing keyword search methods in large databases, this paper proposes TCU-based query, an offline query method based on topic cluster units. First, topic cluster units (TCUs) are constructed through vertical grouping and horizontal grouping on tables and tuples. In contrast to traditional keyword query methods, this offline method cannot only reduce the query response time, but also return results comprising richer and more complete semantic information. In order to further improve the efficiency of data preprocessing, an optimized solution for table join ordering based on the genetic algorithm is presented. Second, we select index terms using the association rule, and then we build an index on every topic cluster; by doing so we can improve the query speed significantly. Finally, we conduct extensive experiments to demonstrate that our approach greatly improves the performance of keyword search.

Keywords