IEEE Access (Jan 2021)

<italic>B</italic>&#x002B;-Tree Based Multi-Keyword Ranked Similarity Search Scheme Over Encrypted Cloud Data

  • Huanglin Shen,
  • Linlin Xue,
  • Haijiang Wang,
  • Lei Zhang,
  • Jinying Zhang

DOI
https://doi.org/10.1109/ACCESS.2021.3125729
Journal volume & issue
Vol. 9
pp. 150865 – 150877

Abstract

Read online

With the sustained evolution and expeditious popularization of cloud computing, an ever-increasing number of individuals and enterprises are encouraged to outsource data to cloud servers for reducing management overhead and ease of access. Privacy requirements demand encryption of sensitive information before outsourcing, which, on the other hand, diminishes the usability of data and makes considerable efficient keyword search techniques used on plaintext inapplicable. In this paper, we propose a secure multi-keyword ranked search scheme based on document similarity to work out the problem. In order to achieve the goals of multi-keyword search and ranking search results, we adopt the vector space model and TF-IDF model to generate index and query vectors. By introducing the secure kNN computation, index and query vectors can be encrypted to prevent cloud servers from obtaining sensitive frequency information. For the need of efficiency advancement, we adopt the $B^{+}$ -tree as the basic structure to build the index and construct a similar document collection for each document. Due to the use of our unique index structure, compared to linear search, the search efficiency is more exceptional. Extensive experiments on the real-world document collection are conducted to demonstrate the feasibility and efficiency of the proposed solution.

Keywords