Fast Top-K Graph Similarity Search Via Representative Matrices

Zhigang Sun; Hongwei Huo; Xiaoyang Chen

doi:10.1109/ACCESS.2018.2819426

IEEE Access (Jan 2018)

Fast Top-K Graph Similarity Search Via Representative Matrices

Zhigang Sun,
Hongwei Huo,
Xiaoyang Chen

Affiliations

Zhigang Sun: Department of Computer Science, Xidian University, Xi’an, China
Hongwei Huo: ORCiD; Department of Computer Science, Xidian University, Xi’an, China
Xiaoyang Chen: Department of Computer Science, Xidian University, Xi’an, China

DOI: https://doi.org/10.1109/ACCESS.2018.2819426
Journal volume & issue: Vol. 6
pp. 21408 – 21417

Abstract

Read online

Graph similarity search is a crucial problem in many applications, such as cheminformatics, data mining, and pattern recognition. Top-k graph similarity search aims to find the most similar k graphs to a query graph in graph databases. In this paper, we present a fast top-k graph similarity search algorithm with high classification accuracy. We introduce a new graph similarity measure based upon the number of occurrences of subtree patterns in graphs. In order to accelerate search, we also construct hierarchical representative matrices for graph databases, where each row of the matrices represents a graph set. Using representative matrices, we can derive a similarity upper bound of a query graph and the graph set so as to reduce search space. Comprehensive experiments on real data sets demonstrate that our algorithm has a better performance than compared methods on classification accuracy and query time, and it also can scale to large data sets including 15 million chemical structure graphs.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords