Electronics (Aug 2023)

Distributed Subgraph Query Processing Using Filtering Scores on Spark

  • Kyoungsoo Bok,
  • Minyoung Kim,
  • Hyeonbyeong Lee,
  • Dojin Choi,
  • Jongtae Lim,
  • Jaesoo Yoo

DOI
https://doi.org/10.3390/electronics12173645
Journal volume & issue
Vol. 12, no. 17
p. 3645

Abstract

Read online

As various services have been generating large-scale graphs to represent multiple relationships between objects, studies have been conducted to obtain subgraphs with particular patterns. In this paper, we propose a distributed query processing method to efficiently search a subgraph for a large graph on Spark. To reduce unnecessary processing costs, the search order is determined by filtering scores using the probability distribution. The partitioned queries are searched in parallel in the distributed graph of each slave node according to the search order, and the local search results obtained from each slave node are combined and returned. The query is partitioned in triplets based on the determined search order. The performance of the proposed method is compared with the performance of existing methods to demonstrate its superiority.

Keywords