Data Science and Engineering (Feb 2024)

Leveraging Semantic Information for Enhanced Community Search in Heterogeneous Graphs

  • Yuqi Li,
  • Guosheng Zang,
  • Chunyao Song,
  • Xiaojie Yuan,
  • Tingjian Ge

DOI
https://doi.org/10.1007/s41019-024-00244-z
Journal volume & issue
Vol. 9, no. 2
pp. 220 – 237

Abstract

Read online

Abstract Community search (CS) is a vital research area in network science that focuses on discovering personalized communities for query vertices from graphs. However, existing CS methods mainly concentrate on homogeneous or simple attributed graphs, often disregarding complex semantic information and rich contents carried by entities in heterogeneous graphs (HGs). In this paper, we propose a novel problem, namely the “Semantic Network Oriented Community Search with Meta-Structures in Heterogeneous Graphs (SNCS),” which aims to find dense communities that contain the query vertex, with vertices of the same type sharing similar topics. In response to this new problem, we present a novel approach, also named SNCS, representing the first solution employing meta-structures and topic constraints to tackle community search, leveraging both topological and latent features. To overcome the high-time complexity challenge posed by searching through meta-structures, we introduce a unique graph reconstruction technique. Our proposed method’s superiority is validated through extensive evaluations on real-world datasets. The results demonstrate a significant improvement in the quality of the obtained communities, with increases of 3.5–4.4% in clustering coefficient and 5–11% in density while requiring only 4–46% of the running time when compared with the state-of-the-art methods.

Keywords