Efficient <italic>(k, &#x03B1;)</italic>-Maximal-Cliques Enumeration Over Uncertain Graphs

Jing Bai; Junfeng Zhou; Ming Du; Peng Zhong

doi:10.1109/ACCESS.2021.3125198

IEEE Access (Jan 2021)

Efficient <italic>(k, α)</italic>-Maximal-Cliques Enumeration Over Uncertain Graphs

Jing Bai,
Junfeng Zhou,
Ming Du,
Peng Zhong

Affiliations

Jing Bai: ORCiD; School of Computer Science and Technology, Donghua University, Shanghai, China
Junfeng Zhou: School of Computer Science and Technology, Donghua University, Shanghai, China
Ming Du: ORCiD; School of Computer Science and Technology, Donghua University, Shanghai, China
Peng Zhong: School of Computer Science and Technology, Donghua University, Shanghai, China

DOI: https://doi.org/10.1109/ACCESS.2021.3125198
Journal volume & issue: Vol. 9
pp. 149338 – 149348

Abstract

Read online

A maximal clique ( ${MC}$ ) is a complete subgraph satisfying that no other cliques can take it as their proper subgraph. Given an uncertain graph, the $top\text{-}K\,\,{MC}\text{s}$ enumeration problem studies how to return $k {MC}\text{s}$ with the highest rank value. Existing algorithms rank ${MC}\text{s}$ according to their probabilities, thus usually return ${MC}\text{s}$ with higher probabilities but less number of vertices, and fail to return large ${MC}\text{s}$ that convey more useful information. Considering this problem, this paper studies the problem of enumerating $top\text{-}K\,\,{MC}\text{s}$ . Our approach returns $k {MC}\text{s}$ with the most number of vertices satisfying that their probabilities $\geq \alpha $ , where each ${MC}$ is called an $\alpha $ - ${MC}$ , and computing $k$ largest $\alpha $ - ${MC}\text{s}$ is called as $(k,\alpha)$ - ${MC}\text{s}$ . We propose an efficient $(k,\alpha)$ - ${MC}\text{s}$ enumeration algorithm, $Top\text{-}KMC$ , which works in three steps, including partition, enumeration and verification. Here, partition means that we compute the set $\mathcal {M}$ of all ${MC}\text{s}$ without considering the probability information, as if the graph is partitioned into a set of subgraphs. Enumeration means that we compute $\alpha $ - ${MC}\text{s}$ from each ${MC}$ of $\mathcal {M}$ . As each such subgraph is an ${MC}$ , the cost of computing common neighbors for finding $\alpha $ - ${MC}\text{s}$ can be reduced. Verification means that we need to verify whether an $\alpha $ - ${MC}$ is a subgraph of another $\alpha $ - ${MC}$ . If not, it is an $\alpha $ - ${MC}$ ; otherwise, it is a useless $\alpha $ - ${MC}$ and should be removed. We further propose an optimized algorithm $Top\text{-}KMC+$ to reduce both time and space by merging the above three steps into a whole step. The experimental results on real datasets show that both $Top\text{-}KMC$ and $Top\text{-}KMC+$ can return $k$ largest $\alpha $ - ${MC}\text{s}$ efficiently.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords