Optimization of identifiability for efficient community detection

Hui-Jia Li; Lin Wang; Yan Zhang; Matjaž Perc

doi:10.1088/1367-2630/ab8e5e

New Journal of Physics (Jan 2020)

Optimization of identifiability for efficient community detection

Hui-Jia Li,
Lin Wang,
Yan Zhang,
Matjaž Perc

Affiliations

Hui-Jia Li: ORCiD; School of Science, Beijing University of Posts and Telecommunications , Beijing 100876, People’s Republic of China
Lin Wang: ORCiD; Department of Genetics, University of Cambridge , Cambridge, CB2 3EH, United Kingdom
Yan Zhang: Alibaba Local Services Lab, Alibaba Group , Shanghai 200333, People’s Republic of China
Matjaž Perc: ORCiD; Faculty of Natural Sciences and Mathematics, University of Maribor , Koroška cesta 160, 2000 Maribor, Slovenia; Department of Medical Research, China Medical University Hospital, China Medical University , Taichung, Taiwan; Complexity Science Hub Vienna , Josefstädterstraße 39, 1080 Vienna, Austria

DOI: https://doi.org/10.1088/1367-2630/ab8e5e
Journal volume & issue: Vol. 22, no. 6
p. 063035

Abstract

Read online

Many physical and social systems are best described by networks. And the structural properties of these networks often critically determine the properties and function of the resulting mathematical models. An important method to infer the correlations between topology and function is the detection of community structure, which plays a key role in the analysis, design, and optimization of many complex systems. The nonnegative matrix factorization has been used prolifically to that effect in recent years, although it cannot guarantee balanced partitions, and it also does not allow a proactive computation of the number of communities in a network. This indicates that the nonnegative matrix factorization does not satisfy all the nonnegative low-rank approximation conditions. Here we show how to resolve this important open problem by optimizing the identifiability of community structure. We propose a new form of nonnegative matrix decomposition and a probabilistic surrogate learning function that can be solved according to the majorization–minimization principle. Extensive in silico tests on artificial and real-world data demonstrate the efficient performance in community detection, regardless of the size and complexity of the network.

Published in New Journal of Physics

ISSN: 1367-2630 (Online)
Publisher: IOP Publishing
Country of publisher: United Kingdom
LCC subjects: Science: Physics
Website: https://iopscience.iop.org/journal/1367-2630

About the journal

Abstract

Keywords