New Journal of Physics (Jan 2020)

Optimization of identifiability for efficient community detection

  • Hui-Jia Li,
  • Lin Wang,
  • Yan Zhang,
  • Matjaž Perc

DOI
https://doi.org/10.1088/1367-2630/ab8e5e
Journal volume & issue
Vol. 22, no. 6
p. 063035

Abstract

Read online

Many physical and social systems are best described by networks. And the structural properties of these networks often critically determine the properties and function of the resulting mathematical models. An important method to infer the correlations between topology and function is the detection of community structure, which plays a key role in the analysis, design, and optimization of many complex systems. The nonnegative matrix factorization has been used prolifically to that effect in recent years, although it cannot guarantee balanced partitions, and it also does not allow a proactive computation of the number of communities in a network. This indicates that the nonnegative matrix factorization does not satisfy all the nonnegative low-rank approximation conditions. Here we show how to resolve this important open problem by optimizing the identifiability of community structure. We propose a new form of nonnegative matrix decomposition and a probabilistic surrogate learning function that can be solved according to the majorization–minimization principle. Extensive in silico tests on artificial and real-world data demonstrate the efficient performance in community detection, regardless of the size and complexity of the network.

Keywords