Applied Network Science (Jul 2024)

Model selection for network data based on spectral information

  • Jairo Iván Peña Hidalgo,
  • Jonathan R. Stewart

DOI
https://doi.org/10.1007/s41109-024-00640-4
Journal volume & issue
Vol. 9, no. 1
pp. 1 – 21

Abstract

Read online

Abstract In this work, we explore the extent to which the spectrum of the graph Laplacian can characterize the probability distribution of random graphs for the purpose of model evaluation and model selection for network data applications. Network data, often represented as a graph, consist of a set of pairwise observations between elements of a population of interests. The statistical network analysis literature has developed many different classes of network data models, with notable model classes including stochastic block models, latent node position models, and exponential families of random graph models. We develop a novel methodology which exploits the information contained in the spectrum of the graph Laplacian to predict the data-generating model from a set of candidate models. Through simulation studies, we explore the extent to which network data models can be differentiated by the spectrum of the graph Laplacian. We demonstrate the potential of our method through two applications to well-studied network data sets and validate our findings against existing analyses in the statistical network analysis literature.

Keywords