PLoS Computational Biology (Oct 2019)

SourceSet: A graphical model approach to identify primary genes in perturbed biological pathways.

  • Elisa Salviato,
  • Vera Djordjilović,
  • Monica Chiogna,
  • Chiara Romualdi

DOI
https://doi.org/10.1371/journal.pcbi.1007357
Journal volume & issue
Vol. 15, no. 10
p. e1007357

Abstract

Read online

Topological gene-set analysis has emerged as a powerful means for omic data interpretation. Although numerous methods for identifying dysregulated genes have been proposed, few of them aim to distinguish genes that are the real source of perturbation from those that merely respond to the signal dysregulation. Here, we propose a new method, called SourceSet, able to distinguish between the primary and the secondary dysregulation within a Gaussian graphical model context. The proposed method compares gene expression profiles in the control and in the perturbed condition and detects the differences in both the mean and the covariance parameters with a series of likelihood ratio tests. The resulting evidence is used to infer the primary and the secondary set, i.e. the genes responsible for the primary dysregulation, and the genes affected by the perturbation through network propagation. The proposed method demonstrates high specificity and sensitivity in different simulated scenarios and on several real biological case studies. In order to fit into the more traditional pathway analysis framework, SourceSet R package also extends the analysis from a single to multiple pathways and provides several graphical outputs, including Cytoscape visualization to browse the results.