Molecular Systems Biology (Feb 2019)

De novo gene signature identification from single‐cell RNA‐seq with hierarchical Poisson factorization

  • Hanna Mendes Levitin,
  • Jinzhou Yuan,
  • Yim Ling Cheng,
  • Francisco JR Ruiz,
  • Erin C Bush,
  • Jeffrey N Bruce,
  • Peter Canoll,
  • Antonio Iavarone,
  • Anna Lasorella,
  • David M Blei,
  • Peter A Sims

DOI
https://doi.org/10.15252/msb.20188557
Journal volume & issue
Vol. 15, no. 2

Abstract

Common approaches to gene signature discovery in single‐cell RNA‐sequencing (scRNA‐seq) depend upon predefined structures like clusters or pseudo‐temporal order, require prior normalization, or do not account for the sparsity of single‐cell data. We present single‐cell hierarchical Poisson factorization (scHPF), a Bayesian factorization method that adapts hierarchical Poisson factorization (Gopalan et al, , Proceedings of the 31st Conference on Uncertainty in Artificial Intelligence, 326) for de novo discovery of both continuous and discrete expression patterns from scRNA‐seq. scHPF does not require prior normalization and captures statistical properties of single‐cell data better than other methods in benchmark datasets. Applied to scRNA‐seq of the core and margin of a high‐grade glioma, scHPF uncovers marked differences in the abundance of glioma subpopulations across tumor regions and regionally associated expression biases within glioma subpopulations. scHPF revealed an expression signature that was spatially biased toward the glioma‐infiltrated margins and associated with inferior survival in glioblastoma.

Keywords