Genome Biology (Sep 2023)

ZINBMM: a general mixture model for simultaneous clustering and gene selection using single-cell transcriptomic data

  • Yang Li,
  • Mingcong Wu,
  • Shuangge Ma,
  • Mengyun Wu

DOI
https://doi.org/10.1186/s13059-023-03046-0
Journal volume & issue
Vol. 24, no. 1
pp. 1 – 28

Abstract

Read online

Abstract Clustering is a critical component of single-cell RNA sequencing (scRNA-seq) data analysis and can help reveal cell types and infer cell lineages. Despite considerable successes, there are few methods tailored to investigating cluster-specific genes contributing to cell heterogeneity, which can promote biological understanding of cell heterogeneity. In this study, we propose a zero-inflated negative binomial mixture model (ZINBMM) that simultaneously achieves effective scRNA-seq data clustering and gene selection. ZINBMM conducts a systemic analysis on raw counts, accommodating both batch effects and dropout events. Simulations and the analysis of five scRNA-seq datasets demonstrate the practical applicability of ZINBMM.

Keywords