G3: Genes, Genomes, Genetics (Mar 2021)

Single-cell data clustering based on sparse optimization and low-rank matrix factorization

  • Yinlei Hu
  • Bin Li
  • Falai Chen
  • Kun Qu

DOI
https://doi.org/10.1093/g3journal/jkab098
Journal volume & issue
Vol. 11, no. 6

Abstract

Unsupervised clustering is a fundamental step of single-cell RNA-sequencing (scRNA-seq) data analysis, and this need has inspired several methods for classifying cells in scRNA-seq data. However, accurate prediction of the cell clusters remains a substantial challenge. In this study, we propose a new algorithm for scRNA-seq data clustering based on Sparse Optimization and low-rank matrix factorization (scSO). We applied scSO to multiple benchmark datasets and showed that the cluster number it predicted was close to the number of reference cell types and that most cells were correctly classified. The scSO algorithm is available at https://github.com/QuKunLab/scSO