PLoS Computational Biology (Dec 2022)

Secuer: Ultrafast, scalable and accurate clustering of single-cell RNA-seq data.

  • Nana Wei,
  • Yating Nie,
  • Lin Liu,
  • Xiaoqi Zheng,
  • Hua-Jun Wu

DOI
https://doi.org/10.1371/journal.pcbi.1010753
Journal volume & issue
Vol. 18, no. 12
p. e1010753

Abstract

Read online

Identifying cell clusters is a critical step for single-cell transcriptomics study. Despite the numerous clustering tools developed recently, the rapid growth of scRNA-seq volumes prompts for a more (computationally) efficient clustering method. Here, we introduce Secuer, a Scalable and Efficient speCtral clUstERing algorithm for scRNA-seq data. By employing an anchor-based bipartite graph representation algorithm, Secuer enjoys reduced runtime and memory usage over one order of magnitude for datasets with more than 1 million cells. Meanwhile, Secuer also achieves better or comparable accuracy than competing methods in small and moderate benchmark datasets. Furthermore, we showcase that Secuer can also serve as a building block for a new consensus clustering method, Secuer-consensus, which again improves the runtime and scalability of state-of-the-art consensus clustering methods while also maintaining the accuracy. Overall, Secuer is a versatile, accurate, and scalable clustering framework suitable for small to ultra-large single-cell clustering tasks.