Genome Biology (Apr 2024)

SuperCellCyto: enabling efficient analysis of large scale cytometry datasets

  • Givanna H. Putri,
  • George Howitt,
  • Felix Marsh-Wakefield,
  • Thomas M. Ashhurst,
  • Belinda Phipson

DOI
https://doi.org/10.1186/s13059-024-03229-3
Journal volume & issue
Vol. 25, no. 1
pp. 1 – 27

Abstract

Read online

Abstract Advancements in cytometry technologies have enabled quantification of up to 50 proteins across millions of cells at single cell resolution. Analysis of cytometry data routinely involves tasks such as data integration, clustering, and dimensionality reduction. While numerous tools exist, many require extensive run times when processing large cytometry data containing millions of cells. Existing solutions, such as random subsampling, are inadequate as they risk excluding rare cell subsets. To address this, we propose SuperCellCyto, an R package that builds on the SuperCell tool which groups highly similar cells into supercells. SuperCellCyto is available on GitHub ( https://github.com/phipsonlab/SuperCellCyto ) and Zenodo ( https://doi.org/10.5281/zenodo.10521294 ).

Keywords