Genome Biology (Apr 2023)

Consequences and opportunities arising due to sparser single-cell RNA-seq datasets

  • Gerard A. Bouland,
  • Ahmed Mahfouz,
  • Marcel J. T. Reinders

DOI
https://doi.org/10.1186/s13059-023-02933-w
Journal volume & issue
Vol. 24, no. 1
pp. 1 – 10

Abstract

Read online

Abstract With the number of cells measured in single-cell RNA sequencing (scRNA-seq) datasets increasing exponentially and concurrent increased sparsity due to more zero counts being measured for many genes, we demonstrate here that downstream analyses on binary-based gene expression give similar results as count-based analyses. Moreover, a binary representation scales up to ~ 50-fold more cells that can be analyzed using the same computational resources. We also highlight the possibilities provided by binarized scRNA-seq data. Development of specialized tools for bit-aware implementations of downstream analytical tasks will enable a more fine-grained resolution of biological heterogeneity.