Nature Communications (Jan 2024)

Anti-correlated feature selection prevents false discovery of subpopulations in scRNAseq

  • Scott R. Tyler,
  • Daniel Lozano-Ojalvo,
  • Ernesto Guccione,
  • Eric E. Schadt

DOI
https://doi.org/10.1038/s41467-023-43406-9
Journal volume & issue
Vol. 15, no. 1
pp. 1 – 15

Abstract

Read online

Abstract While sub-clustering cell-populations has become popular in single cell-omics, negative controls for this process are lacking. Popular feature-selection/clustering algorithms fail the null-dataset problem, allowing erroneous subdivisions of homogenous clusters until nearly each cell is called its own cluster. Using real and synthetic datasets, we find that anti-correlated gene selection reduces or eliminates erroneous subdivisions, increases marker-gene selection efficacy, and efficiently scales to millions of cells.