Genome Biology (Sep 2024)

ESCHR: a hyperparameter-randomized ensemble approach for robust clustering across diverse datasets

  • Sarah M. Goggin,
  • Eli R. Zunder

DOI
https://doi.org/10.1186/s13059-024-03386-5
Journal volume & issue
Vol. 25, no. 1
pp. 1 – 27

Abstract

Read online

Abstract Clustering is widely used for single-cell analysis, but current methods are limited in accuracy, robustness, ease of use, and interpretability. To address these limitations, we developed an ensemble clustering method that outperforms other methods at hard clustering without the need for hyperparameter tuning. It also performs soft clustering to characterize continuum-like regions and quantify clustering uncertainty, demonstrated here by mapping the connectivity and intermediate transitions between MNIST handwritten digits and between hypothalamic tanycyte subpopulations. This hyperparameter-randomized ensemble approach improves the accuracy, robustness, ease of use, and interpretability of single-cell clustering, and may prove useful in other fields as well.

Keywords