BMC Bioinformatics (Sep 2024)

scBubbletree: computational approach for visualization of single cell RNA-seq data

  • Simo Kitanovski,
  • Yingying Cao,
  • Dimitris Ttoouli,
  • Farnoush Farahpour,
  • Jun Wang,
  • Daniel Hoffmann

DOI
https://doi.org/10.1186/s12859-024-05927-y
Journal volume & issue
Vol. 25, no. 1
pp. 1 – 17

Abstract

Read online

Abstract Background Visualization approaches transform high-dimensional data from single cell RNA sequencing (scRNA-seq) experiments into two-dimensional plots that are used for analysis of cell relationships, and as a means of reporting biological insights. Yet, many standard approaches generate visuals that suffer from overplotting, lack of quantitative information, and distort global and local properties of biological patterns relative to the original high-dimensional space. Results We present scBubbletree, a new, scalable method for visualization of scRNA-seq data. The method identifies clusters of cells of similar transcriptomes and visualizes such clusters as “bubbles” at the tips of dendrograms (bubble trees), corresponding to quantitative summaries of cluster properties and relationships. scBubbletree stacks bubble trees with further cluster-associated information in a visually easily accessible way, thus facilitating quantitative assessment and biological interpretation of scRNA-seq data. We demonstrate this with large scRNA-seq data sets, including one with over 1.2 million cells. Conclusions To facilitate coherent quantification and visualization of scRNA-seq data we developed the R-package scBubbletree, which is freely available as part of the Bioconductor repository at: https://bioconductor.org/packages/scBubbletree/

Keywords