A systematic overview of single-cell transcriptomics databases, their use cases, and limitations

Mahnoor N. Gondal; Mahnoor N. Gondal; Saad Ur Rehman Shah; Arul M. Chinnaiyan; Arul M. Chinnaiyan; Arul M. Chinnaiyan; Arul M. Chinnaiyan; Arul M. Chinnaiyan; Arul M. Chinnaiyan; Marcin Cieslik; Marcin Cieslik; Marcin Cieslik; Marcin Cieslik

doi:10.3389/fbinf.2024.1417428

Frontiers in Bioinformatics (Jul 2024)

A systematic overview of single-cell transcriptomics databases, their use cases, and limitations

Mahnoor N. Gondal,
Mahnoor N. Gondal,
Saad Ur Rehman Shah,
Arul M. Chinnaiyan,
Arul M. Chinnaiyan,
Arul M. Chinnaiyan,
Arul M. Chinnaiyan,
Arul M. Chinnaiyan,
Arul M. Chinnaiyan,
Marcin Cieslik,
Marcin Cieslik,
Marcin Cieslik,
Marcin Cieslik

Affiliations

Mahnoor N. Gondal: Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, United States
Mahnoor N. Gondal: Michigan Center for Translational Pathology, University of Michigan, Ann Arbor, MI, United States
Saad Ur Rehman Shah: Gies College of Business, University of Illinois Business College, Champaign, MI, United States
Arul M. Chinnaiyan: Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, United States
Arul M. Chinnaiyan: Michigan Center for Translational Pathology, University of Michigan, Ann Arbor, MI, United States
Arul M. Chinnaiyan: Department of Pathology, University of Michigan, Ann Arbor, MI, United States
Arul M. Chinnaiyan: Department of Urology, University of Michigan, Ann Arbor, MI, United States
Arul M. Chinnaiyan: Howard Hughes Medical Institute, Ann Arbor, MI, United States
Arul M. Chinnaiyan: University of Michigan Rogel Cancer Center, Ann Arbor, MI, United States
Marcin Cieslik: Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, United States
Marcin Cieslik: Michigan Center for Translational Pathology, University of Michigan, Ann Arbor, MI, United States
Marcin Cieslik: Department of Pathology, University of Michigan, Ann Arbor, MI, United States
Marcin Cieslik: University of Michigan Rogel Cancer Center, Ann Arbor, MI, United States

DOI: https://doi.org/10.3389/fbinf.2024.1417428
Journal volume & issue: Vol. 4

Abstract

Read online

Rapid advancements in high-throughput single-cell RNA-seq (scRNA-seq) technologies and experimental protocols have led to the generation of vast amounts of transcriptomic data that populates several online databases and repositories. Here, we systematically examined large-scale scRNA-seq databases, categorizing them based on their scope and purpose such as general, tissue-specific databases, disease-specific databases, cancer-focused databases, and cell type-focused databases. Next, we discuss the technical and methodological challenges associated with curating large-scale scRNA-seq databases, along with current computational solutions. We argue that understanding scRNA-seq databases, including their limitations and assumptions, is crucial for effectively utilizing this data to make robust discoveries and identify novel biological insights. Such platforms can help bridge the gap between computational and wet lab scientists through user-friendly web-based interfaces needed for democratizing access to single-cell data. These platforms would facilitate interdisciplinary research, enabling researchers from various disciplines to collaborate effectively. This review underscores the importance of leveraging computational approaches to unravel the complexities of single-cell data and offers a promising direction for future research in the field.

Published in Frontiers in Bioinformatics

ISSN: 2673-7647 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.frontiersin.org/journals/bioinformatics

About the journal

Abstract

Keywords