F1000Research (Jul 2020)

clustifyr: an R package for automated single-cell RNA sequencing cluster classification [version 2; peer review: 2 approved]

  • Rui Fu,
  • Austin E. Gillen,
  • Ryan M. Sheridan,
  • Chengzhe Tian,
  • Michelle Daya,
  • Yue Hao,
  • Jay R. Hesselberth,
  • Kent A. Riemondy

DOI
https://doi.org/10.12688/f1000research.22969.2
Journal volume & issue
Vol. 9

Abstract

Assignment of cell types from single-cell RNA sequencing (scRNA-seq) data remains a time-consuming and error-prone process. Current packages for identity assignment use limited types of reference data and often have rigid data structure requirements. We developed the clustifyr R package to assign likely cell types by leveraging several types of external data, including gene expression profiles from scRNA-seq, bulk RNA-seq, or microarray experiments, as well as signature gene lists. We benchmark various parameters of a correlation-based approach and implement gene list enrichment methods. clustifyr is a lightweight and effective cell-type assignment tool developed for compatibility with various scRNA-seq analysis workflows. clustifyr is publicly available at https://github.com/rnabioco/clustifyr.
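As a rough illustration of what correlation-based cluster classification involves, the following is a minimal sketch in Python, not the clustifyr API or the authors' implementation. It assumes that each cluster and each reference cell type has already been reduced to an average expression vector over the same ordered set of genes. The function names, the toy values, the use of Pearson correlation, and the 0.5 cutoff are all hypothetical choices made for the example; the abstract does not specify the correlation metric or threshold used by the package.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length vectors (0.0 if either is constant)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def assign_cell_types(cluster_profiles, reference_profiles, threshold=0.5):
    """Label each cluster with its best-correlated reference cell type.

    threshold is an assumed cutoff: clusters whose best correlation falls
    below it are reported as "unassigned".
    """
    calls = {}
    for cluster, profile in cluster_profiles.items():
        scores = {
            cell_type: pearson(profile, reference)
            for cell_type, reference in reference_profiles.items()
        }
        best = max(scores, key=scores.get)
        calls[cluster] = best if scores[best] >= threshold else "unassigned"
    return calls


# Toy average expression values over the same four genes (made-up numbers).
cluster_profiles = {
    "cluster_0": [9.0, 8.0, 1.0, 0.5],
    "cluster_1": [0.5, 1.0, 7.0, 9.0],
    "cluster_2": [3.0, 3.0, 3.0, 3.0],  # flat profile, correlates with nothing
}
reference_profiles = {
    "T cell": [10.0, 7.0, 0.0, 1.0],
    "B cell": [1.0, 0.0, 8.0, 10.0],
}

calls = assign_cell_types(cluster_profiles, reference_profiles)
print(calls)
```

In this toy example the first two clusters are matched to the reference they track most closely, while the flat third cluster falls below the cutoff and is left unassigned. Working on per-cluster averages rather than individual cells keeps the comparison cheap and lets the reference come from bulk or microarray data as readily as from scRNA-seq.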