Genome Biology (Jun 2018)

dropEst: pipeline for accurate estimation of molecular counts in droplet-based single-cell RNA-seq experiments

  • Viktor Petukhov,
  • Jimin Guo,
  • Ninib Baryawno,
  • Nicolas Severe,
  • David T. Scadden,
  • Maria G. Samsonova,
  • Peter V. Kharchenko

DOI
https://doi.org/10.1186/s13059-018-1449-6
Journal volume & issue
Vol. 19, no. 1
pp. 1 – 16

Abstract

Read online

Abstract Recent single-cell RNA-seq protocols based on droplet microfluidics use massively multiplexed barcoding to enable simultaneous measurements of transcriptomes for thousands of individual cells. The increasing complexity of such data creates challenges for subsequent computational processing and troubleshooting of these experiments, with few software options currently available. Here, we describe a flexible pipeline for processing droplet-based transcriptome data that implements barcode corrections, classification of cell quality, and diagnostic information about the droplet libraries. We introduce advanced methods for correcting composition bias and sequencing errors affecting cellular and molecular barcodes to provide more accurate estimates of molecular counts in individual cells.