Genome Biology (Nov 2021)

recount3: summaries and queries for large-scale RNA-seq expression and splicing

  • Christopher Wilks,
  • Shijie C. Zheng,
  • Feng Yong Chen,
  • Rone Charles,
  • Brad Solomon,
  • Jonathan P. Ling,
  • Eddie Luidy Imada,
  • David Zhang,
  • Lance Joseph,
  • Jeffrey T. Leek,
  • Andrew E. Jaffe,
  • Abhinav Nellore,
  • Leonardo Collado-Torres,
  • Kasper D. Hansen,
  • Ben Langmead

DOI
https://doi.org/10.1186/s13059-021-02533-6
Journal volume & issue
Vol. 22, no. 1
pp. 1 – 40

Abstract

We present recount3, a resource consisting of over 750,000 publicly available human and mouse RNA sequencing (RNA-seq) samples uniformly processed by our new Monorail analysis pipeline. To facilitate access to the data, we provide the recount3 and snapcount R/Bioconductor packages as well as complementary web resources. Using these tools, data can be downloaded as study-level summaries or queried for specific exon-exon junctions, genes, samples, or other features. Monorail can be used to process local and/or private data, allowing results to be directly compared to any study in recount3. Taken together, our tools help biologists maximize the utility of publicly available RNA-seq data, especially to improve their understanding of newly collected data. recount3 is available from http://rna.recount.bio.