PLoS ONE (Jan 2016)

Using Synthetic Mouse Spike-In Transcripts to Evaluate RNA-Seq Analysis Tools.

  • Dena Leshkowitz,
  • Ester Feldmesser,
  • Gilgi Friedlander,
  • Ghil Jona,
  • Elena Ainbinder,
  • Yisrael Parmet,
  • Shirley Horn-Saban

DOI
https://doi.org/10.1371/journal.pone.0153782
Journal volume & issue
Vol. 11, no. 4
p. e0153782

Abstract

Read online

One of the key applications of next-generation sequencing (NGS) technologies is RNA-Seq for transcriptome genome-wide analysis. Although multiple studies have evaluated and benchmarked RNA-Seq tools dedicated to gene level analysis, few studies have assessed their effectiveness on the transcript-isoform level. Alternative splicing is a naturally occurring phenomenon in eukaryotes, significantly increasing the biodiversity of proteins that can be encoded by the genome. The aim of this study was to assess and compare the ability of the bioinformatics approaches and tools to assemble, quantify and detect differentially expressed transcripts using RNA-Seq data, in a controlled experiment. To this end, in vitro synthesized mouse spike-in control transcripts were added to the total RNA of differentiating mouse embryonic bodies, and their expression patterns were measured. This novel approach was used to assess the accuracy of the tools, as established by comparing the observed results versus the results expected of the mouse controlled spiked-in transcripts. We found that detection of differential expression at the gene level is adequate, yet on the transcript-isoform level, all tools tested lacked accuracy and precision.