Systematic evaluation of RNA-Seq preparation protocol performance

Hsueh-Ping Chao; Yueping Chen; Yoko Takata; Mary W. Tomida; Kevin Lin; Jason S. Kirk; Melissa S. Simper; Carol D. Mikulec; Joyce E. Rundhaug; Susan M. Fischer; Taiping Chen; Dean G. Tang; Yue Lu; Jianjun Shen

doi:10.1186/s12864-019-5953-1

BMC Genomics (Jul 2019)

Affiliations

Hsueh-Ping Chao: Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center
Yueping Chen: Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center
Yoko Takata: Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center
Mary W. Tomida: Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center
Kevin Lin: Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center
Jason S. Kirk: Department of Pharmacology and Therapeutics, Roswell Park Cancer Institute
Melissa S. Simper: Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center
Carol D. Mikulec: Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center
Joyce E. Rundhaug: Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center
Susan M. Fischer: Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center
Taiping Chen: Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center
Dean G. Tang: Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center
Yue Lu: Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center
Jianjun Shen: Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center

Journal volume & issue: Vol. 20, no. 1, pp. 1–20

Abstract

Background: RNA-Seq is currently the most widely used tool to analyze whole-transcriptome profiles. There are numerous commercial kits available to facilitate preparing RNA-Seq libraries; however, it is still not clear how some of these kits perform in terms of: 1) ribosomal RNA removal; 2) read coverage or recovery of exonic vs. intronic sequences; 3) identification of differentially expressed genes (DEGs); and 4) detection of long non-coding RNA (lncRNA). In RNA-Seq analysis, understanding the strengths and limitations of commonly used RNA-Seq library preparation protocols is important, as this technology remains costly and time-consuming.

Results: In this study, we present a comprehensive evaluation of four RNA-Seq kits. We used three standard input protocols: Illumina TruSeq Stranded Total RNA and mRNA kits, a modified NuGEN Ovation v2 kit, and the TaKaRa SMARTer Ultra Low RNA Kit v3. Our evaluation of these kits included quality control measures such as overall reproducibility, 5′ and 3′ end-bias, and the identification of DEGs, lncRNAs, and alternatively spliced transcripts. Overall, we found that the two Illumina kits were most similar in terms of recovering DEGs, and the Illumina, modified NuGEN, and TaKaRa kits allowed identification of a similar set of DEGs. However, we also discovered that the Illumina, NuGEN and TaKaRa kits each enriched for different sets of genes.

Conclusions: At the manufacturers’ recommended input RNA levels, all the RNA-Seq library preparation protocols evaluated were suitable for distinguishing between experimental groups, and the TruSeq Stranded mRNA kit was universally applicable to studies focusing on protein-coding gene profiles. The TruSeq protocols tended to capture genes with higher expression and GC content, whereas the modified NuGEN protocol tended to capture longer genes. The SMARTer Ultra Low RNA Kit may be a good choice at the low RNA input level, although it was inferior to the TruSeq mRNA kit at standard input level in terms of rRNA removal, exonic mapping rates and recovered DEGs. Therefore, the choice of RNA-Seq library preparation kit can profoundly affect data outcomes. Consequently, it is a pivotal parameter to consider when designing an RNA-Seq experiment.

Published in BMC Genomics

ISSN: 1471-2164 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Technology: Chemical technology: Biotechnology; Science: Biology (General): Genetics
Website: http://bmcgenomics.biomedcentral.com
