Technology in Cancer Research & Treatment (Feb 2022)

Comparison of Two Illumina Whole Transcriptome RNA Sequencing Library Preparation Methods Using Human Cancer FFPE Specimens

  • Danyi Wang PhD,
  • P. Alexander Rolfe PhD,
  • Dorothee Foernzler PhD,
  • Dennis O’Rourke MS,
  • Sheng Zhao PhD,
  • Juergen Scheuenpflug PhD,
  • Zheng Feng MD, PhD

DOI
https://doi.org/10.1177/15330338221076304
Journal volume & issue
Vol. 21

Abstract

Read online

Objective: RNA extraction and library preparation from formalin-fixed, paraffin-embedded (FFPE) samples are crucial pre-analytical steps towards achieving optimal downstream RNA sequencing (RNASeq) results. In this study, we assessed 2 Illumina library preparation methods for RNA-Seq analysis using archived FFPE samples from human cancer indications at 2 independent vendors. Methods: Twenty-five FFPE samples from 5 indications (non-small cell lung cancer, colorectal cancer, renal carcinoma, breast cancer, and hepatocellular carcinoma) were included, covering a wide range of sample storage durations (3-25 years-old), sample qualities, and specimen types (resection vs core needle biopsy). Each sample was processed independently by both vendors. Total RNA was isolated using the Qiagen miRNeasy FFPE kit followed by library construction using either TruSeq Stranded Total RNA library preparation kit with Ribo-Zero Gold, or TruSeq RNA Access library preparation kit. Libraries were normalized to 20 pM and sequenced on an Illumina HiSeq 2500 using V3 chemistry in paired-end mode with a read length of 2 × 50 bp. The data were processed through a standard RNASeq pipeline to produce counts and transcripts per millions for each gene in each sample to compare 2 library kits at 2 different vendors. Results: Our data showed that TruSeq RNA Access libraries yield over 80% exonic reads across different quality samples, indicating higher selectivity of the exome pull down by the capture approach compared to the random priming of the TruSeq Stranded Total kit. The overall QC data for FFPE RNA extraction, library preparation, and sequencing generated by the 2 vendors are comparable, and downstream gene expression quantification results show high concordance as well. With the TruSeq Stranded Total kit, the mean Spearman correlation between vendors was 0.87 and the mean Pearson correlation was 0.76. With the TruSeq RNA Access kit, the mean Spearman correlation between vendors was 0.89 and the mean Pearson correlation was 0.73. Interestingly, examination of the cross-vendor correlations compared to various common QC statistics suggested that library concentration is better correlated with consistency between vendors than is the RNA quantity. Conclusions: Our analyses provide evidence to guide selection of sequencing methods for FFPE samples in which the sample quality may be severely compromised.