BMC Genomics (Sep 2019)

Bioinformatics and DNA-extraction strategies to reliably detect genetic variants from FFPE breast tissue samples

  • Aditya Vijay Bhagwate,
  • Yuanhang Liu,
  • Stacey J. Winham,
  • Samantha J. McDonough,
  • Melody L. Stallings-Mann,
  • Ethan P. Heinzen,
  • Jaime I. Davila,
  • Robert A. Vierkant,
  • Tanya L. Hoskin,
  • Marlene Frost,
  • Jodi M. Carter,
  • Derek C. Radisky,
  • Julie M. Cunningham,
  • Amy C. Degnim,
  • Chen Wang

DOI
https://doi.org/10.1186/s12864-019-6056-8
Journal volume & issue
Vol. 20, no. 1
pp. 1 – 10

Abstract

Read online

Abstract Background Archived formalin fixed paraffin embedded (FFPE) samples are valuable clinical resources to examine clinically relevant morphology features and also to study genetic changes. However, DNA quality and quantity of FFPE samples are often sub-optimal, and resulting NGS-based genetics variant detections are prone to false positives. Evaluations of wet-lab and bioinformatics approaches are needed to optimize variant detection from FFPE samples. Results As a pilot study, we designed within-subject triplicate samples of DNA derived from paired FFPE and fresh frozen breast tissues to highlight FFPE-specific artifacts. For FFPE samples, we tested two FFPE DNA extraction methods to determine impact of wet-lab procedures on variant calling: QIAGEN QIAamp DNA Mini Kit (“QA”), and QIAGEN GeneRead DNA FFPE Kit (“QGR”). We also used negative-control (NA12891) and positive control samples (Horizon Discovery Reference Standard FFPE). All DNA sample libraries were prepared for NGS according to the QIAseq Human Breast Cancer Targeted DNA Panel protocol and sequenced on the HiSeq 4000. Variant calling and filtering were performed using QIAGEN Gene Globe Data Portal. Detailed variant concordance comparisons and mutational signature analysis were performed to investigate effects of FFPE samples compared to paired fresh frozen samples, along with different DNA extraction methods. In this study, we found that five times or more variants were called with FFPE samples, compared to their paired fresh-frozen tissue samples even after applying molecular barcoding error-correction and default bioinformatics filtering recommended by the vendor. We also found that QGR as an optimized FFPE-DNA extraction approach leads to much fewer discordant variants between paired fresh frozen and FFPE samples. Approximately 92% of the uniquely called FFPE variants were of low allelic frequency range ( T|G > A” mutational signature known to be representative of FFPE artifacts resulting from cytosine deamination. Based on control samples and FFPE-frozen replicates, we derived an effective filtering strategy with associated empirical false-discovery estimates. Conclusions Through this study, we demonstrated feasibility of calling and filtering genetic variants from FFPE tissue samples using a combined strategy with molecular barcodes, optimized DNA extraction, and bioinformatics methods incorporating genomics context such as mutational signature and variant allelic frequency.

Keywords