Reliability of Whole-Exome Sequencing for Assessing Intratumor Genetic Heterogeneity
Weiwei Shi,
Charlotte K.Y. Ng,
Raymond S. Lim,
Tingting Jiang,
Sushant Kumar,
Xiaotong Li,
Vikram B. Wali,
Salvatore Piscuoglio,
Mark B. Gerstein,
Anees B. Chagpar,
Britta Weigelt,
Lajos Pusztai,
Jorge S. Reis-Filho,
Christos Hatzis
Affiliations
Weiwei Shi
Department of Medicine, Yale School of Medicine, Yale University, New Haven, CT, USA
Charlotte K.Y. Ng
Department of Pathology, Memorial Sloan Kettering Cancer Center, New York, NY, USA; Institute of Pathology, University Hospital Basel, Basel, Switzerland; Department of Biomedicine, University of Basel, Basel, Switzerland
Raymond S. Lim
Department of Pathology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
Tingting Jiang
Department of Medicine, Yale School of Medicine, Yale University, New Haven, CT, USA
Sushant Kumar
Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
Xiaotong Li
Department of Medicine, Yale School of Medicine, Yale University, New Haven, CT, USA; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
Vikram B. Wali
Department of Medicine, Yale School of Medicine, Yale University, New Haven, CT, USA
Salvatore Piscuoglio
Department of Pathology, Memorial Sloan Kettering Cancer Center, New York, NY, USA; Institute of Pathology, University Hospital Basel, Basel, Switzerland
Mark B. Gerstein
Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Computer Science, Yale University, New Haven, CT, USA
Anees B. Chagpar
Department of Surgery, Yale School of Medicine, Yale University, New Haven, CT, USA; Yale Cancer Center, New Haven, CT, USA
Britta Weigelt
Department of Pathology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
Lajos Pusztai
Department of Medicine, Yale School of Medicine, Yale University, New Haven, CT, USA; Yale Cancer Center, New Haven, CT, USA
Jorge S. Reis-Filho
Department of Pathology, Memorial Sloan Kettering Cancer Center, New York, NY, USA; Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA; Corresponding author
Christos Hatzis
Department of Medicine, Yale School of Medicine, Yale University, New Haven, CT, USA; Yale Cancer Center, New Haven, CT, USA; Corresponding author
Summary: Multi-region sequencing is used to detect intratumor genetic heterogeneity (ITGH) in tumors. To assess whether genuine ITGH can be distinguished from sequencing artifacts, we performed whole-exome sequencing (WES) on three anatomically distinct regions of the same tumor with technical replicates to estimate technical noise. Somatic variants were detected with three different WES pipelines and subsequently validated by high-depth amplicon sequencing. The cancer-only pipeline was unreliable, with about 69% of the identified somatic variants being false positive. Even with matched normal DNA for which 82% of the somatic variants were detected reliably, only 36%–78% were found consistently in technical replicate pairs. Overall, 34%–80% of the discordant somatic variants, which could be interpreted as ITGH, were found to constitute technical noise. Excluding mutations affecting low-mappability regions or occurring in certain mutational contexts was found to reduce artifacts, yet detection of subclonal mutations by WES in the absence of orthogonal validation remains unreliable. : Shi et al. report that standard coverage whole-exome sequencing and bioinformatics pipelines cannot discriminate between genuine intratumor genetic heterogeneity and sequencing artifacts. Although aggressive minimum depth filtering would not improve the false detection rate of subclonal mutations, excluding mutations in low-mappability regions or in certain mutational contexts could help. Keywords: massively parallel sequencing, whole-exome sequencing, somatic mutations, intratumor genetic heterogeneity, multi-region profiling, breast cancer, mutational signatures, mappability, subclonal