BMC Medical Genomics (Mar 2022)

Establishing analytical validity of BeadChip array genotype data by comparison to whole-genome sequence and standard benchmark datasets

  • Praveen F. Cherukuri,
  • Melissa M. Soe,
  • David E. Condon,
  • Shubhi Bartaria,
  • Kaitlynn Meis,
  • Shaopeng Gu,
  • Frederick G. Frost,
  • Lindsay M. Fricke,
  • Krzysztof P. Lubieniecki,
  • Joanna M. Lubieniecka,
  • Robert E. Pyatt,
  • Catherine Hajek,
  • Cornelius F. Boerkoel,
  • Lynn Carmichael

DOI
https://doi.org/10.1186/s12920-022-01199-8
Journal volume & issue
Vol. 15, no. 1
pp. 1 – 17

Abstract

Read online

Abstract Background Clinical use of genotype data requires high positive predictive value (PPV) and thorough understanding of the genotyping platform characteristics. BeadChip arrays, such as the Global Screening Array (GSA), potentially offer a high-throughput, low-cost clinical screen for known variants. We hypothesize that quality assessment and comparison to whole-genome sequence and benchmark data establish the analytical validity of GSA genotyping. Methods To test this hypothesis, we selected 263 samples from Coriell, generated GSA genotypes in triplicate, generated whole genome sequence (rWGS) genotypes, assessed the quality of each set of genotypes, and compared each set of genotypes to each other and to the 1000 Genomes Phase 3 (1KG) genotypes, a performance benchmark. For 59 genes (MAP59), we also performed theoretical and empirical evaluation of variants deemed medically actionable predispositions. Results Quality analyses detected sample contamination and increased assay failure along the chip margins. Comparison to benchmark data demonstrated that > 82% of the GSA assays had a PPV of 1. GSA assays targeting transitions, genomic regions of high complexity, and common variants performed better than those targeting transversions, regions of low complexity, and rare variants. Comparison of GSA data to rWGS and 1KG data showed > 99% performance across all measured parameters. Consistent with predictions from prior studies, the GSA detection of variation within the MAP59 genes was 3/261. Conclusion We establish the analytical validity of GSA assays using quality analytics and comparison to benchmark and rWGS data. GSA assays meet the standards of a clinical screen although assays interrogating rare variants, transversions, and variants within low-complexity regions require careful evaluation.

Keywords