Establishing analytical validity of BeadChip array genotype data by comparison to whole-genome sequence and standard benchmark datasets

Praveen F. Cherukuri; Melissa M. Soe; David E. Condon; Shubhi Bartaria; Kaitlynn Meis; Shaopeng Gu; Frederick G. Frost; Lindsay M. Fricke; Krzysztof P. Lubieniecki; Joanna M. Lubieniecka; Robert E. Pyatt; Catherine Hajek; Cornelius F. Boerkoel; Lynn Carmichael

doi:10.1186/s12920-022-01199-8

BMC Medical Genomics (Mar 2022)

Establishing analytical validity of BeadChip array genotype data by comparison to whole-genome sequence and standard benchmark datasets

Praveen F. Cherukuri,
Melissa M. Soe,
David E. Condon,
Shubhi Bartaria,
Kaitlynn Meis,
Shaopeng Gu,
Frederick G. Frost,
Lindsay M. Fricke,
Krzysztof P. Lubieniecki,
Joanna M. Lubieniecka,
Robert E. Pyatt,
Catherine Hajek,
Cornelius F. Boerkoel,
Lynn Carmichael

Affiliations

Praveen F. Cherukuri: Imagenetics, Sanford Health
Melissa M. Soe: Imagenetics, Sanford Health
David E. Condon: Imagenetics, Sanford Health
Shubhi Bartaria: Imagenetics, Sanford Health
Kaitlynn Meis: Imagenetics, Sanford Health
Shaopeng Gu: Imagenetics, Sanford Health
Frederick G. Frost: Imagenetics, Sanford Health
Lindsay M. Fricke: Imagenetics, Sanford Health
Krzysztof P. Lubieniecki: Imagenetics, Sanford Health
Joanna M. Lubieniecka: Imagenetics, Sanford Health
Robert E. Pyatt: Imagenetics, Sanford Health
Catherine Hajek: Imagenetics, Sanford Health
Cornelius F. Boerkoel: Imagenetics, Sanford Health
Lynn Carmichael: Imagenetics, Sanford Health

DOI: https://doi.org/10.1186/s12920-022-01199-8
Journal volume & issue: Vol. 15, no. 1
pp. 1 – 17

Abstract

Read online

Abstract Background Clinical use of genotype data requires high positive predictive value (PPV) and thorough understanding of the genotyping platform characteristics. BeadChip arrays, such as the Global Screening Array (GSA), potentially offer a high-throughput, low-cost clinical screen for known variants. We hypothesize that quality assessment and comparison to whole-genome sequence and benchmark data establish the analytical validity of GSA genotyping. Methods To test this hypothesis, we selected 263 samples from Coriell, generated GSA genotypes in triplicate, generated whole genome sequence (rWGS) genotypes, assessed the quality of each set of genotypes, and compared each set of genotypes to each other and to the 1000 Genomes Phase 3 (1KG) genotypes, a performance benchmark. For 59 genes (MAP59), we also performed theoretical and empirical evaluation of variants deemed medically actionable predispositions. Results Quality analyses detected sample contamination and increased assay failure along the chip margins. Comparison to benchmark data demonstrated that > 82% of the GSA assays had a PPV of 1. GSA assays targeting transitions, genomic regions of high complexity, and common variants performed better than those targeting transversions, regions of low complexity, and rare variants. Comparison of GSA data to rWGS and 1KG data showed > 99% performance across all measured parameters. Consistent with predictions from prior studies, the GSA detection of variation within the MAP59 genes was 3/261. Conclusion We establish the analytical validity of GSA assays using quality analytics and comparison to benchmark and rWGS data. GSA assays meet the standards of a clinical screen although assays interrogating rare variants, transversions, and variants within low-complexity regions require careful evaluation.

Published in BMC Medical Genomics

ISSN: 1755-8794 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Internal medicine; Science: Biology (General): Genetics
Website: https://bmcmedgenomics.biomedcentral.com

About the journal

Abstract

Keywords