PLoS ONE (Jan 2012)

Technical reproducibility of genotyping SNP arrays used in genome-wide association studies.

  • Huixiao Hong,
  • Lei Xu,
  • Jie Liu,
  • Wendell D Jones,
  • Zhenqiang Su,
  • Baitang Ning,
  • Roger Perkins,
  • Weigong Ge,
  • Kelci Miclaus,
  • Li Zhang,
  • Kyunghee Park,
  • Bridgett Green,
  • Tao Han,
  • Hong Fang,
  • Christophe G Lambert,
  • Silvia C Vega,
  • Simon M Lin,
  • Nadereh Jafari,
  • Wendy Czika,
  • Russell D Wolfinger,
  • Federico Goodsaid,
  • Weida Tong,
  • Leming Shi

DOI
https://doi.org/10.1371/journal.pone.0044483
Journal volume & issue
Vol. 7, no. 9
p. e44483

Abstract

Read online

During the last several years, high-density genotyping SNP arrays have facilitated genome-wide association studies (GWAS) that successfully identified common genetic variants associated with a variety of phenotypes. However, each of the identified genetic variants only explains a very small fraction of the underlying genetic contribution to the studied phenotypic trait. Moreover, discordance observed in results between independent GWAS indicates the potential for Type I and II errors. High reliability of genotyping technology is needed to have confidence in using SNP data and interpreting GWAS results. Therefore, reproducibility of two widely genotyping technology platforms from Affymetrix and Illumina was assessed by analyzing four technical replicates from each of the six individuals in five laboratories. Genotype concordance of 99.40% to 99.87% within a laboratory for the sample platform, 98.59% to 99.86% across laboratories for the same platform, and 98.80% across genotyping platforms was observed. Moreover, arrays with low quality data were detected when comparing genotyping data from technical replicates, but they could not be detected according to venders' quality control (QC) suggestions. Our results demonstrated the technical reliability of currently available genotyping platforms but also indicated the importance of incorporating some technical replicates for genotyping QC in order to improve the reliability of GWAS results. The impact of discordant genotypes on association analysis results was simulated and could explain, at least in part, the irreproducibility of some GWAS findings when the effect size (i.e. the odds ratio) and the minor allele frequencies are low.