Human Genomics (Jun 2005)

Large-scale SNP analysis reveals clustered and continuous patterns of human genetic variation

  • Shriver Mark D,
  • Mei Rui,
  • Parra Esteban J,
  • Sonpar Vibhor,
  • Halder Indrani,
  • Tishkoff Sarah A,
  • Schurr Theodore G,
  • Zhadanov Sergev I,
  • Osipova Ludmila P,
  • Brutsaert Tom D,
  • Friedlaender Jonathan,
  • Jorde Lynn B,
  • Watkins W Scott,
  • Bamshad Michael J,
  • Gutierrez Gerardo,
  • Loi Halina,
  • Matsuzaki Hajime,
  • Kittles Rick A,
  • Argyropoulos George,
  • Fernandez Jose R,
  • Akey Joshua M,
  • Jones Keith W

DOI
https://doi.org/10.1186/1479-7364-2-2-81
Journal volume & issue
Vol. 2, no. 2
pp. 81 – 89

Abstract

Read online

Abstract Understanding the distribution of human genetic variation is an important foundation for research into the genetics of common diseases. Some of the alleles that modify common disease risk are themselves likely to be common and, thus, amenable to identification using gene-association methods. A problem with this approach is that the large sample sizes required for sufficient statistical power to detect alleles with moderate effect make gene-association studies susceptible to false-positive findings as the result of population stratification 12. Such type I errors can be eliminated by using either family-based association tests or methods that sufficiently adjust for population stratification 345. These methods require the availability of genetic markers that can detect and, thus, control for sources of genetic stratification among populations. In an effort to investigate population stratification and identify appropriate marker panels, we have analysed 11,555 single nucleotide polymorphisms in 203 individuals from 12 diverse human populations. Individuals in each population cluster to the exclusion of individuals from other populations using two clustering methods. Higher-order branching and clustering of the populations are consistent with the geographic origins of populations and with previously published genetic analyses. These data provide a valuable resource for the definition of marker panels to detect and control for population stratification in population-based gene identification studies. Using three US resident populations (European-American, African-American and Puerto Rican), we demonstrate how such studies can proceed, quantifying proportional ancestry levels and detecting significant admixture structure in each of these populations.

Keywords