PLoS Genetics (Dec 2016)

Mutation Rate Variation is a Primary Determinant of the Distribution of Allele Frequencies in Humans.

  • Arbel Harpak,
  • Anand Bhaskar,
  • Jonathan K Pritchard

DOI
https://doi.org/10.1371/journal.pgen.1006489
Journal volume & issue
Vol. 12, no. 12
p. e1006489

Abstract

Read online

The site frequency spectrum (SFS) has long been used to study demographic history and natural selection. Here, we extend this summary by examining the SFS conditional on the alleles found at the same site in other species. We refer to this extension as the "phylogenetically-conditioned SFS" or cSFS. Using recent large-sample data from the Exome Aggregation Consortium (ExAC), combined with primate genome sequences, we find that human variants that occurred independently in closely related primate lineages are at higher frequencies in humans than variants with parallel substitutions in more distant primates. We show that this effect is largely due to sites with elevated mutation rates causing significant departures from the widely-used infinite sites mutation model. Our analysis also suggests substantial variation in mutation rates even among mutations involving the same nucleotide changes. In summary, we show that variable mutation rates are key determinants of the SFS in humans.