PLoS ONE (Jan 2012)

HGDP and HapMap analysis by Ancestry Mapper reveals local and global population relationships.

  • Tiago R Magalhães,
  • Jillian P Casey,
  • Judith Conroy,
  • Regina Regan,
  • Darren J Fitzpatrick,
  • Naisha Shah,
  • João Sobral,
  • Sean Ennis

DOI
https://doi.org/10.1371/journal.pone.0049438
Journal volume & issue
Vol. 7, no. 11
p. e49438

Abstract

Knowledge of human origins, migrations, and expansions is greatly enhanced by the availability of large datasets of genetic information from different populations and by the development of bioinformatic tools used to analyze the data. We present Ancestry Mapper, a method that we believe improves on existing approaches for assigning genetic ancestry to an individual and for studying the relationships between local and global populations. The principal function of the method is to give each individual analyzed a genetic identifier, made up of just 51 genetic coordinates, that corresponds to that individual's relationship to the HGDP reference population. As a consequence, the Ancestry Mapper Id (AMid) has intrinsic biological meaning and provides a tool to measure similarity between world populations. We applied Ancestry Mapper to a dataset comprising the HGDP and HapMap data. The results show distinctions at the continental level, while simultaneously giving details at the population level. We clustered the AMids of HGDP/HapMap individuals and observed a recapitulation of human migrations: for a small number of clusters, individuals are grouped according to continental origins; for a larger number of clusters, regional and population distinctions are evident. Calculating distances between AMids allows us to infer ancestry. The number of coordinates is expandable, increasing the power of Ancestry Mapper. An R package called Ancestry Mapper is available to apply this method to any high-density genomic data set.
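
The idea of inferring ancestry from distances between AMids can be illustrated with a minimal sketch. It is not the Ancestry Mapper R package or its API. The Euclidean metric, the population labels (PopA, PopB, PopC), the coordinate values, and the function names are all assumptions chosen for illustration, and the toy identifiers use 3 coordinates rather than the 51 described above.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length coordinate vectors.

    Assumption: the abstract does not name the distance metric;
    Euclidean distance is used here only as a simple stand-in.
    """
    if len(a) != len(b):
        raise ValueError("identifiers must have the same number of coordinates")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest_reference(query, references):
    """Return the label of the reference identifier closest to `query`."""
    return min(references, key=lambda name: euclidean(query, references[name]))


# Hypothetical toy identifiers with 3 coordinates each.
# Real AMids have 51 coordinates; these values are invented.
references = {
    "PopA": [90.0, 20.0, 10.0],
    "PopB": [15.0, 85.0, 30.0],
    "PopC": [10.0, 25.0, 95.0],
}

query = [80.0, 30.0, 15.0]

# Report the closest reference label and its distance to the query.
closest = nearest_reference(query, references)
print(closest, euclidean(query, references[closest]))
```

In this toy setting the query individual is assigned to whichever reference identifier lies closest. The same pairwise distances could also feed a clustering step of the kind the abstract describes.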