Challenges in selecting admixture models and marker sets to infer genetic ancestry in a Brazilian admixed population

Luciana Maia Escher; Michel S. Naslavsky; Marília O. Scliar; Yeda A. O. Duarte; Mayana Zatz; Kelly Nunes; Silviene F. Oliveira

doi:10.1038/s41598-022-25521-7

Scientific Reports (Dec 2022)

Challenges in selecting admixture models and marker sets to infer genetic ancestry in a Brazilian admixed population

Luciana Maia Escher,
Michel S. Naslavsky,
Marília O. Scliar,
Yeda A. O. Duarte,
Mayana Zatz,
Kelly Nunes,
Silviene F. Oliveira

Affiliations

Luciana Maia Escher: Human Genetics Laboratory, Institute of Biological Sciences, University of Brasilia
Michel S. Naslavsky: Department of Genetics and Evolutionary Biology, Biosciences Institute, University of São Paulo
Marília O. Scliar: Human Genome and Stem Cell Research Center, University of São Paulo
Yeda A. O. Duarte: Medical-Surgical Nursing Department, School of Nursing, University of São Paulo
Mayana Zatz: Department of Genetics and Evolutionary Biology, Biosciences Institute, University of São Paulo
Kelly Nunes: Department of Genetics and Evolutionary Biology, Biosciences Institute, University of São Paulo
Silviene F. Oliveira: Human Genetics Laboratory, Institute of Biological Sciences, University of Brasilia

DOI: https://doi.org/10.1038/s41598-022-25521-7
Journal volume & issue: Vol. 12, no. 1
pp. 1 – 12

Abstract

Read online

Abstract The inference of genetic ancestry plays an increasingly prominent role in clinical, population, and forensic genetics studies. Several genotyping strategies and analytical methodologies have been developed over the last few decades to assign individuals to specific biogeographic regions. However, despite these efforts, ancestry inference in populations with a recent history of admixture, such as those in Brazil, remains a challenge. In admixed populations, proportion and components of genetic ancestry vary on different levels: (i) between populations; (ii) between individuals of the same population, and (iii) throughout the individual's genome. The present study evaluated 1171 admixed Brazilian samples to compare the genetic ancestry inferred by tri-/tetra-hybrid admixture models and evaluated different marker sets from those with small numbers of ancestry informative markers panels (AIMs), to high-density SNPs (HDSNP) and whole-genome-sequence (WGS) data. Analyses revealed greater variation in the correlation coefficient of ancestry components within and between admixed populations, especially for minority ancestral components. We also observed positive correlation between the number of markers in the AIMs panel and HDSNP/WGS. Furthermore, the greater the number of markers, the more accurate the tri-/tetra-hybrid admixture models.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal