Frontiers in Genetics (Jun 2023)

Performance of SNP barcodes to determine genetic diversity and population structure of Plasmodium falciparum in Africa

  • Dionne C. Argyropoulos,
  • Mun Hua Tan,
  • Courage Adobor,
  • Benedicta Mensah,
  • Frédéric Labbé,
  • Kathryn E. Tiedje,
  • Kwadwo A. Koram,
  • Anita Ghansah,
  • Karen P. Day

DOI
https://doi.org/10.3389/fgene.2023.1071896
Journal volume & issue
Vol. 14

Abstract

Read online

Panels of informative biallelic single nucleotide polymorphisms (SNPs) have been proposed to be an economical method to fast-track the population genetic analysis of Plasmodium falciparum in malaria-endemic areas. Whilst used successfully in low-transmission areas where infections are monoclonal and highly related, we present the first study to evaluate the performance of these 24- and 96-SNP molecular barcodes in African countries, characterised by moderate-to-high transmission, where multiclonal infections are prevalent. For SNP barcodes it is generally recommended that the SNPs chosen i) are biallelic, ii) have a minor allele frequency greater than 0.10, and iii) are independently segregating, to minimise bias in the analysis of genetic diversity and population structure. Further, to be standardised and used in many population genetic studies, these barcodes should maintain characteristics i) to iii) across various iv) geographies and v) time points. Using haplotypes generated from the MalariaGEN P. falciparum Community Project version six database, we investigated the ability of these two barcodes to fulfil these criteria in moderate-to-high transmission African populations in 25 sites across 10 countries. Predominantly clinical infections were analysed, with 52.3% found to be multiclonal, generating high proportions of mixed-allele calls (MACs) per isolate thereby impeding haplotype construction. Of the 24- and 96-SNPs, loci were removed if they were not biallelic and had low minor allele frequencies in all study populations, resulting in 20- and 75-SNP barcodes respectively for downstream population genetics analysis. Both SNP barcodes had low expected heterozygosity estimates in these African settings and consequently biased analyses of similarity. Both minor and major allele frequencies were temporally unstable. These SNP barcodes were also shown to identify weak genetic differentiation across large geographic distances based on Mantel Test and DAPC. These results demonstrate that these SNP barcodes are vulnerable to ascertainment bias and as such cannot be used as a standardised approach for malaria surveillance in moderate-to-high transmission areas in Africa, where the greatest genomic diversity of P. falciparum exists at local, regional and country levels.

Keywords