Wellcome Open Research (Sep 2021)

Haplotype heterogeneity and low linkage disequilibrium reduce reliable prediction of genotypes for the ‑α3.7I form of α-thalassaemia using genome-wide microarray data [version 2; peer review: 2 approved]

  • Carolyne M. Ndila,
  • Vysaul Nyirongo,
  • Alexander W. Macharia,
  • Anna E. Jeffreys,
  • Kate Rowlands,
  • Christina Hubbart,
  • George B. J. Busby,
  • Gavin Band,
  • Rosalind M. Harding,
  • Kirk A. Rockett,
  • Thomas N. Williams,
  • MalariaGEN Consortium

DOI
https://doi.org/10.12688/wellcomeopenres.16320.2
Journal volume & issue
Vol. 5

Abstract

Background: The -α3.7I-thalassaemia deletion is very common throughout Africa because it protects against malaria. When undertaking studies to investigate human genetic adaptations to malaria or other diseases, it is important to account for any confounding effects of α-thalassaemia to rule out spurious associations.

Methods: In this study, we used direct α-thalassaemia genotyping to understand why GWAS data from a large malaria association study in Kilifi, Kenya, did not identify the α-thalassaemia signal. We then explored a number of new approaches to imputing α-thalassaemia from GWAS data as an alternative to direct genotyping by PCR.

Results: We found very low linkage disequilibrium between the directly typed α-thalassaemia genotypes and the GWAS SNP markers around the deletion and across the haemoglobin-alpha (HBA) gene region, which, along with a complex haplotype structure, could explain the lack of an association signal from the GWAS SNP data. Some indirect typing methods gave results that were in broad agreement with those derived from direct genotyping and could identify an association signal, but none were sufficiently accurate to allow correct interpretation compared with direct typing, leading to confusing or erroneous results.

Conclusions: We conclude that direct typing methods such as PCR will still be required to account for α-thalassaemia in GWAS.
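
The linkage disequilibrium referred to in the Results is often summarised as r², the squared correlation between genotype dosages at two loci. The sketch below is an illustration only and is not the authors' pipeline: the function name `genotype_r2` and the dosage vectors are hypothetical, made-up values, and dosages are assumed to be coded as 0, 1 or 2 copies of the deletion or of the SNP alternate allele.

```python
from statistics import mean


def genotype_r2(a, b):
    """Squared Pearson correlation between two genotype dosage vectors."""
    if len(a) != len(b) or len(a) < 2:
        raise ValueError("genotype vectors must have equal length >= 2")
    mean_a, mean_b = mean(a), mean(b)
    # Sums of cross-products and squares about the means
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return float("nan")  # r^2 is undefined for a monomorphic marker
    return cov * cov / (var_a * var_b)


# Hypothetical dosages for eight individuals (NOT data from the study)
deletion = [0, 1, 2, 1, 0, 2, 1, 0]   # directly typed deletion genotype
tag_snp = [0, 1, 2, 1, 0, 2, 1, 0]    # a SNP that tracks the deletion perfectly
weak_snp = [1, 0, 1, 2, 1, 0, 0, 1]   # a SNP that barely tracks the deletion

print(genotype_r2(deletion, tag_snp))
print(genotype_r2(deletion, weak_snp))
```

A SNP with r² close to 1 acts as a good proxy for the deletion, whereas a SNP with low r² carries little information about it; the abstract describes the second situation for the array SNPs around the HBA region.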