Frontiers in Genetics (Sep 2024)

Quantifying uncertainty of molecular mismatch introduced by mislabeled ancestry using haplotype-based HLA genotype imputation

  • Benedict M. Matern,
  • Eric Spierings,
  • Selle Bandstra,
  • Abeer Madbouly,
  • Stefan Schaub,
  • Stefan Schaub,
  • Stefan Schaub,
  • Eric T. Weimer,
  • Eric T. Weimer,
  • Matthias Niemann

DOI
https://doi.org/10.3389/fgene.2024.1444554
Journal volume & issue
Vol. 15

Abstract

Read online

IntroductionModern histocompatibility algorithms depend on the comparison and analysis of high-resolution HLA protein sequences and structures, especially when considering epitope-based algorithms, which aim to model the interactions involved in antibody or T cell binding. HLA genotype imputation can be performed in the cases where only low/intermediate-resolution HLA genotype is available or if specific loci are missing, and by providing an individuals’ race/ethnicity/ancestry information, imputation results can be more accurate. This study assesses the effect of imputing high-resolution genotypes on molecular mismatch scores under a variety of ancestry assumptions.MethodsWe compared molecular matching scores from “ground-truth” high-resolution genotypes against scores from genotypes which are imputed from low-resolution genotypes. Analysis was focused on a simulated patient-donor dataset and confirmed using two real-world datasets, and deviations were aggregated based on various ancestry assumptions.ResultsWe observed that using multiple imputation generally results in lower error in molecular matching scores compared to single imputation, and that using the correct ancestry assumptions can reduce error introduced during imputation.DiscussionWe conclude that for epitope analysis, imputation is a valuable and low-risk strategy, as long as care is taken regarding epitope analysis context, ancestry assumptions, and (multiple) imputation strategy.

Keywords