Frontiers in Immunology (Apr 2023)

Prediction of HLA genotypes from single-cell transcriptome data

  • Benjamin D. Solomon,
  • Hong Zheng,
  • Hong Zheng,
  • Laura W. Dillon,
  • Jason D. Goldman,
  • Jason D. Goldman,
  • Jason D. Goldman,
  • Christopher S. Hourigan,
  • James R. Heath,
  • James R. Heath,
  • Purvesh Khatri,
  • Purvesh Khatri

DOI
https://doi.org/10.3389/fimmu.2023.1146826
Journal volume & issue
Vol. 14

Abstract

Read online

The human leukocyte antigen (HLA) locus plays a central role in adaptive immune function and has significant clinical implications for tissue transplant compatibility and allelic disease associations. Studies using bulk-cell RNA sequencing have demonstrated that HLA transcription may be regulated in an allele-specific manner and single-cell RNA sequencing (scRNA-seq) has the potential to better characterize these expression patterns. However, quantification of allele-specific expression (ASE) for HLA loci requires sample-specific reference genotyping due to extensive polymorphism. While genotype prediction from bulk RNA sequencing is well described, the feasibility of predicting HLA genotypes directly from single-cell data is unknown. Here we evaluate and expand upon several computational HLA genotyping tools by comparing predictions from human single-cell data to gold-standard, molecular genotyping. The highest 2-field accuracy averaged across all loci was 76% by arcasHLA and increased to 86% using a composite model of multiple genotyping tools. We also developed a highly accurate model (AUC 0.93) for predicting HLA-DRB345 copy number in order to improve genotyping accuracy of the HLA-DRB locus. Genotyping accuracy improved with read depth and was reproducible at repeat sampling. Using a metanalytic approach, we also show that HLA genotypes from PHLAT and OptiType can generate ASE ratios that are highly correlated (R2 = 0.8 and 0.94, respectively) with those derived from gold-standard genotyping.

Keywords