Nature Communications (Jul 2024)

Multi-modal generative modeling for joint analysis of single-cell T cell receptor and gene expression data

  • Felix Drost,
  • Yang An,
  • Irene Bonafonte-Pardàs,
  • Lisa M. Dratva,
  • Rik G. H. Lindeboom,
  • Muzlifah Haniffa,
  • Sarah A. Teichmann,
  • Fabian Theis,
  • Mohammad Lotfollahi,
  • Benjamin Schubert

DOI
https://doi.org/10.1038/s41467-024-49806-9
Journal volume & issue
Vol. 15, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Recent advances in single-cell immune profiling have enabled the simultaneous measurement of transcriptome and T cell receptor (TCR) sequences, offering great potential for studying immune responses at the cellular level. However, integrating these diverse modalities across datasets is challenging due to their unique data characteristics and technical variations. Here, to address this, we develop the multimodal generative model mvTCR to fuse modality-specific information across transcriptome and TCR into a shared representation. Our analysis demonstrates the added value of multimodal over unimodal approaches to capture antigen specificity. Notably, we use mvTCR to distinguish T cell subpopulations binding to SARS-CoV-2 antigens from bystander cells. Furthermore, when combined with reference mapping approaches, mvTCR can map newly generated datasets to extensive T cell references, facilitating knowledge transfer. In summary, we envision mvTCR to enable a scalable analysis of multimodal immune profiling data and advance our understanding of immune responses.