npj Systems Biology and Applications (Apr 2024)

Positive-unlabeled learning identifies vaccine candidate antigens in the malaria parasite Plasmodium falciparum

  • Renee Ti Chou,
  • Amed Ouattara,
  • Matthew Adams,
  • Andrea A. Berry,
  • Shannon Takala-Harrison,
  • Michael P. Cummings

DOI
https://doi.org/10.1038/s41540-024-00365-1
Journal volume & issue
Vol. 10, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Malaria vaccine development is hampered by extensive antigenic variation and complex life stages of Plasmodium species. Vaccine development has focused on a small number of antigens, many of which were identified without utilizing systematic genome-level approaches. In this study, we implement a machine learning-based reverse vaccinology approach to predict potential new malaria vaccine candidate antigens. We assemble and analyze P. falciparum proteomic, structural, functional, immunological, genomic, and transcriptomic data, and use positive-unlabeled learning to predict potential antigens based on the properties of known antigens and remaining proteins. We prioritize candidate antigens based on model performance on reference antigens with different genetic diversity and quantify the protein properties that contribute most to identifying top candidates. Candidate antigens are characterized by gene essentiality, gene ontology, and gene expression in different life stages to inform future vaccine development. This approach provides a framework for identifying and prioritizing candidate vaccine antigens for a broad range of pathogens.