Journal of Biomedical Semantics (Jun 2024)

Empowering standardization of cancer vaccines through ontology: enhanced modeling and data analysis

  • Jie Zheng,
  • Xingxian Li,
  • Anna Maria Masci,
  • Hayleigh Kahn,
  • Anthony Huffman,
  • Eliyas Asfaw,
  • Yuanyi Pan,
  • Jinjing Guo,
  • Virginia He,
  • Justin Song,
  • Andrey I. Seleznev,
  • Asiyah Yu Lin,
  • Yongqun He

DOI
https://doi.org/10.1186/s13326-024-00312-3
Journal volume & issue
Vol. 15, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Background The exploration of cancer vaccines has yielded a multitude of studies, resulting in a diverse collection of information. The heterogeneity of cancer vaccine data significantly impedes effective integration and analysis. While CanVaxKB serves as a pioneering database for over 670 manually annotated cancer vaccines, it is important to distinguish that a database, on its own, does not offer the structured relationships and standardized definitions found in an ontology. Recognizing this, we expanded the Vaccine Ontology (VO) to include those cancer vaccines present in CanVaxKB that were not initially covered, enhancing VO’s capacity to systematically define and interrelate cancer vaccines. Results An ontology design pattern (ODP) was first developed and applied to semantically represent various cancer vaccines, capturing their associated entities and relations. By applying the ODP, we generated a cancer vaccine template in a tabular format and converted it into the RDF/OWL format for generation of cancer vaccine terms in the VO. ‘12MP vaccine’ was used as an example of cancer vaccines to demonstrate the application of the ODP. VO also reuses reference ontology terms to represent entities such as cancer diseases and vaccine hosts. Description Logic (DL) and SPARQL query scripts were developed and used to query for cancer vaccines based on different vaccine’s features and to demonstrate the versatility of the VO representation. Additionally, ontological modeling was applied to illustrate cancer vaccine related concepts and studies for in-depth cancer vaccine analysis. A cancer vaccine-specific VO view, referred to as “CVO,” was generated, and it contains 928 classes including 704 cancer vaccines. The CVO OWL file is publicly available on: http://purl.obolibrary.org/obo/vo/cvo.owl , for sharing and applications. Conclusion To facilitate the standardization, integration, and analysis of cancer vaccine data, we expanded the Vaccine Ontology (VO) to systematically model and represent cancer vaccines. We also developed a pipeline to automate the inclusion of cancer vaccines and associated terms in the VO. This not only enriches the data’s standardization and integration, but also leverages ontological modeling to deepen the analysis of cancer vaccine information, maximizing benefits for researchers and clinicians. Availability The VO-cancer GitHub website is: https://github.com/vaccineontology/VO/tree/master/CVO .

Keywords