Journal of Biomedical Semantics (Dec 2017)

Matching disease and phenotype ontologies in the ontology alignment evaluation initiative

  • Ian Harrow,
  • Ernesto Jiménez-Ruiz,
  • Andrea Splendiani,
  • Martin Romacker,
  • Peter Woollard,
  • Scott Markel,
  • Yasmin Alam-Faruque,
  • Martin Koch,
  • James Malone,
  • Arild Waaler

DOI
https://doi.org/10.1186/s13326-017-0162-9
Journal volume & issue
Vol. 8, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Background The disease and phenotype track was designed to evaluate the relative performance of ontology matching systems that generate mappings between source ontologies. Disease and phenotype ontologies are important for applications such as data mining, data integration and knowledge management to support translational science in drug discovery and understanding the genetics of disease. Results Eleven systems (out of 21 OAEI participating systems) were able to cope with at least one of the tasks in the Disease and Phenotype track. AML, FCA-Map, LogMap(Bio) and PhenoMF systems produced the top results for ontology matching in comparison to consensus alignments. The results against manually curated mappings proved to be more difficult most likely because these mapping sets comprised mostly subsumption relationships rather than equivalence. Manual assessment of unique equivalence mappings showed that AML, LogMap(Bio) and PhenoMF systems have the highest precision results. Conclusions Four systems gave the highest performance for matching disease and phenotype ontologies. These systems coped well with the detection of equivalence matches, but struggled to detect semantic similarity. This deserves more attention in the future development of ontology matching systems. The findings of this evaluation show that such systems could help to automate equivalence matching in the workflow of curators, who maintain ontology mapping services in numerous domains such as disease and phenotype.

Keywords