PLoS ONE (Jan 2015)

Knowledge retrieval from PubMed abstracts and electronic medical records with the Multiple Sclerosis Ontology.

  • Ashutosh Malhotra,
  • Michaela Gündel,
  • Abdul Mateen Rajput,
  • Heinz-Theodor Mevissen,
  • Albert Saiz,
  • Xavier Pastor,
  • Raimundo Lozano-Rubi,
  • Elena H Martinez-Lapiscina,
  • Irati Zubizarreta,
  • Bernd Mueller,
  • Ekaterina Kotelnikova,
  • Luca Toldo,
  • Martin Hofmann-Apitius,
  • Pablo Villoslada

DOI
https://doi.org/10.1371/journal.pone.0116718
Journal volume & issue
Vol. 10, no. 2
p. e0116718

Abstract

Read online

BACKGROUND:In order to retrieve useful information from scientific literature and electronic medical records (EMR) we developed an ontology specific for Multiple Sclerosis (MS). METHODS:The MS Ontology was created using scientific literature and expert review under the Protégé OWL environment. We developed a dictionary with semantic synonyms and translations to different languages for mining EMR. The MS Ontology was integrated with other ontologies and dictionaries (diseases/comorbidities, gene/protein, pathways, drug) into the text-mining tool SCAIView. We analyzed the EMRs from 624 patients with MS using the MS ontology dictionary in order to identify drug usage and comorbidities in MS. Testing competency questions and functional evaluation using F statistics further validated the usefulness of MS ontology. RESULTS:Validation of the lexicalized ontology by means of named entity recognition-based methods showed an adequate performance (F score = 0.73). The MS Ontology retrieved 80% of the genes associated with MS from scientific abstracts and identified additional pathways targeted by approved disease-modifying drugs (e.g. apoptosis pathways associated with mitoxantrone, rituximab and fingolimod). The analysis of the EMR from patients with MS identified current usage of disease modifying drugs and symptomatic therapy as well as comorbidities, which are in agreement with recent reports. CONCLUSION:The MS Ontology provides a semantic framework that is able to automatically extract information from both scientific literature and EMR from patients with MS, revealing new pathogenesis insights as well as new clinical information.