BMC Bioinformatics (Feb 2010)

Word add-in for ontology recognition: semantic enrichment of scientific literature

  • Naim Oscar,
  • Wade Alex,
  • Parastatidis Savas,
  • Chandran Rahul,
  • Fernicola Pablo,
  • Fink J Lynn,
  • Quinn Gregory B,
  • Bourne Philip E

DOI
https://doi.org/10.1186/1471-2105-11-103
Journal volume & issue
Vol. 11, no. 1
p. 103

Abstract

Background
In the current era of scientific research, efficient communication of information is paramount. As such, the nature of scholarly and scientific communication is changing; cyberinfrastructure is now absolutely necessary and new media are allowing information and knowledge to be more interactive and immediate. One approach to making knowledge more accessible is the addition of machine-readable semantic data to scholarly articles.

Results
The Word add-in presented here will assist authors in this effort by automatically recognizing and highlighting words or phrases that are likely information-rich, allowing authors to associate semantic data with those words or phrases, and to embed that data in the document as XML. The add-in and source code are publicly available at http://www.codeplex.com/UCSDBioLit.

Conclusions
The Word add-in for ontology term recognition makes it possible for an author to add semantic data to a document as it is being written, and it encodes these data using XML tags that are effectively a standard in life sciences literature. Allowing authors to mark up their own work will help increase the amount and quality of machine-readable literature metadata.