Foundations of Computing and Decision Sciences (Feb 2022)

ReqTagger: A Rule-Based Tagger for Automatic Glossary of Terms Extraction from Ontology Requirements

  • Wiśniewski Dawid,
  • Potoniec Jędrzej,
  • Ławrynowicz Agnieszka

DOI
https://doi.org/10.2478/fcds-2022-0003
Journal volume & issue
Vol. 47, no. 1
pp. 65 – 86

Abstract

Glossary of Terms extraction from textual requirements is an important step in ontology engineering methodologies. Although it was originally intended to be performed manually, recent years have shown that some degree of automation is possible. Building on these promising approaches, we introduce a novel, human-interpretable, rule-based method named ReqTagger, which automatically extracts candidates for ontology entities (classes or instances) and relations (data or object properties) from textual requirements. We compare ReqTagger to existing automatic methods on an evaluation benchmark consisting of over 550 requirements tagged with over 1700 entities and relations expected to be extracted. We discuss the quality of ReqTagger and provide details showing why it outperforms other methods. We also publish both the evaluation dataset and the implementation of ReqTagger.

Keywords