Applied Sciences (Aug 2021)

An Ontology to Model the International Rules for Multiple Primary Malignant Tumours in Cancer Registration

  • Nicholas Charles Nicholson,
  • Francesco Giusti,
  • Manola Bettio,
  • Raquel Negrao Carvalho,
  • Nadya Dimitrova,
  • Tadeusz Dyba,
  • Manuela Flego,
  • Luciana Neamtiu,
  • Giorgia Randi,
  • Carmen Martos

DOI
https://doi.org/10.3390/app11167233
Journal volume & issue
Vol. 11, no. 16
p. 7233

Abstract

Population-based cancer registry data provide a key epidemiological resource for monitoring cancer in defined populations. Validation of the data variables contributing to a common data set is necessary to remove statistical bias; the process is currently performed centrally. An ontology-based approach promises advantages in devolving the validation process to the registry level, but the checks regarding multiple primary tumours have presented a hurdle. This work presents a solution by modelling the international rules for multiple primary cancers in description logic. Topography groupings described in the rules had to be further categorised in order to simplify the axioms. Description logic expressivity was constrained as far as possible for reasons of automatic reasoning performance. The axioms were consistently able to trap all the different types of scenarios signalling violation of the rules. Batch processing of many records was performed using the Web Ontology Language application programming interface (OWL API). Performance issues with large data sets were circumvented by using the software interface to perform the reasoning operations on the basis of the axioms encoded in the ontology. These results remove one remaining hurdle in developing a purely ontology-based solution for performing the European harmonised data-quality checks, with a number of inherent advantages including the formalisation and integration of the validation rules within the domain data model itself.

Keywords