Data Science and Engineering (Jun 2018)

RDF-F: RDF Datatype inFerring Framework

  • Irvin Dongo,
  • Yudith Cardinale,
  • Richard Chbeir

DOI
https://doi.org/10.1007/s41019-018-0064-6
Journal volume & issue
Vol. 3, no. 2
pp. 115 – 135

Abstract

Read online

Abstract In the context of RDF document matching/integration, the datatype information, which is related to literal objects, is an important aspect to be analyzed in order to better determine similar RDF documents. In this paper, we present an RDF Datatype in Ferring Framework, called RDF-F, which provides two independent datatype inference processes: 1) a four-step process consisting of (i) a predicate information analysis (i.e., deduce the datatype from existing range property), (ii) an analysis of the object value itself by a pattern-matching process (i.e., recognize the object lexical space), (iii) a semantic analysis of the predicate name and its context, and (iv) generalization of Numeric and Binary datatypes to ensure the integration; and 2) a non-ambiguous lexical-space-matching process, where literal values are inferred by the modification of their representation, following new lexical spaces. We evaluated the performance and the accuracy of both processes with datasets from DBpedia. Results show that the execution time of both indicators is linear and their accuracy can increase up to 97.10 and 99.30%, respectively.

Keywords