Data in Brief (Dec 2024)

RDF graph pair profile dataset for the data linking communityDataverse

  • Raphaël Conde Salazar,
  • Clément Jonquet,
  • Danai Symeonidou

Journal volume & issue
Vol. 57
p. 111017

Abstract

Read online

As the number of RDF datasets published on the Web grows, it becomes increasingly important to link similar entities across these datasets. We present the “RDF graph pair profiles dataset”, designed to help the data linking community develop tools and carry out evaluation work. This dataset includes profiles of 30 RDF graph pairs, classified according to ontology matching (OM), instance matching (IM) or both (OM + IM). Each profile includes statistical measures and lists of qualitative and quantitative information and descriptive models generated using automated tools. These profiles help in understanding dataset characteristics, facilitating the development, selection and validation of data linking tools. They are particularly useful in machine learning applications where the profiles can serve as input parameters. The dataset includes both the quasi-original RDF graphs and their profiles represented in a specific described format offering a comprehensive resource for researchers and practitioners. The methodology applied to obtain the profiles is also briefly presented.Available publicly (DOI: 10.57745/K7JDGV) this dataset will facilitate data linking, hence contribute to the integration and enhancement of RDF data published in the Web of data.

Keywords