Challenges as enablers for high quality Linked Data: insights from the Semantic Publishing Challenge

Anastasia Dimou; Sahar Vahdati; Angelo Di Iorio; Christoph Lange; Ruben Verborgh; Erik Mannens

doi:10.7717/peerj-cs.105

PeerJ Computer Science (Jan 2017)

Challenges as enablers for high quality Linked Data: insights from the Semantic Publishing Challenge

Anastasia Dimou,
Sahar Vahdati,
Angelo Di Iorio,
Christoph Lange,
Ruben Verborgh,
Erik Mannens

Affiliations

Anastasia Dimou: Faculty of Engineering and Architecture, Ghent University, Ghent, Belgium
Sahar Vahdati: Department of Intelligent Systems, University of Bonn, Bonn, Germany
Angelo Di Iorio: Department of Computer Science and Engineering, University of Bologna, Bologna, Italy
Christoph Lange: Department of Intelligent Systems, University of Bonn, Bonn, Germany
Ruben Verborgh: Faculty of Engineering and Architecture, Ghent University, Ghent, Belgium
Erik Mannens: Faculty of Engineering and Architecture, Ghent University, Ghent, Belgium

DOI: https://doi.org/10.7717/peerj-cs.105
Journal volume & issue: Vol. 3
p. e105

Abstract

Read online Read online

While most challenges organized so far in the Semantic Web domain are focused on comparing tools with respect to different criteria such as their features and competencies, or exploiting semantically enriched data, the Semantic Web Evaluation Challenges series, co-located with the ESWC Semantic Web Conference, aims to compare them based on their output, namely the produced dataset. The Semantic Publishing Challenge is one of these challenges. Its goal is to involve participants in extracting data from heterogeneous sources on scholarly publications, and producing Linked Data that can be exploited by the community itself. This paper reviews lessons learned from both (i) the overall organization of the Semantic Publishing Challenge, regarding the definition of the tasks, building the input dataset and forming the evaluation, and (ii) the results produced by the participants, regarding the proposed approaches, the used tools, the preferred vocabularies and the results produced in the three editions of 2014, 2015 and 2016. We compared these lessons to other Semantic Web Evaluation Challenges. In this paper, we (i) distill best practices for organizing such challenges that could be applied to similar events, and (ii) report observations on Linked Data publishing derived from the submitted solutions. We conclude that higher quality may be achieved when Linked Data is produced as a result of a challenge, because the competition becomes an incentive, while solutions become better with respect to Linked Data publishing best practices when they are evaluated against the rules of the challenge.

Published in PeerJ Computer Science

ISSN: 2376-5992 (Online)
Publisher: PeerJ Inc.
Country of publisher: United States
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://peerj.com/computer-science/

About the journal

Abstract

Keywords