Comparing the quality of crowdsourced data contributed by expert and non-experts.

Linda See; Alexis Comber; Carl Salk; Steffen Fritz; Marijn van der Velde; Christoph Perger; Christian Schill; Ian McCallum; Florian Kraxner; Michael Obersteiner

doi:10.1371/journal.pone.0069958

PLoS ONE (Jan 2013)

Comparing the quality of crowdsourced data contributed by expert and non-experts.

Linda See,
Alexis Comber,
Carl Salk,
Steffen Fritz,
Marijn van der Velde,
Christoph Perger,
Christian Schill,
Ian McCallum,
Florian Kraxner,
Michael Obersteiner

Affiliations

Linda See
Alexis Comber
Carl Salk
Steffen Fritz
Marijn van der Velde
Christoph Perger
Christian Schill
Ian McCallum
Florian Kraxner
Michael Obersteiner

DOI: https://doi.org/10.1371/journal.pone.0069958
Journal volume & issue: Vol. 8, no. 7
p. e69958

Abstract

Read online

There is currently a lack of in-situ environmental data for the calibration and validation of remotely sensed products and for the development and verification of models. Crowdsourcing is increasingly being seen as one potentially powerful way of increasing the supply of in-situ data but there are a number of concerns over the subsequent use of the data, in particular over data quality. This paper examined crowdsourced data from the Geo-Wiki crowdsourcing tool for land cover validation to determine whether there were significant differences in quality between the answers provided by experts and non-experts in the domain of remote sensing and therefore the extent to which crowdsourced data describing human impact and land cover can be used in further scientific research. The results showed that there was little difference between experts and non-experts in identifying human impact although results varied by land cover while experts were better than non-experts in identifying the land cover type. This suggests the need to create training materials with more examples in those areas where difficulties in identification were encountered, and to offer some method for contributors to reflect on the information they contribute, perhaps by feeding back the evaluations of their contributed data or by making additional training materials available. Accuracies were also found to be higher when the volunteers were more consistent in their responses at a given location and when they indicated higher confidence, which suggests that these additional pieces of information could be used in the development of robust measures of quality in the future.

Published in PLoS ONE

ISSN: 1932-6203 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Medicine; Science
Website: https://journals.plos.org/plosone/

About the journal