ISPRS International Journal of Geo-Information (Aug 2020)

OSMWatchman: Learning How to Detect Vandalized Contributions in OSM Using a Random Forest Classifier

  • Quy Thy Truong,
  • Guillaume Touya,
  • Cyril de Runz

DOI
https://doi.org/10.3390/ijgi9090504
Journal volume & issue
Vol. 9, no. 9
p. 504

Abstract

Read online

Though Volunteered Geographic Information (VGI) has the advantage of providing free open spatial data, it is prone to vandalism, which may heavily decrease the quality of these data. Therefore, detecting vandalism in VGI may constitute a first way of assessing the data in order to improve their quality. This article explores the ability of supervised machine learning approaches to detect vandalism in OpenStreetMap (OSM) in an automated way. For this purpose, our work includes the construction of a corpus of vandalism data, given that no OSM vandalism corpus is available so far. Then, we investigate the ability of random forest methods to detect vandalism on the created corpus. Experimental results show that random forest classifiers perform well in detecting vandalism in the same geographical regions that were used for training the model and has more issues with vandalism detection in “unfamiliar regions”.

Keywords