Measuring the Impact of Natural Hazards with Citizen Science: The Case of Flooded Area Estimation Using Twitter

Pierrick Bruneau; Etienne Brangbour; Stéphane Marchand-Maillet; Renaud Hostache; Marco Chini; Ramona-Maria Pelich; Patrick Matgen; Thomas Tamisier

doi:10.3390/rs13061153

Remote Sensing (Mar 2021)

Measuring the Impact of Natural Hazards with Citizen Science: The Case of Flooded Area Estimation Using Twitter

Pierrick Bruneau,
Etienne Brangbour,
Stéphane Marchand-Maillet,
Renaud Hostache,
Marco Chini,
Ramona-Maria Pelich,
Patrick Matgen,
Thomas Tamisier

Affiliations

Pierrick Bruneau: Luxembourg Institute of Science and Technology, L-4361 Esch-sur-Alzette, Luxembourg
Etienne Brangbour: Luxembourg Institute of Science and Technology, L-4361 Esch-sur-Alzette, Luxembourg
Stéphane Marchand-Maillet: Computer Science Department, University of Geneva, 1227 Carouge, Switzerland
Renaud Hostache: Luxembourg Institute of Science and Technology, L-4361 Esch-sur-Alzette, Luxembourg
Marco Chini: Luxembourg Institute of Science and Technology, L-4361 Esch-sur-Alzette, Luxembourg
Ramona-Maria Pelich: Luxembourg Institute of Science and Technology, L-4361 Esch-sur-Alzette, Luxembourg
Patrick Matgen: Luxembourg Institute of Science and Technology, L-4361 Esch-sur-Alzette, Luxembourg
Thomas Tamisier: Luxembourg Institute of Science and Technology, L-4361 Esch-sur-Alzette, Luxembourg

DOI: https://doi.org/10.3390/rs13061153
Journal volume & issue: Vol. 13, no. 6
p. 1153

Abstract

Read online

Twitter has significant potential as a source of Volunteered Geographic Information (VGI), as its content is updated at high frequency, with high availability thanks to dedicated interfaces. However, the diversity of content types and the low average accuracy of geographic information attached to individual tweets remain obstacles in this context. The contributions in this paper relate to the general goal of extracting actionable information regarding the impact of natural hazards on a specific region from social platforms, such as Twitter. Specifically, our contributions describe the construction of a model classifying whether given spatio-temporal coordinates, materialized by raster cells in a remote sensing context, lie in a flooded area. For training, remotely sensed data are used as the target variable, and the input covariates are built on the sole basis of textual and spatial data extracted from a Twitter corpus. Our contributions enable the use of trained models for arbitrary new Twitter corpora collected for the same region, but at different times, allowing for the construction of a flooded area measurement proxy available at a higher temporal frequency. Experimental validation uses true data that were collected during Hurricane Harvey, which caused significant flooding in the Houston urban area between mid-August and mid-September 2017. Our experimental section compares several spatial information extraction methods, as well as various textual representation and aggregation techniques, which were applied to the collected Twitter data. The best configuration yields a F1 score of 0.425, boosted to 0.834 if restricted to the 10% most confident predictions.

Published in Remote Sensing

ISSN: 2072-4292 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science
Website: http://www.mdpi.com/journal/remotesensing/

About the journal

Abstract

Keywords