Geo-spatial Information Science (Jul 2018)

Description and characterization of place properties using topic modeling on georeferenced tags

  • Azam R. Bahrehdar,
  • Ross S. Purves

DOI
https://doi.org/10.1080/10095020.2018.1493238
Journal volume & issue
Vol. 21, no. 3
pp. 173 – 184

Abstract

Read online

User-Generated Content (UGC) provides a potential data source which can help us to better describe and understand how places are conceptualized, and in turn better represent the places in Geographic Information Science (GIScience). In this article, we aim at aggregating the shared meanings associated with places and linking these to a conceptual model of place. Our focus is on the metadata of Flickr images, in the form of locations and tags. We use topic modeling to identify regions associated with shared meanings. We choose a grid approach and generate topics associated with one or more cells using Latent Dirichlet Allocation. We analyze the sensitivity of our results to both grid resolution and the chosen number of topics using a range of measures including corpus distance and the coherence value. Using a resolution of 500 m and with 40 topics, we are able to generate meaningful topics which characterize places in London based on 954 unique tags associated with around 300,000 images and more than 7000 individuals.

Keywords