Description and characterization of place properties using topic modeling on georeferenced tags

User-Generated Content (UGC) provides a potential data source which can help us to better describe and understand how places are conceptualized, and in turn better represent the places in Geographic Information Science (GIScience). In this article, we aim at aggregating the shared meanings associate...

Full description

Bibliographic Details
Main Authors: Azam R. Bahrehdar, Ross S. Purves
Format: Article
Language:English
Published: Taylor & Francis Group 2018-07-01
Series:Geo-spatial Information Science
Subjects:
Online Access:http://dx.doi.org/10.1080/10095020.2018.1493238
Description
Summary:User-Generated Content (UGC) provides a potential data source which can help us to better describe and understand how places are conceptualized, and in turn better represent the places in Geographic Information Science (GIScience). In this article, we aim at aggregating the shared meanings associated with places and linking these to a conceptual model of place. Our focus is on the metadata of Flickr images, in the form of locations and tags. We use topic modeling to identify regions associated with shared meanings. We choose a grid approach and generate topics associated with one or more cells using Latent Dirichlet Allocation. We analyze the sensitivity of our results to both grid resolution and the chosen number of topics using a range of measures including corpus distance and the coherence value. Using a resolution of 500 m and with 40 topics, we are able to generate meaningful topics which characterize places in London based on 954 unique tags associated with around 300,000 images and more than 7000 individuals.
ISSN:1009-5020
1993-5153