Home // GEOProcessing 2012, The Fourth International Conference on Advanced Geographic Information Systems, Applications, and Services // View article


Automated Extraction and Geographical Structuring of Flickr Tags

Authors:
Omair Chaudhry
William Mackaness

Keywords: data mining; information retrieval; vernacular geography; granularity modelling

Abstract:
The volume and potential value of user generated content (UGC) is ever growing. One such source is geotagged images on Flickr. Typically, images on Flickr are tagged with location and attribute information variously describing location, events or objects in the image. Though inconsistent and ‘noisy’, the terms can reflect concepts at a range of geographic scales. From a spatial data integration perspective, the information relating to ‘place’ is of primary interest and the challenge is in selecting the most appropriate tag(s) that best describe the geography of the image. This paper presents a methodology for searching among the ‘tag noise’ in order to identify the most appropriate tags across a range of scales, by varying the size of the sampling area within which Flickr imagery falls. This is applied in the context of urban environments. Empirical analysis was then used to assess the correctness of the chosen tags (whether the tag correctly described the geographic region in which the image was taken). Logistic regression was then used to build a model that could assign a probability or confidence value to each selected tag as being a appropriate geographic tag. The high correlation values achieved bodes well for automated environments - environments in which this methodology could be used to automatically select meaningful tags and hierarchically structure UGC in order that it can be semantically integrated with other data sources.

Pages: 134 to 139

Copyright: Copyright (c) IARIA, 2012

Publication date: January 30, 2012

Published in: conference

ISSN: 2308-393X

ISBN: 978-1-61208-178-6

Location: Valencia, Spain

Dates: from January 30, 2012 to February 4, 2012