Home // GEOProcessing 2015, The Seventh International Conference on Advanced Geographic Information Systems, Applications, and Services // View article
Geocoding with OpenStreetMap Data
Authors:
Konstantin Clemens
Keywords: Geocoding, Address Search, OSM, Nominatim, Elasticsearch, TF/IDF
Abstract:
OpenStreetMap (OSM) is a platform where users contribute geographic data. To serve multiple use cases, these data are held in a very generic format. This makes processing and indexing OSM data a challenge. Nominatim is an open source search and geocoding engine that consumes OSM data. While Nominatim does process OSM data well, it does not use term frequency – inversed document frequency (TF/IDF) based ranking of search results. Lucene is a framework offering TF/IDF for ranking of indexed documents. In this paper Nominatim’s processing of OSM data is utilized to assemble full addresses with their geocoordinates. These addresses are then indexed in Elasticsearch, a web service on top of Lucene. The resulting TF/IDF based geocoding system is benchmarked in comparison with plain Nominatim. The analysis shows: TF/IDF based ranking yields more accurate results, especially for queries with unordered address elements or only partially specified addresses.
Pages: 1 to 2
Copyright: Copyright (c) IARIA, 2015
Publication date: February 22, 2015
Published in: conference
ISSN: 2308-393X
ISBN: 978-1-61208-383-4
Location: Lisbon, Portugal
Dates: from February 22, 2015 to February 27, 2015