Introduction

OSMNames is an open source tool for creating geographical gazetteer data from OpenStreetMap (OSM) files.

There is a need for a data set consisting of street names of the world. Such gazetteer data, however, is either not available for every country (openaddresses.io) or is not in a suitable format. Furthermore, where such data exists, it is often not free of charge. A global data set can be downloaded at https://osmnames.org.

An example of how the data looks in a geocoder is available at https://osmnames.org.

What can I do with OSMNames?

With OSMNames, you can create your own geocoder data set based on OpenStreetMap. It currently includes all addresses available. For each feature, the hierarchy, as well as a Wikipedia-based importance, is calculated.

Where to Start?

To download the latest data set, go to https://osmnames.org.

To process OpenStreetMap data yourself, check out the Getting Started document.

If you want to have a look at the Source Code or contribute to the project, check out the Development documentation. The source code is available in our GitHub Repository.

Output Format

The exported file geonames.tsv contains the following columns:

Column name Description
name The name of the feature (default language is en, others available are de, es, fr, ru, zh)
alternative_names All other available and distinct names separated by commas
osm_type The OSM type of the feature (node, way, relation)
osm_id The unique osm_id, used as the identifier for the house numbers in the second file housenumbers.tsv
class The class of the feature e.g. boundary
type The type of the feature e.g. administrative
lon The decimal degrees (WGS84) longitude of the centroid of the feature
lat The decimal degrees (WGS84) latitude of the centroid of the feature
place_rank Rank from 1-30 ascending, 1 being the highest. Calculated with the type and class of the feature.
importance Importance of the feature, ranging [0.0-1.0], 1.0 being the most important.
street The name of the street if the feature is some kind of street
city The name of the city of the feature, if it has one
county The name of the county of the feature, if it has one
state The name of the state of the feature, if it has one
country The name of the country of the feature
country_code The ISO-3166 2-letter country code of the feature
display_name The display name of the feature representing the hierarchy, if available in English
west The western decimal degrees (WGS84) longitude of the bounding box of the feature
south The southern decimal degrees (WGS84) latitude of the bounding box of the feature
east The eastern decimal degrees (WGS84) longitude of the bounding box of the feature
north The northern decimal degrees (WGS84) latitude of the bounding box of the feature
wikidata The Wikidata entry associated with the feature
wikipedia The Wikipedia URL associated with the feature
housenumbers All house numbers, comma separated, associated with this element. Coordinates of the house numbers are part of the second output file housenumbers.tsv

Note

All coordinates are rounded to seven digits after the decimal point.

Note

The housenumbers column duplicates the house numbers contained in the file housenumbers.tsv. This redundancy is accepted because it benefits full-text search in geocoders.
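
As a rough illustration of the column layout above, the following sketch reads a geonames.tsv-style file with Python's standard csv module. It assumes the file is tab-separated and starts with a header row containing the column names listed above; the function name read_geonames and the sample row are made up for this example and are not part of OSMNames.

```python
import csv
import io

# Columns used in this sketch; a real export has more (see the table above).
COLUMNS = ["name", "alternative_names", "osm_type", "osm_id", "class", "type",
           "lon", "lat", "place_rank", "importance", "housenumbers"]

def read_geonames(handle):
    """Yield one dict per feature (hypothetical helper, not part of OSMNames)."""
    # QUOTE_NONE: treat quote characters inside names as ordinary text.
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        # Numeric columns arrive as text, so convert them explicitly.
        row["lon"] = float(row["lon"])
        row["lat"] = float(row["lat"])
        row["importance"] = float(row["importance"])
        row["place_rank"] = int(row["place_rank"])
        # Both of these columns hold comma-separated lists.
        for key in ("alternative_names", "housenumbers"):
            row[key] = [v for v in (row.get(key) or "").split(",") if v]
        yield row

# Made-up sample data: assumes a header row followed by one feature.
SAMPLE = "\t".join(COLUMNS) + "\n" + "\t".join([
    "Example Street", "Example St,Beispielstrasse", "way", "1001",
    "highway", "residential", "8.5391825", "47.3686498", "26", "0.1",
    "1,3,5a",
]) + "\n"

features = list(read_geonames(io.StringIO(SAMPLE)))
most_important = max(features, key=lambda r: r["importance"])
```

For a real export, the same function could be given a file opened with open(path, encoding="utf-8", newline=""). Splitting the housenumbers column this way only yields the numbers themselves; their coordinates are in housenumbers.tsv.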

The second file housenumbers.tsv contains the following columns:

Column name Description
osm_id The unique osm_id for debug purposes
street_id The osm_id of the element the house number is associated with
street The name of the street the house number is associated with, for debug purposes
housenumber The actual house number
lon The decimal degrees (WGS84) longitude of the centroid of the house number
lat The decimal degrees (WGS84) latitude of the centroid of the house number
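
Because street_id holds the osm_id of the associated element, the two files can be joined on that value. The sketch below shows one possible way to look up the coordinates of a house number on a given street. It assumes housenumbers.tsv is tab-separated with a header row; the helper names index_housenumbers and locate and the sample rows are hypothetical.

```python
import csv
import io
from collections import defaultdict

def index_housenumbers(handle):
    """Group house number rows by the street_id they belong to."""
    by_street = defaultdict(list)
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        by_street[row["street_id"]].append(
            (row["housenumber"], float(row["lon"]), float(row["lat"]))
        )
    return by_street

def locate(by_street, street_osm_id, housenumber):
    """Return (lon, lat) for a house number on a street, or None if unknown."""
    for number, lon, lat in by_street.get(street_osm_id, []):
        if number == housenumber:
            return lon, lat
    return None

# Made-up sample data: assumes a header row followed by two house numbers.
SAMPLE = (
    "osm_id\tstreet_id\tstreet\thousenumber\tlon\tlat\n"
    "5001\t1001\tExample Street\t1\t8.5391\t47.3686\n"
    "5002\t1001\tExample Street\t3\t8.5392\t47.3687\n"
)

index = index_housenumbers(io.StringIO(SAMPLE))
position = locate(index, "1001", "3")  # "1001" is the street's osm_id in geonames.tsv
```

Keeping osm_id values as strings avoids any conversion step when matching them against the osm_id column of geonames.tsv.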