Import/Catalogue/BnB-Matera
About
This page is about importing 700+ Bed & Breakfast and chalets included in "Strutture ricettive" (tourism infrastructure) datasets published by Comune di Matera, Italy.
The import is being discussed on the OSM mailing list. The import will be the result of consensus there.
Goals
This import aims to have a Comune di Matera-certified and updated set of POIs (OSM tourism=guest_house|chalet) for the municipal territory (OSM admin_level=8).
Schedule
Starting from May 2019, import will be performed thru conflation and audit. Progress will be trackable in an audit map. Depending on mappers involved, import should take 20-60 days to be accomplished.
Import Data
Background
Matera Opendata web page lists the following datasets (as may 2019):
- 35 import_Ricettività_Matera - Albergo.csv (hotel/motel)
- 172 import_Ricettività_Matera - Affittacamere.csv (rooms to let, w/o breakfast)
- 11 import_Ricettività_Matera - Agriturismo.csv
- 203 import_Ricettività_Matera - B&B.csv (bed and breakfast)
- 507 import_Ricettività_Matera - CasaVacanze.csv (chalet)
- 9 import_Ricettività_Matera - Varie.csv (other)
The subject of this wiki are files in bold. All records are punctual objects which geo coordinates based on property centroid, extracted from cadastre by Matera municipality.
Metadata
As defined in Matera Opendata page, datasets feature the following:
- source name: elenco-strutture-ricettive-nel-comune-di-matera-dal-2015
- release date: 01-02-2018
- last update: 20-09-2018
- AOI: Matera
- operator: Comune di Matera
Legal
- Data license: ODbL, as stated in Matera Opendata page and defined in Opendefinition site.
Record format and tagging plan
Matera Opendata datasets share a similar record format, except for fields "ID" (B&B only) and "CODICE FISCALE" (chalet only)
Table structure will be pruned and adapted thru OpenRefine; fields will be mapped referring to tourism wiki page.
Below table lists useful input fields:
Field | Value | Tagged as | Notes |
---|---|---|---|
ID | 414 | ref=414 | |
LAT | 40.6583221 | n/a | geocoord |
LON | 16.6113357 | n/a | geocoord |
TIPOLOGIA | Bed & Breakfast or Casa Vacanza | tourism=guest_house or tourism=chalet | |
name | BELVEDERE | name=Belvedere | |
LEGALE_RA | MANICON NICOLA | operator=Manicon Nicola | |
UBICAZIONE | Via Morelli 1 | addr:street=Via Morelli
addr:housenumber=1 |
|
CODICE FISCALE | MNCMHL82B53A225Z | ref:vatin=MNCMHL82B53A225Z | |
City | Matera | addr:city=Matera | |
POSTI LETTO | 25 | beds=25 |
Import Type
It shall not be a blind import: source data shall be checked and audited by mappers through an audit support map.
Audit support map
The dataset will be imported on its municipal base (OSM admin_level=8). OSM candidate nodes will be presented as pins on a dedicated Matera Opendata audit support map.
Pins
- Blue translucent: dataset position
- Blue: OSM position (centroid if polygon)
- Green: new POI, can be dragged in better position.
Fields
- Yellow: proposed tag value substitution
- Green: new tag
Goals
This audit aims to add missing source data POIs and to update OSM existing ones. Besides, you cat take the chance to:
- check name typos
- addr inconsistencies
- any other anomaly like position, duplicates etc
For any doubt, "skip" will postpone POI audit or a "fixme" will be inherited by OSM candidate object.
Team Approach
Import will be managed by OSM user Cascafico; audit will be open to any OSM user accessing audit map.
Workflow
Step by step operations:
- dataset download
- OpenRefine operations
- conflation
- community audit
In case of import problems, changeset involved will be reverted using proper reverter
Data Preparation
The data is presented as "comma separated values" files in a collection of punctual elements, one for each B&B/chalet. Minor column adaptations will be done by script
Refining
Some normalizations require refining operations. Below, a summary of actions performed thru OpenRefine operations:
- names and operators to title case (first char uppercase)
- name prepositions uppercase to lowercase
- address split in addr:street and addr:housenumber
Conflation
Conflation parameters are set in specific profile file
Due to high density source datasets, some mismatches can be generated in conflated data feeded to audit map; they will be reported with proper audit fixme's.
Upload
Data shall be uploaded manually thru JOSM editor. Dedicated upload account shall be attilaimport.
Changeset Tags
Changesets should be tagged with:
- source=Matera Opendata
- source:date=2018-09-20 (as defined in source dataset)
- source:license=ODbL
- type=import
- url=https://wiki.openstreetmap.org/w/index.php?title=Import/Catalogue/BnB-Matera