Portland, OR Bldg import
Overview
Please do not attempt to import any features without first consulting the main contacts below.
The goal of this import is to add building and associated address data from publicly available datasets in the Portland, OR USA, metropolitan region. This region includes not only the city of Portland but 24 other cities in Clackamas, Multnomah, and Washington counties.
The Github repository for all code is here: https://github.com/pdxosgeo/pdxbldgimport, as well as issues and project tasks.
The scripts and code for creating the data are intended to generate OSM files that will be validated and imported on a quarter-section by quarter-section basis.
A data sample can be found here: https://github.com/pdxosgeo/pdxbldgimport/blob/master/sample_osm_files/1n2e34d.osm
Contacts
Main contributors/organizers of this project:
- PDX-OSGEO
- Darrell Fuhriman : User page
- Rafa Gutierrez: User page
- TC Haddad : User page
- Mele Sax-Barnett : Wiki page - User page
Though PDX-OSGEO is organizing the import, the import is a joint effort with MaptimePDX, City of Portland staff, Metro GIS staff, TriMet, and other local OSM contributors.
How can I help?
We need technical help with validation and script updates as necessary and documentation help.
Technical help
- Data processing
- Help with the code (GitHub account required), see the readme instructions
- Help with issues
- Importing. OpenStreetMap account required (Create one here)
Documentation help
- Help update this project page; OpenStreetMap Wiki account required (Create one here)
- Write docs about importing
Source data
Summary
Data is provided by Metro. The datasets:
- Building Footprints: http://rlisdiscovery.oregonmetro.gov/?action=viewDetail&layerID=2406
- Address points: http://rlisdiscovery.oregonmetro.gov/?action=viewDetail&layerID=656
The Building Footprints dataset will be referred to as "bldgs". The Address Points dataset will be referred to as "addrs".
The addrs data set will be used to get address components without having to parse the address field in the bldgs data.
License
These Metro (RLIS) datasets are made available under ODbL, see: http://www.oregonmetro.gov/sites/default/files/Open_Database_and_Content_Licenses.pdf . Moreover, one of the reasons why Metro staff chose this license was to help make it compatible with OSM imports.
Process
The general process for importing data is described below.
Data processing
- Import building and address data into PostGIS
- Convert address data to OSM format (expanded abbreviations, etc.)
- Import existing OSM buildings (building=*) into the database
- Any new building footprints which intersect existing buildings will, for now, be ignored
- Tags will be created for each building as below
Tag Mappings
tag | source |
addr:housenumber | addrs.house |
addr:street | addrs.str_predir_code + addrs.street_name + addrs.street_type_code + addrs.str_postdir_code |
addr:postcode | addrs.zip_code |
addr:city | addrs.city |
building:levels | bldgs.num_story |
building | See discussion below |
ele | bldgs.surf_elev converted to meters, rounded to cm) |
height | bldgs.max_height |
Building Types
The bldgs dataset contains a column representing the use of the building "BLDG_TYPE". The corresponding building= tag will be set as follows (based on http://wiki.openstreetmap.org/wiki/Key:building):
source value | final value |
House | detached |
Garage | garage |
RES, Res | residential |
Duplex, Apartment Complex, Multiplex, Residential Condominiums | apartments |
Dormitories | dormitory |
Everything else (the vast majority) | yes |
Addresses
Addresses are connected to a building through a taxlot id. In the vast majority of cases, this is a 1:1 mapping, however there are some cases where there are many addresses associated to a property containing many buildings.
Some buildings have multiple addresses associated with them. Where this is the case, the addresses will be assigned to nodes inside the building polygon. No addresses will be assigned to the building, since there is no way to determine which, if any, of them is the "primary" address.
In the case where there are multiple buildings and multiple addresses, with no way to distinguish, then a site boundary will be generated, and nodes will be created inside that polygon, but not on any given building. If for some reason that is not possible, then no addresses will be assigned at that time.
Import Plan
Phase 0: Preparation
- Contributors/importers will create separate accounts followed by their normal usernames + "_pdxbuildings" for import purposes.
- We will hold a training and provide documentation for people who want to contribute
- Buildings will be extracted into consolidated quarter-sections (of at least 500 buildings each, at most 1400) and converted into .osm format, with three outputs:
- Only those that DO NOT overlap with existing OSM data and have a single address
- Only those that overlap with existing OSM data or have multiple addresses (to use in Phase 2)
- All buildings
Phase 1: The easy stuff
- This phase will only deal with buildings that do not overlap with other features in OSM and have one (or no) address, or simple one building on one taxlot with multiple addresses
- Contributors will download a section of buildings, and a section of address nodes and validate it in JOSM, verify them with local knowledge & against the most recent imagery available; likely NAIP imagery with Bing as a fallback.
Please note: satellite imagery can have significant positioning errors versus more accurate surveys. Much of the building data is generated from high accuracy LIDAR surveys, so take care when considering changing the position of a building based on imagery.
- After validation, the changeset for that section will be uploaded by the contributor with a commit message that indicates it is part of this project
- Contributors will adhere to a strict limit of features edited per hour to be determined with consultation of the OSM Imports-us group.
Phase 2: Reconciliation with existing buildings/addresses
This phase will involve conflating data to be added to buildings that already exist in OSM and dealing with any complex addressing issues.
A draft of the code for this is available in the github repository. It finds OSM buildings with no addresses, then looks up the appropriate address in the addr data using a spatial query, then commits it to OSM. This is done with 50 addrs in one change set, all located nearby each other. As of 2014-11-14, there are ~4000 addresses included.
The current draft of code only handle 1:1 building to addresses.
Phase 3: Handling multiple addresses on multiple buildings
We will make additional proposals and obtain approval from the OSM Imports-us group and OSM Data Working Group for any further work before proceeding. We are planning on having contributors use OSMLY for this phase.
Maintenance Plan
Contributors to this project are (and should be) Portland-area community members with an interest in keeping the data up to date. We have an active mapping community where some of this upkeep can be done:
- Monthly local meetups: http://maptime.io/portland and http://wiki.osgeo.org/wiki/PDX-OSGEO
- Annual WhereCamp conference: http://wherecamppdx.org