Go to file
2022-09-01 13:12:13 -07:00
data ready to import Initial commit 2022-09-01 11:52:22 -07:00
original data Initial commit 2022-09-01 11:52:22 -07:00
processed data update readme and addresses preprocessing 2022-09-01 13:12:13 -07:00
qgis-functions.py update readme and addresses preprocessing 2022-09-01 13:12:13 -07:00
README.md update readme and addresses preprocessing 2022-09-01 13:12:13 -07:00

  • Open the original data in QGIS
  • Format OSM fields with QGIS functions to have proper capitalization and full spellings without extraneous whitespace, based on original fields. For example OSM uses names like North Main Street, not N MAIN ST. All fields are of the QGIS type "text" even if they're numbers.
    • You can use the Attribute Table's Field Calculator for this; you can copy-paste the qgis-functions.py file into the Function Editor and then use the Expression tab to create new, formatted virtual fields. Don't worry if the field name limit is too short, it can be fixed in JOSM.
  • For addresses:
    • The Addresses shapefile is recorded in the ESRI:102659 CRS, you may need to convert or reproject to/from.
    • ADD_NUM becomes the virtual addr:housenumber (or addr:house temporarily, avoiding addr:house which is a real tag)
    • SADD becomes the virtual addr:street (or addr:stree temporarily) via the getformattedstreetnamefromaddress( "SADD") custom expression
    • POST_COMM becomes the virual addr:city via the title("POST_COMM") expression (we care about postal community addresses not what municipality a place might be governed by)
    • POST_CODE becomes addr:postcode (or addr:postc temporarily)
  • For roads:
    • NAME becomes the virtual name via the getformattedstreetname("NAME")
    • surface=asphalt added manually in JOSM
    • maxspeed=10 mph added manually in JOSM
  • Export to Geojson, selecting only the OSM-formatted fields we want and deselecting the rest.
  • Ensure the export file is in the EPSG:4326 - WGS84 CRS.
  • Open in JSOM. It's suggested to begin with roads first, addresses second, so the addresses can be placed in context.
  • In the Roads dataset, select and remove all relations from the geojson/shapefile layer: the data often has one relation per road and this is improper for OSM import.
  • Select a small region to work on: one neighborhood or smaller. For this import, we are assuming that only newly-constructed small residential areas will be imported, not main roads or commercial areas or areas with significant existing map data.
  • Download the area you're working on from OSM, into a new Data Layer (not your geojson layer.)
  • Select all features to be imported at this time and leave them selected until the merge step below.
  • Select all ways for roads, or all nodes for addresses. Make sure you aren't about to mass-edit the nodes of a road: deselect the nodes if this happens.
  • Ensure the tags are correct and good. (QGIS has a character limit and sometimes doesn't like colons, so double check that addr:house is addr:housenumber, addr:postc is addr:postcode, addr:stree is addr:street, etc.)
  • Mass-add new tags like highway=residential, surface=asphalt, etc, as indicated.
  • Remove any spurious tags that may have been brought over in the import (if it's not in the OSM Wiki, we don't want it.)
  • Press ctrl-shift-M to merge into the OSM data layer. There will be a warning, but click OK; we will be extra careful about validating the merge in the next steps.
  • For addresses, remove any address nodes that seem to not reflect reality or be placed far from the street bearing their name: it's better to not have 123 Adams Street mapped at all, than to claim that 123 Adams Street is hovering over someone's newly-built house at 321 Franklin Avenue, 200 feet away from Adams Street. (Cities often won't remove old addresses, leading to confusion when new streets are built.)
  • For roads, highlight multiple street segments which have the same name and press C to combine them: the county data has one way per road segment and that's excessive for OSM.
  • Check the edges of the imported areas to ensure new roads are merged with any preexisting roads
  • Check the import area to ensure no incorrect overlaps
  • Use the JOSM validator to ensure no errors in imported data. Warnings about existing data separate from the import can be ignored.
    • If there are duplicate house numbers in the data, investigate and remove the more-unlikely node or both nodes. For example 4650 Ramsell Road is duplicated in the source data, but the easternmost copy is on the "odd" side of the street and between 4653 and 4663 so it's more likely to actually be 4651, 4655, 4657, 4659, or 4661. We have no way of knowing, so we can either delete it entirely or simply delete the housenumber tag and leave it as an address without a number for a future editor to review. (We may submit incomplete data, just not wrong data.) We then leave the westernmost copy alone since 4650 fits neatly in between 4640/4644 and 4654/4660.
  • Click upload
    • Make sure there are no erroneous Relations or other unwanted objects about to be uploaded.
    • Use a descriptive changeset message like "Roads/Addresses in The Villages #villagesimport"
    • Set the Source to be "Sumter County GIS"
    • You can easily copy-paste the below into the Settings tab:
comment=Roads/Addresses in The Villages #villagesimport
import=yes
website=https://wiki.openstreetmap.org/wiki/The_Villages_Road_and_Address_Import
source=Sumter County GIS
source:url=https://gitlab.com/zyphlar/the-villages-import
  • Review imported data in Achavi or Osmcha to ensure it looks proper.