Tag: data science

Urban Form Analysis with OpenStreetMap Data

Post author By gboeing
Post date 2017-04-11
11 Comments on Urban Form Analysis with OpenStreetMap Data

Check out the journal article about OSMnx. This is a summary of some of my recent research on making OpenStreetMap data analysis easy for urban planners. It was also published on the ACSP blog.

OpenStreetMap – a collaborative worldwide mapping project inspired by Wikipedia – has emerged in recent years as a major player both for mapping and acquiring urban spatial data. Though coverage varies somewhat worldwide, its data are of high quality and compare favorably to CIA World Factbook estimates and US Census TIGER/Line data. OpenStreetMap imported the TIGER/Line roads in 2007 and since then its community has made numerous corrections and improvements. In fact, many of these additions go beyond TIGER/Line’s scope, including for example passageways between buildings, footpaths through parks, bike routes, and detailed feature attributes such as finer-grained street classifiers, speed limits, etc.

This presents a fantastic data source to help answer urban planning questions, but OpenStreetMap’s data has been somewhat difficult to work with due to its Byzantine query language and coarse-grained bulk extracts provided by third parties. As part of my dissertation, I developed a tool called OSMnx that allows researchers to download street networks and building footprints for any city name, address, or polygon in the world, then analyze and visualize them. OSMnx democratizes these data and methods to help technical and non-technical planners and researchers use OpenStreetMap data to study urban form, circulation networks, accessibility, and resilience.

Planning

Urban Form Figure-Ground Diagrams

Post author By gboeing
Post date 2017-03-01
4 Comments on Urban Form Figure-Ground Diagrams

Check out the journal article about OSMnx.

I previously demonstrated how to create figure-ground square-mile visualizations of urban street networks with OSMnx to consistently compare city patterns, design paradigms, and connectivity. OSMnx downloads, analyzes, and visualizes street networks from OpenStreetMap but it can also get building footprints. If we mash-up these building footprints with the street networks, we get a fascinating comparative window into urban form:

Tech

Getting Started with Python

Post author By gboeing
Post date 2017-02-28
10 Comments on Getting Started with Python

This is a guide for absolute beginners to get started using Python. Since releasing OSMnx a few weeks ago, I’ve received a lot of comments from people who would love to try it out, but don’t know where to begin with Python. I’ll demonstrate how to get Python up and running on your system, how to install packages, and how to run code.

Tags anaconda, city, code, data, data science, geopandas, geospatial, gis, ipython, jupyter, maps, matplotlib, networks, notebook, numpy, osmnx, pandas, python, tutorial, urban, visualization

Planning

Square-Mile Street Network Visualization

Post author By gboeing
Post date 2017-01-02
46 Comments on Square-Mile Street Network Visualization

Check out the journal article about OSMnx. All figures in this article come from this journal article, which you can read/cite for more.

The heart of Allan Jacobs’ classic book on street-level urban form and design, Great Streets, features dozens of hand-drawn figure-ground diagrams in the style of Nolli maps. Each depicts one square mile of a city’s street network. Drawing these cities at the same scale provides a revealing spatial objectivity in visually comparing their street networks and urban forms.

We can recreate these visualizations automatically with Python and the OSMnx package, which I developed as part of my dissertation. With OSMnx we can download a street network from OpenStreetMap for anywhere in the world in just one line of code. Here are the square-mile diagrams of Portland, San Francisco, Irvine, and Rome created and plotted automatically by OSMnx:

Tags boston, city, complex systems, complexity, data, data science, design, dubai, europe, geopandas, geopy, geospatial, gis, irvine, land use, livability, maps, matplotlib, modeling, neighborhood, networks, numpy, osaka, osmnx, pandas, portland, projection, python, rome, sacramento, san francisco, shapely, street-network, tutorial, urban, urban design, urban form, urban planning, visualization

Planning

OSMnx: Python for Street Networks

Post author By gboeing
Post date 2016-11-01
205 Comments on OSMnx: Python for Street Networks

If you use OSMnx in your work, please cite the journal article.

OSMnx is a Python package to retrieve, model, analyze, and visualize street networks from OpenStreetMap. Users can download and model walkable, drivable, or bikeable urban networks with a single line of Python code, and then easily analyze and visualize them. You can just as easily download and work with amenities/points of interest, building footprints, elevation data, street bearings/orientations, and network routing. If you use OSMnx in your work, please download/cite the paper here.

In a single line of code, OSMnx lets you download, model, and visualize the street network for, say, Modena Italy:

import osmnx as ox
ox.plot_graph(ox.graph_from_place('Modena, Italy'))

Tags city, complexity, data, data science, design, geocoding, geopandas, geopy, geospatial, gis, land use, livability, maps, matplotlib, modeling, neighborhood, networks, numpy, osmnx, pandas, planning, projection, python, science, shapely, street-network, streets, transportation, urban, urban design, urban planning, visualization

Data

R-tree Spatial Indexing with Python

Post author By gboeing
Post date 2016-10-24
21 Comments on R-tree Spatial Indexing with Python

Check out the journal article about OSMnx, which implements this technique.

A spatial index such as R-tree can drastically speed up GIS operations like intersections and joins. Spatial indices are key features of spatial databases like PostGIS, but they’re also available for DIY coding in Python. I’ll introduce how R-trees work and how to use them in Python and its geopandas library. All of my code is in this notebook in this urban data science GitHub repo.

Tags city, data, data science, geopandas, geospatial, gis, maps, matplotlib, modeling, pandas, projection, python, r-tree, science, shapely, spatial index, tutorial, urban, urban planning, visualization

Data

College Football Stadium Attendance

Post author By gboeing
Post date 2016-09-30
1 Comment on College Football Stadium Attendance

A few months ago, I wrote about the large investments that U.S. universities are making in their football stadiums. This also included a visual analysis of stadium capacity around the country. Outside of North Korea, the 8 largest stadiums in the world are college football stadiums, and the 15 largest college football stadiums are larger than any NFL stadium.

I received a few comments interested in further analysis of the actual attendance of games held in these stadiums. While capacity is interesting because it represents an expectation and sustained investment by the school, attendance represents the utilization of that investment. My stadium capacity data covered every NCAA division I football stadium in the U.S. as of the 2015 college football season. So, I downloaded the NCAA’s 2015 home game attendance data to compare. My data, code, and analysis are in this GitHub repo. First, I visualized the FBS attendance figures themselves:

Tags academia, data, data science, football, land use, ncaa, pandas, planning, python, stadiums, urban, urban planning, visualization

Planning

How to Visualize Urban Accessibility and Walkability

Post author By gboeing
Post date 2016-07-31
4 Comments on How to Visualize Urban Accessibility and Walkability

Tools like WalkScore visualize how “walkable” a neighborhood is in terms of access to different amenities like parks, schools, or restaurants. It’s easy to create accessibility visualizations like these ad hoc with Python and its pandana library. Pandana (pandas for network analysis – developed by Fletcher Foti during his dissertation research here at UC Berkeley) performs fast accessibility queries over a network. I’ll demonstrate how to use it to visualize urban walkability. My code is in these IPython notebooks in this urban data science course GitHub repo.

First I give pandana a bounding box around Berkeley/Oakland in the East Bay of the San Francisco Bay Area. Then I load the street network and amenities from OpenStreetMap. In this example I’ll look at accessibility to restaurants, bars, and schools. But, you can create any basket of amenities that you are interested in – basically visualizing a personalized “AnythingScore” instead of a generic WalkScore for everyone. Finally I calculate and plot the distance from each node in the network to the nearest amenity:

Tags basemap, berkeley, city, data, data science, design, geopandas, geospatial, gis, land use, livability, maps, matplotlib, modeling, neighborhood, networks, new urbanism, numpy, pandas, planning, python, smart cities, smart growth, tutorial, urban, urban design, urban planning, visualization

Data

Mapping Everywhere I’ve Ever Been in My Life

Post author By gboeing
Post date 2016-06-27
3 Comments on Mapping Everywhere I’ve Ever Been in My Life

I recently wrote about visualizing my Foursquare check-in history and mapping my Google location history, and it inspired me to mount a more substantial project: mapping everywhere I’ve ever been in my life (!!). I’ve got 4 years of Foursquare check-ins and Google location history data. For everything pre-smart phone, I typed up a simple spreadsheet of places I’d visited in the past and then geocoded it with the Google Maps API. All my Python and Leaflet code is available in this GitHub repo and is easy to re-purpose to visualize your own location history.

I’ll show the maps first, then run through the process I followed, below. First off, I used Python and matplotlib basemap to create this map of everywhere I’ve ever been:

Tags basemap, berkeley, clustering, data, data science, dbscan, foursquare, geocoding, geospatial, gis, google, javascript, leaflet, maps, matplotlib, pandas, projection, python, scikit-learn, travel, tutorial, visualization

Tech

Scientific Python for Raspberry Pi

Post author By gboeing
Post date 2016-03-14
25 Comments on Scientific Python for Raspberry Pi

A guide to setting up the Python scientific stack, well-suited for geospatial analysis, on a Raspberry Pi 3. The whole process takes just a few minutes.

The Raspberry Pi 3 was announced two weeks ago and presents a substantial step up in computational power over its predecessors. It can serve as a functional Wi-Fi connected Linux desktop computer, albeit underpowered. However it’s perfectly capable of running the Python scientific computing stack including Jupyter, pandas, matplotlib, scipy, scikit-learn, and OSMnx.

Despite (or because of?) its low power, it’s ideal for low-overhead and repetitive tasks that researchers and engineers often face, including geocoding, web scraping, scheduled API calls, or recurring statistical or spatial analyses (with small-ish data sets). It’s also a great way to set up a simple server or experiment with Linux. This guide is aimed at newcomers to the world of Raspberry Pi and Linux, but who have an interest in setting up a Python environment on these $35 credit card sized computers. We’ll run through everything you need to do to get started (if your Pi is already up and running, skip steps 1 and 2).

Tags api, basemap, data, data science, geocoding, geopandas, geopy, geospatial, iot, ipython, jupyter, linux, matplotlib, numpy, pandas, pyproj, raspberry pi, raspbian, science, scikit-learn, scipy, scrapy, shapely, statistics, statsmodels, web scraping