University of California, Berkeley Department of City and Regional Planning
Professor: Paul Waddell
GSI: Irene Farah Rivadeneyra
This is a hands-on course that trains students to analyze urban data, develop indicators, and create visualizations and maps using the Python programming language, open source tools, and public data. The course will first introduce the fundamentals of programming in Python before moving on to a survey of data analysis/visualization tools and technologies. Classroom sessions will include lectures and workshops. A series of exercises will reinforce the skills and topics being presented, and a final project will provide an opportunity for students to develop a more complete project from harvesting data from Open Data portals to synthesizing and analyzing those data to explore a question or problem, to communicating their results in a web map and blog, as well as a final presentation.
COURSE SYNOPSIS AND OBJECTIVES
This course is designed to provide future city planners with a toolkit of technical skills for quantitative problem solving. It requires tolerance for experimentation, self-directed trial and error, and an interest in learning to write code. If you are willing to roll up your sleeves and embrace some uncertainty, you'll learn the fundamentals of urban data analysis and visualization, and might discover an entirely new lens through which to study, plan, and design neighborhoods, cities, and regions.
Topics to be covered include:
- Fundamentals of programming with Python and Jupyter notebooks
- Cleaning, manipulating, and analyzing urban data
- Visualizing data in Python with charts, graphs, and tables
- Accessing public data from the web and APIs (including Twitter, Google, Census data, and the Open Data portals of cities)
- Developing spatial indicators and mapping urban data
COURSE PREREQUISITES Prior coursework (such as CP 204c) and experience using a GIS is required. Students are not required to have prior programming experience, although it will be beneficial. Python is an accessible language and the course will emphasize learning by doing. Prior or concurrent course work in statistics and data analysis (e.g. CP 204A or 204D) is encouraged as this course will not provide the theoretical foundations of statistical analysis.
This course is open to students from across campus, but priority enrollment will be given to students in the Master of City Planning program and PhD program in City and Regional Planning. Graduate students from other CED departments, and departments that have concurrent programs with DCRP are next in priority.
ASSIGNMENTS AND ASSESSMENT
Students will develop skills gradually through exercises paced over the semester. These will typically involve writing Python code and documenting it, using Jupyter Notebooks that can be shared and interactively run inside a web browser, and a writeup discussing the assignment and its results. Exercises will generally be due Sunday night at 11:59pm.
Each exercise except the first two (which are 10 points each) will be worth 20 points and the final project + presentation will be worth 100. Exercises will be tasks that demonstrate a degree of mastery of skills, and will be used as a means of ensuring that students are keeping up with the material and not falling behind. All assignments will be marked down 20% for each day late, so please submit on time. Final projects will require harnessing the skills practiced in the exercises and developing a more independent work plan to accomplish an analysis of data.
Given how important it is for students to become self-sufficient in problem solving, a key component of the grading rubric for the exercises and the final project will be self-sufficiency. Students that are uncomfortable with solving problems and getting help using Internet searches to see how others have solved similar problems should consider taking a more traditional GIS or data analysis course that does not require extensive programming.
Please make sure you arrive on time to class (we will start at 9:40 - please be in class by then). Due to the nature of the computer lab, it can be very distracting to have students trickling in after class has begun. Make sure you have a keycard with access to room 214 and please do not knock on the door to be let in after class has begun. If you are taking this class and are not a Master of City Planning student, you must sign up for a computer account and access to the 214 lab. This may require paying a fee. You can sign up in 477 Wurster Hall.
READINGS AND RESOURCES
There will be a reading or set of short readings required for most class sessions. These are essential for coming prepared as the lectures will proceed under the assumption that everyone has already read the assigned reading materials. Most class sessions will begin with a quick discussion of the reading material, and you will have the opportunity to ask us questions about anything you didn't understand.
For the most part, the lecture notes, tutorials and examples provided on this website will provide a good foundation for the skills students will gain in thew course. Some of the key software packages we will be using have free documentation available, and you should make extensive use of these to help you climb the learning curve.
Downey, Allen. 2016. Think Python: How to Think Like a Computer Scientist. Green Tea Press. http://greenteapress.com/wp/think-python-2e/ This text provides a fairly comprehensive overview of Python as a programming language.
Pilgrim, Mark. Dive Into Python 3. http://www.diveintopython3.net/
Jupyter Notebook Documentation 2018. https://jupyter-notebook.readthedocs.io/en/latest/notebook.html
IPython Documentation 2018. http://ipython.readthedocs.io/en/stable/
McKinney, Wes. 2018. Python for Data Analysis, 2nd Edition. O'reilly. Available under Files > Reading Materials. You may also want to access the online Documentation available in pdf.
This course requires a lot of experimentation and trial-and-error. Google is your best friend! Google your questions, Google any error messages, and if you can't find an answer, email Paul and Geoff. When you email us, tell us what you Googled and what you've discovered, and include screenshots and error messages. The key to success in this course has been repetition: keeping up on the readings, writing lots of code over and over, and stretching to figure out "what next".
The course website is online through bcourses, accessible here: https://bcourses.berkeley.edu/courses/1462984