This project analyzes air quality in New York City, focusing on Ozone (O₃), Nitrogen Dioxide (NO₂), and Fine Particulate Matter (PM2.5). It examines pollutant emissions (e.g., SO₂, NOₓ, PM2.5), associated health outcomes (e.g., asthma ER visits, hospitalizations), and annual vehicle miles traveled. The analysis is performed using R and includes data cleaning, visualization, and time series forecasting using the ARIMA model.
The dataset contains surveillance data on air quality in New York City, highlighting the spatial and temporal variation in pollutant levels and exposure risks. Air pollution poses serious environmental and public health threats, especially in urban areas. Exposure to common pollutants is linked to respiratory illnesses, cardiovascular diseases, cancers, and premature death. These indicators help evaluate pollution trends and disparities across NYC neighborhoods.
- Date range: 2005–2023
- Metadata last updated: March 29, 2025
- NYC Air Quality dataset
- Air-Quality – HTML Report: Interactive report with code, Plotly visualizations, and a few static plots.
- Air-Quality.Rmd - R markdown
You can view the full interactive HTML report here:
R packages: tidyverse, dplyr, plotly, forecast, forcats, ggrepel, ggplot2, lubridate
- Data Cleaning
- Exploratory Data Analysis and Visualization
- Time Series Forecasting (ARIMA model)