Skip to content

Commit

Permalink
Update completes 2024
Browse files Browse the repository at this point in the history
  • Loading branch information
frankpd committed Jan 10, 2025
1 parent 1f3ace8 commit cde7ca3
Show file tree
Hide file tree
Showing 26 changed files with 11,615 additions and 743 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@

The Providence Geocoded Crime Incidents is a geodataset created by the Brown University Library [GeoData@SciLi](https://libguides.brown.edu/geodata/) team for the mapping of crimes by location and type over time. It is derived from the [Providence Police Case Log](https://data.providenceri.gov/Public-Safety/Providence-Police-Case-Log-Past-180-days/rz3y-pz8v/about_data), updated nightly with the past 180 days of crime incidents in Providence, Rhode Island. The purpose of this project is to create annual data archives of geocoded crime incidents. **The locations of incidents in this project are general approximations, and do not represent the precise locations where crimes occurred**.

Incidents were plotted to: the mid-point of a range of addresses in a block, the intersection of two streets, or the center of a large landmark. Coordinates were derived from three sources: RIGIS's E911 point layer, the RIDOT geocoder, and Open Street Map. The coordinate source is determined by the format of the location attribute in the original dataset: block number, intersection, or landmark. Locations which do not fall into these categories, such as standalone street names, were not geocoded due to their imprecise nature. Successfully geocoded results are stored as a point layer (`pvd_geocoded_[year].shp`) in the Rhode Island State Plane (ft-US) coordinate system, EPSG 3438. Results are also available in Excel format (`pvd_geocoded_[year].xlsx`), with 'latitude' and 'longitude' columns in WGS 84 (EPSG 4326). Cases which could not be accurately geocoded are stored in `pvd_non_geocoded_[year].xlsx`. Output files are available separately by year. In addition to these output files, this repository includes metadata in the OSM Aardvark standard, documentation, and the Python script for generating the results.
Incidents were plotted to: the mid-point of a range of addresses in a block, the intersection of two streets, or the center of a large landmark. Coordinates were derived from three sources: RIGIS's E911 point layer, the RIDOT geocoder, and Open Street Map. The coordinate source is determined by the format of the location attribute in the original dataset: block number, intersection, or landmark. Locations which do not fall into these categories, such as standalone street names, were not geocoded due to their imprecise nature. Successfully geocoded results are stored as a point layer (`pvd_geocoded_[year].shp`) in the Rhode Island State Plane (ft-US) coordinate system, EPSG 3438. Geocoded results are also available in CSV format (`pvd_geocoded_[year].csv`), with 'latitude' and 'longitude' columns in WGS 84 (EPSG 4326). Cases which could not be accurately geocoded are stored in `pvd_non_geocoded_[year].csv`. Output files are available separately by year. In addition to these output files, this repository includes metadata in the OSM Aardvark standard, documentation, and the Python script for generating the results.

Project lead: Felicity Hade, DSI Undergraduate Fellow, Brown University '24

Expand All @@ -20,7 +20,7 @@ Data files are stored in the `outputs` directory. Open the `all` folder to find

Completed years:

- [2024](/outputs/2024/pvdcrime_2024.zip) (partial, includes Jan to mid-Aug only)
- [2024](/outputs/2024/pvdcrime_2024.zip) (Jan to Dec)
- [2023](/outputs/2023/pvdcrime_2023.zip) (partial, includes June to Dec only)

## Running the Script
Expand Down
618 changes: 302 additions & 316 deletions code/.ipynb_checkpoints/code-checkpoint.ipynb

Large diffs are not rendered by default.

618 changes: 302 additions & 316 deletions code/code.ipynb

Large diffs are not rendered by default.

Binary file modified code/inputs/crime_cats.xlsx
Binary file not shown.
Binary file modified code/inputs/landmarks.xlsx
Binary file not shown.
76 changes: 56 additions & 20 deletions metadata/pvdcrime_2024.md
Original file line number Diff line number Diff line change
@@ -1,85 +1,121 @@
# TITLE

dct_title_s: 'Crime Incidents, Providence Rhode Island, 2024'

# DESCRIPTION

dct_description_sm:
- This is a point layer of crime incidents in Providence, Rhode Island from January to mid-August 2024. This layer was created from the Providence Police Department's Case Log, which is updated nightly with the most recent 180 days of data. The purpose of this dataset is to archive data annually, add categories that are useful for classifying crimes, and provide geocoded locations that generally represent where an incident occurred; it is important to note that these locations are not precise. The logs include a general incident location such as a block range, street intersection, major landmark, or street name. Crime locations were geocoded using different methods based on the type of location, street intersections with the RIDOT geocoder to represent the intersection, block locations using the E911 shapefile provided by RIGIS to calculate the midpoint of a range of addresses in a block, and landmark centroid coordinates from a manually created points-of-interest file. Locations that fall outside these categories (e.g. street names) and locations which could not be geocoded were omitted from this dataset and stored in a separate tabular file. Each incident is assigned a unique case number; multiple offenses may be associated with the same incident/case number and are stored as separate data points. Each offense has a 'counts' attribute which indicates the number of instances of that offense. Offenses are uniquely identifiable by the 'unique_id' attribute, which is an extension of the associated case number. Offenses were categorized by type, where 'violent_cat' indicates the classification of violent crime, and 'property_cat' indicates the classification of property crime if applicable. Offenses are categorized using the 'offense_desc' (offense description) attribute according to the FBI's guidelines. This layer was created by the Brown University Library to allow for basic mapping and geospatial analysis of crime data.

- This is a point layer of crime incidents in Providence, Rhode Island from January to December 2024. This layer was created from the Providence Police Department's Case Log, which is updated nightly with the most recent 180 days of data. The purpose of this dataset is to archive data annually, add categories that are useful for classifying crimes, and provide geocoded locations that generally represent where an incident occurred; it is important to note that these locations are not precise. The logs include a general incident location such as a block range, street intersection, major landmark, or street name. Crime locations were geocoded using different methods based on the type of location, street intersections with the RIDOT geocoder to represent the intersection, block locations using the E911 shapefile provided by RIGIS to calculate the midpoint of a range of addresses in a block, and landmark centroid coordinates from a manually created points-of-interest file. Locations that fall outside these categories (e.g. street names) and locations which could not be geocoded were omitted from this dataset and stored in a separate tabular file. Each incident is assigned a unique case number; multiple offenses may be associated with the same incident/case number and are stored as separate data points. Each offense has a 'counts' attribute which indicates the number of instances of that offense. Offenses are uniquely identifiable by the 'unique_id' attribute, which is an extension of the associated case number. Offenses were categorized by type, where 'violent_cat' indicates the classification of violent crime, and 'property_cat' indicates the classification of property crime if applicable. Offenses are categorized using the 'offense_desc' (offense description) attribute according to the FBI's guidelines. This layer was created by the Brown University Library to allow for basic mapping and geospatial analysis of crime data.

# LANGUAGE

dct_language_sm:
- eng

- eng

# CREATOR

dct_creator_sm:
- Brown University Library

- Brown University Library

# PUBLISHER

dct_publisher_sm:
- Providence Police Department

- Providence Police Department

# PROVIDER

schema_provider_s: Brown

# RESOURCE CLASS

gbl_resourceClass_sm:
- Datasets

- Datasets

# RESOURCE TYPE

gbl_resourceType_sm:
- Point data

- Point data

# LC SUBJECT

dct_subject_sm:
- Crime
- Criminal Statistics
- Police

- Crime
- Criminal Statistics
- Police

# ISO THEME

dcat_theme_sm:
- Events
- Society

- Events
- Society

# TEMPORAL

dct_temporal_sm:
- '2024'

- '2024'

# DATE ISSUED
dct_issued_s: '2024-08'

dct_issued_s: '2025-01'

# SPATIAL

dct_spatial_sm:
- 'Providence, Rhode Island, United States'

- 'Providence, Rhode Island, United States'

# BOUNDING BOX

dcat_bbox: 'ENVELOPE(-71.472667,-71.373614,41.861801,41.772435)'

# RIGHTS

dct_rights_sm:
- The data are licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. You are free to share and to adapt the work as long as you cite the source, do not use it for commercial purposes, and release adaptations under the same license.
- Disclaimer. Every effort was made to insure that the data, which was compiled from public sources, was processed and presented accurately. The creators and Brown University disclaim any liability for errors, inaccuracies, or omissions that may be contained therein or for any damages that may arise from the foregoing. Users should independently verify the accuracy and fitness of the data for their purposes.

- The data are licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. You are free to share and to adapt the work as long as you cite the source, do not use it for commercial purposes, and release adaptations under the same license.
- Disclaimer. Every effort was made to insure that the data, which was compiled from public sources, was processed and presented accurately. The creators and Brown University disclaim any liability for errors, inaccuracies, or omissions that may be contained therein or for any damages that may arise from the foregoing. Users should independently verify the accuracy and fitness of the data for their purposes.

# LICENSE

dct_license_sm:
- https://creativecommons.org/licenses/by-nc-sa/4.0/

- https://creativecommons.org/licenses/by-nc-sa/4.0/

# ACCESS RIGHTS

dct_accessRights_s: Public

# FILE FORMAT

dct_format_s: Shapefile

# UNIQUE ID

id: brown-08192024AAA

# IDENTIFIER

dct_identifier_sm:
- https://github.com/Brown-University-Library/geodata_pvdcrime/data

- https://github.com/Brown-University-Library/geodata_pvdcrime/data

# METADATA MODIFIED
gbl_mdModified_dt: '2024-08-19'

gbl_mdModified_dt: '2025-01-10'

# METADATA VERSION

gbl_mdVersion_s: Aardvark

# GEOREFERENCED

gbl_georeferenced_b: True
6 changes: 3 additions & 3 deletions metadata/pvdcrime_2024.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ dct_title_s: 'Crime Incidents, Providence Rhode Island, 2024'

# DESCRIPTION
dct_description_sm:
- This is a point layer of crime incidents in Providence, Rhode Island from January to mid-August 2024. This layer was created from the Providence Police Department's Case Log, which is updated nightly with the most recent 180 days of data. The purpose of this dataset is to archive data annually, add categories that are useful for classifying crimes, and provide geocoded locations that generally represent where an incident occurred; it is important to note that these locations are not precise. The logs include a general incident location such as a block range, street intersection, major landmark, or street name. Crime locations were geocoded using different methods based on the type of location, street intersections with the RIDOT geocoder to represent the intersection, block locations using the E911 shapefile provided by RIGIS to calculate the midpoint of a range of addresses in a block, and landmark centroid coordinates from a manually created points-of-interest file. Locations that fall outside these categories (e.g. street names) and locations which could not be geocoded were omitted from this dataset and stored in a separate tabular file. Each incident is assigned a unique case number; multiple offenses may be associated with the same incident/case number and are stored as separate data points. Each offense has a 'counts' attribute which indicates the number of instances of that offense. Offenses are uniquely identifiable by the 'unique_id' attribute, which is an extension of the associated case number. Offenses were categorized by type, where 'violent_cat' indicates the classification of violent crime, and 'property_cat' indicates the classification of property crime if applicable. Offenses are categorized using the 'offense_desc' (offense description) attribute according to the FBI's guidelines. This layer was created by the Brown University Library to allow for basic mapping and geospatial analysis of crime data.
- This is a point layer of crime incidents in Providence, Rhode Island from January to December 2024. This layer was created from the Providence Police Department's Case Log, which is updated nightly with the most recent 180 days of data. The purpose of this dataset is to archive data annually, add categories that are useful for classifying crimes, and provide geocoded locations that generally represent where an incident occurred; it is important to note that these locations are not precise. The logs include a general incident location such as a block range, street intersection, major landmark, or street name. Crime locations were geocoded using different methods based on the type of location, street intersections with the RIDOT geocoder to represent the intersection, block locations using the E911 shapefile provided by RIGIS to calculate the midpoint of a range of addresses in a block, and landmark centroid coordinates from a manually created points-of-interest file. Locations that fall outside these categories (e.g. street names) and locations which could not be geocoded were omitted from this dataset and stored in a separate tabular file. Each incident is assigned a unique case number; multiple offenses may be associated with the same incident/case number and are stored as separate data points. Each offense has a 'counts' attribute which indicates the number of instances of that offense. Offenses are uniquely identifiable by the 'unique_id' attribute, which is an extension of the associated case number. Offenses were categorized by type, where 'violent_cat' indicates the classification of violent crime, and 'property_cat' indicates the classification of property crime if applicable. Offenses are categorized using the 'offense_desc' (offense description) attribute according to the FBI's guidelines. This layer was created by the Brown University Library to allow for basic mapping and geospatial analysis of crime data.

# LANGUAGE
dct_language_sm:
Expand Down Expand Up @@ -44,7 +44,7 @@ dct_temporal_sm:
- '2024'

# DATE ISSUED
dct_issued_s: '2024-08'
dct_issued_s: '2025-01'

# SPATIAL
dct_spatial_sm:
Expand Down Expand Up @@ -76,7 +76,7 @@ dct_identifier_sm:
- https://github.com/Brown-University-Library/geodata_pvdcrime/data

# METADATA MODIFIED
gbl_mdModified_dt: '2024-08-19'
gbl_mdModified_dt: '2025-01-10'

# METADATA VERSION
gbl_mdVersion_s: Aardvark
Expand Down
Binary file modified outputs/2023/pvd_geocoded_2023.dbf
Binary file not shown.
Loading

0 comments on commit cde7ca3

Please sign in to comment.