Properties of Data Sources to identify

## Context
We want to add metadata to URLs, filter for relevancy, and expand our database of valid data sources.

## Flowchart
The overall plan for data source identification is now in the [readme of this repo](https://github.com/Police-Data-Accessibility-Project/data-source-identification#readme).

## Properties 

> These are all explained in the [data dictionary](https://docs.pdap.io/activities/data-dictionaries/record-types-taxonomy)

### S tier
- [x] #9 
- [x] #12 
- [x] #43 

### A tier
- [ ] `description`, a subjective thing—fills in the gaps left by `name`, `record type`, and `agency`. Can be used to disambiguate similar sources. Difficult to automate.
- [ ] `aggregation_type`
- [ ] `access_type`
- [ ] `record_download_option_provided`
- [ ] `record_format`
- [ ] Is it `agency_supplied` and `agency_originated`? If not, who are the supplier and originator?
- [ ] `coverage_start`
- [ ] `coverage_end`
- [ ] `portal_type`
- [ ] `scraper_url`
- [ ] `readme_url`

Still A tier, but rarely published:
- [ ] `retention_schedule`
- [ ] `update_frequency`
- [ ] `source_last_updated`

### B tier
- [ ] `size`
- [ ] `update_method`
- [ ] `sort_method`
- [ ] `access_restrictions`

## Related reading
https://github.com/palewire/storysniffer/
http://blog.apps.npr.org/2016/06/17/scraping-tips.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Properties of Data Sources to identify #11

Context

Flowchart

Properties

S tier

A tier

B tier

Related reading

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Properties of Data Sources to identify #11

Description

Context

Flowchart

Properties

S tier

A tier

B tier

Related reading

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions