This repo contains the Jupyter notebooks, data and charts from our water quality testing project with the Hackuarium. This repo is a build from the hammerdirt/notes repo. That folder no longer exists we maintain a local copy in our archives.
The original data is stored in two files - Data_2016.csv and Data_2017.csv. This data is in its original format, there are a few duplicate records and there is some formatting to be done if you want to use it.
The data has been cleaned and stored in the JSON folder. Identifiers like colony colors, week dates or week numbers and any information we needed to index the results is stored in the utililties folders as JSON objects.
If you clone this repo everything is local and will work just fine.
ATTENTION: Make sure to change the output file names to avoid overwriting the original output.
There are three workbooks :
1 - Preparing the data: we go through all the gymnastics necesary to make JSON output for other applications. Our end-use is a web based app that has the same output as the notebook.
2- Output for hdch: The JSON output is used to make the various charts
3- Micro-bar-chart: We tap into the original data to make barchart arrays of all the results per location per day.
The requirements file is from "pip freeze >" this should be good for anybody using a virtual env. If you are using Anaconda go off the list. If in doubt here are the list of imports from the the note book:
- import pandas as pd
- import numpy as np
- import matplotlib
- import matplotlib.pyplot as plt
- import json
- import re
- from textwrap import wrap
- import matplotlib.ticker
- import os
- import seaborn as sns
Submit a pull request if you see something that really needs to be changed.