Reformat weather datasets into zarr.
See the dataset integration guide to add a new dataset to be reformatted.
We use

- `uv` to manage dependencies and python environments
- `ruff` for linting and formatting
- `mypy` for type checking
- `pytest` for testing
- `pre-commit` to automatically lint and format as you git commit
- Install uv
- Run `uv run pre-commit install` to set up the git hooks
- If you use VSCode, you may want to install the extensions (ruff, mypy) it will recommend when you open this folder
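Putting those steps together, a first-time setup might look like the sketch below (the installer one-liner is uv's documented standalone installer; adjust for your platform):

```bash
# Install uv (see https://docs.astral.sh/uv/ for other install methods)
curl -LsSf https://astral.sh/uv/install.sh | sh

# Create the project virtual environment and install dependencies
uv sync

# Install the git hooks so linting and formatting run on each commit
uv run pre-commit install
```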
Run the reformatter CLI with `uv run main`:

```bash
uv run main --help
uv run main <DATASET_ID> update-template
uv run main <DATASET_ID> backfill-local <INIT_TIME_END>
```
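For example, to regenerate a dataset's template and backfill a small time range locally (the dataset ID and end time below are hypothetical placeholders; use a dataset ID registered in this repo):

```bash
uv run main noaa-gefs-forecast update-template
uv run main noaa-gefs-forecast backfill-local 2024-01-01T00:00
```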
Additional development commands:

- Add dependency: `uv add <package> [--dev]`. Use `--dev` to add a development only dependency.
- Lint: `uv run ruff check`
- Type check: `uv run mypy`
- Format: `uv run ruff format`
- Test: `uv run pytest`
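Before pushing, it can be convenient to chain all of the checks in one pass (this simply combines the commands above; it is not a project-provided script):

```bash
uv run ruff format && uv run ruff check && uv run mypy && uv run pytest
```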
To reformat a large archive we parallelize work across multiple cloud servers.

We use

- `docker` to package the code and dependencies
- kubernetes indexed jobs to run work in parallel
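Kubernetes indexed jobs set a distinct `JOB_COMPLETION_INDEX` environment variable in each pod, which is how many workers can split a single backfill between them. A minimal illustration of the mechanism (not this project's actual entrypoint):

```bash
# In an indexed job, Kubernetes sets JOB_COMPLETION_INDEX to 0, 1, 2, ...
# so each pod can claim a disjoint slice of the work.
echo "This pod is worker ${JOB_COMPLETION_INDEX}"
```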
- Install `docker` and `kubectl`. Make sure `docker` can be found at /usr/bin/docker and `kubectl` at /usr/bin/kubectl.
- Set up a docker image repository and export the `DOCKER_REPOSITORY` environment variable in your local shell, e.g. `export DOCKER_REPOSITORY=us-central1-docker.pkg.dev/<project-id>/reformatters/main`
- Set up a kubernetes cluster and configure kubectl to point to your cluster, e.g. `gcloud container clusters get-credentials <cluster-name> --region <region> --project <project>`
- Create a kubectl secret containing your Source Coop S3 credentials, `kubectl create secret generic source-coop-key --from-literal='AWS_ACCESS_KEY_ID=xxx' --from-literal='AWS_SECRET_ACCESS_KEY=xxx'`, and set these environment variables in your local shell: `export AWS_ACCESS_KEY_ID=xxx; export AWS_SECRET_ACCESS_KEY=xxx`.
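Before launching a large backfill it can help to sanity-check the setup. These verification commands are a suggestion, not part of the project's tooling:

```bash
docker info                          # the docker daemon is reachable
kubectl get nodes                    # kubectl points at the intended cluster
kubectl get secret source-coop-key   # the Source Coop credentials secret exists
echo "$DOCKER_REPOSITORY"            # the image repository is set in this shell
```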
Launch the backfill across the cluster with:

```bash
DYNAMICAL_ENV=prod uv run main <DATASET_ID> backfill-kubernetes <INIT_TIME_END> [--jobs-per-pod <int>] [--max-parallelism <int>]
```
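For example, with placeholder dataset ID, end time, and parallelism values (tune these to your dataset and cluster size):

```bash
DYNAMICAL_ENV=prod uv run main noaa-gefs-forecast backfill-kubernetes 2024-01-01T00:00 --jobs-per-pod 4 --max-parallelism 32
```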