HiRAD-Gen is short for High-Resolution Atmospheric Downscaling using Generative models. This repository contains the code and configuration required to train and use the model.

To set up the environment for HiRAD-Gen on the Alps supercomputer, follow these steps:
- Start the PyTorch user environment:

  ```shell
  uenv start pytorch/v2.6.0:v1 --view=default
  ```

- Create a Python virtual environment (replace `{env_name}` with your desired environment name):

  ```shell
  python -m venv ./{env_name}
  ```

- Activate the virtual environment:

  ```shell
  source ./{env_name}/bin/activate
  ```

- Install the project dependencies:

  ```shell
  pip install -e .
  ```
This sets up the environment needed to run HiRAD-Gen on the Alps infrastructure.
- The script for training the regression model is `src/hirad/train_regression.sh`. Inside this script set the following:

  ```shell
  ### OUTPUT ###
  #SBATCH --output=your_path_to_output_log
  #SBATCH --error=your_path_to_output_error
  #SBATCH -A your_compute_group

  srun bash -c "
  . ./{your_env_name}/bin/activate
  python src/hirad/training/train.py --config-name=training_era_cosmo_regression.yaml
  "
  ```
- Set up the following config files in `src/hirad/conf`:
  - In `training_era_cosmo_regression.yaml` set:

    ```yaml
    hydra:
      run:
        dir: your_path_to_save_training_output
    ```

  - In `training/era_cosmo_regression.yaml` set:

    ```yaml
    hp:
      training_duration: number_of_samples_to_train_for
    ```

    Set `training_duration` to 4 for debugging; 512 fits into 30 minutes on 1 GPU with `total_batch_size: 4`.
  - In `dataset/era_cosmo.yaml` set the `dataset_path` if it differs from the default.
- Submit the job with:

  ```shell
  sbatch src/hirad/train_regression.sh
  ```
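For reference, the regression training settings above, filled in end to end, might look like the sketch below. All paths and values are hypothetical examples, not repository defaults:

```yaml
# Hypothetical example values - adjust all paths for your setup.
# training_era_cosmo_regression.yaml
hydra:
  run:
    dir: /scratch/username/hirad/regression_run

# training/era_cosmo_regression.yaml
hp:
  training_duration: 512   # samples; use 4 for a quick debug run
  total_batch_size: 4
```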
Before training the diffusion model, a checkpoint for the regression model must exist.
- The script for training the diffusion model is `src/hirad/train_diffusion.sh`. Inside this script set the following:

  ```shell
  ### OUTPUT ###
  #SBATCH --output=your_path_to_output_log
  #SBATCH --error=your_path_to_output_error
  #SBATCH -A your_compute_group

  srun bash -c "
  . ./{your_env_name}/bin/activate
  python src/hirad/training/train.py --config-name=training_era_cosmo_diffusion.yaml
  "
  ```
- Set up the following config files in `src/hirad/conf`:
  - In `training_era_cosmo_diffusion.yaml` set:

    ```yaml
    hydra:
      run:
        dir: your_path_to_save_training_output
    ```

  - In `training/era_cosmo_diffusion.yaml` set:

    ```yaml
    hp:
      training_duration: number_of_samples_to_train_for
    io:
      regression_checkpoint_path: path_to_directory_containing_regression_training_model_checkpoints
    ```

    Set `training_duration` to 4 for debugging; 512 fits into 30 minutes on 1 GPU with `total_batch_size: 4`.
  - In `dataset/era_cosmo.yaml` set the `dataset_path` if it differs from the default.
- Submit the job with:

  ```shell
  sbatch src/hirad/train_diffusion.sh
  ```
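Analogously, the diffusion training settings might look like the sketch below. All paths are hypothetical (in particular, the checkpoint directory name is illustrative; point `regression_checkpoint_path` at wherever your regression run saved its checkpoints):

```yaml
# Hypothetical example values - adjust all paths for your setup.
hydra:
  run:
    dir: /scratch/username/hirad/diffusion_run

# training sub-config
hp:
  training_duration: 512   # samples; use 4 for a quick debug run
io:
  regression_checkpoint_path: /scratch/username/hirad/regression_run/checkpoints
```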
- The script for running inference is `src/hirad/generate.sh`. Inside this script set the following:

  ```shell
  ### OUTPUT ###
  #SBATCH --output=your_path_to_output_log
  #SBATCH --error=your_path_to_output_error
  #SBATCH -A your_compute_group

  srun bash -c "
  . ./{your_env_name}/bin/activate
  python src/hirad/inference/generate.py --config-name=generate_era_cosmo.yaml
  "
  ```
- Set up the following config files in `src/hirad/conf`:
  - In `generate_era_cosmo.yaml` set:

    ```yaml
    hydra:
      run:
        dir: your_path_to_save_inference_output
    ```

  - In `generation/era_cosmo.yaml` choose the inference mode:

    ```yaml
    inference_mode: all  # one of: all, regression, diffusion
    ```

    The default `all` runs both regression and diffusion. Depending on the mode, pretrained weights for the regression and/or diffusion model must be provided:

    ```yaml
    io:
      res_ckpt_path: path_to_directory_containing_diffusion_training_model_checkpoints
      reg_ckpt_path: path_to_directory_containing_regression_training_model_checkpoints
    ```

    Finally, a subset of time steps from the dataset can be selected for inference. One way is to list the steps under `times:`, in the format `%Y%m%d-%H%M` for the era5_cosmo dataset. The other way is to specify `times_range:` with three items: the first time step (`%Y%m%d-%H%M`), the last time step (`%Y%m%d-%H%M`), and the hour shift (int). The hour shift is the distance in hours between adjacent time steps in the specific dataset (6 for era_cosmo). By default, inference is done for the single time step `20160101-0000`.
  - In `dataset/era_cosmo.yaml` set the `dataset_path` if it differs from the default.
- Submit the job with:

  ```shell
  sbatch src/hirad/generate.sh
  ```
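Putting the inference settings above together, a filled-in generation config might look like the sketch below. All paths are hypothetical examples; the two time-selection forms are alternatives, so use `times:` or `times_range:`, not both:

```yaml
# Hypothetical example values - adjust all paths for your setup.
inference_mode: all          # all / regression / diffusion
io:
  res_ckpt_path: /scratch/username/hirad/diffusion_run/checkpoints
  reg_ckpt_path: /scratch/username/hirad/regression_run/checkpoints

# Either list explicit time steps (%Y%m%d-%H%M)...
times:
  - 20160101-0000
  - 20160101-0600

# ...or specify a range instead: [first step, last step, hour shift]
# times_range: [20160101-0000, 20160102-0000, 6]
```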