Tiger HLM Runoff (GPU)

Directory Structure

root/
├── data/                       
│   ├── forcings/               # Forcing data (CSV/NetCDF)
│   │   ├── precip_forcing.nc   
│   │   ├── precip_lookup.csv
│   │   ├── t2m_forcing.nc
│   │   └── t2m_lookup.csv
│   ├── parameters.csv          # Spatially varying input parameters
│   └── config.yaml             # config for input/output paths, model ID,
│
├── scripts/                    # post‐processing or plotting scripts
│   └── …                       
│
└── src/
    ├── Makefile                # Builds everything with nvcc (see below)
    │
    ├── main.cpp                # Host driver: calls setModelParameters<T>() and run_rk45<T>()
    ├── model_registry.hpp      # Declares inline setModelParameters<T>(…) and extern devParams
    ├── model_registry.cpp      # Defines the single __constant__ devParams for DummyModel
    │
    ├── I_O/                    # I/O utilities (e.g., CSV/NetCDF readers, checking input files)
    │   ├── forcing.cpp         # Reads forcing data (e.g., precipitation, temperature) from NetCDF files.
    │   └── forcing.hpp         # Declares functions for reading NetCDF forcing data.
    |   ├── parameters.cpp      # Reads spatially varying model parameters from CSV files.
    │   └── parameters.hpp      # Declares functions for reading parameter data from CSV files.
    |   ├── config_yaml.cpp     # Parses YAML configuration files for model settings and I/O paths.
    │   └── config_yaml.hpp     # Declares functions for parsing YAML configuration files.
    │
    ├── solver/                 # Core RK45 solver components
    │   ├── rk45.h              # Low‐level kernel prototype (templated kernel)
    │   ├── rk45_kernel.cu      # Implements rk45_kernel_multi<Model> for each Model
    │   ├── rk45_step_dense.cuh # Device‐side Dormand–Prince step (calls Model::rhs inside) and dense output
    │   └── rk45_api.hpp        # Host‐side “run_rk45<T>” and setModelParameters<T>() wrapper
    │
    └── models/                 
        ├── model_dummy.hpp     # Declares DummyModel::UID, Parameters, __device__ rhs(...), extern devParams
        └── model_dummy.cu      # Defines “__constant__ DummyModel::Parameters devParams;”
        └── … (future models go here, e.g. model_foo.hpp + model_foo.cu) …

Setting Up the Environment

Load the required CUDA toolkit using the following command:

For della gpu or gh use these as standard

module load cudatoolkit/12.9
module load openmpi/gcc/4.1.6
module load hdf5/gcc/openmpi-4.1.6/1.14.4
module load netcdf/gcc/hdf5-1.14.4/openmpi-4.1.6/4.9.2

Building & Running

Prerequisites

CUDA Toolkit (nvcc)
C++14-capable compiler (e.g. g++)
NetCDF C++ library for .nc files

Compile

cd src
make

This produces the executable rk45_solver.

Optional Build Modes

You can customize the build process using the following options:

Debug Mode (DEBUG=1): Enables debugging symbols (-g) and disables optimizations (-O0) for easier debugging.
Release Mode (DEBUG=0): Enables optimizations (-O2 or higher) for better performance.
Verbose Mode (VERBOSE=1): Prints detailed compilation commands during the build process.

Run

./rk45_solver

By default, it uses DummyModel::UID, reads data/forcing.csv (if present), integrates num_systems ODEs from t0=0.0 to tf=5.0, and writes:

final.csv (one line per system: H0,H1,H2,H3,H4 at t=tf)
dense.csv (“time,H0_sys0,H1_sys0,…,H4_sysN” at num_queries sample times).

Note: Steps 4 and 5 are still drafts, it works for dummy model and will be expanded to model 200/204 soon.

Add a New Model

Create models/model_new.hpp following the stub in model_dummy.hpp.
Add its hostParams block in model_registry.cpp.
Add a launch_rk45_new<<<…>>> wrapper in rk45_kernel.cu.
Rebuild with make.

Adjust Tolerances or Coefficients In main.cpp, before you call the kernel, set up

NewModel::Parameters hostParams = { absTol_val, relTol_val, /* ... */ };
cudaMemcpyToSymbol(NewModel::devParams, &hostParams, sizeof(hostParams));

so the GPU uses those tolerances instead of hard-coded values.

6. Splitting data

In the main.cpp rank 0 is used to split the parameters by the number of gpus (right now just the number of ranks minus one).

della-gh command line: mpirun -np 2 ./rk45_solver (for now only one gpu)

for nsight info: mpirun -np 2 nsys profile --trace=cuda,mpi --stats=true --output=rk45_profile_%q{OMPI_COMM_WORLD_RANK} rk45_solver

Citation

To cite this software in your publication, please use the following BibTeX (to be updated upon paper acceptance) to refer to the code's method paper:

@article{,
	doi = {},
	url = {},
	year = ,
	month = ,
	publisher = {},
	volume = {},
	number = {},
	pages = {}
	author = {},
	title = {},
	journal = {},
}

Finally, we will have DOIs for each released version on Zenodo. This approach promotes computational reproducibility by allowing you to specify the exact version of the code used to generate the results presented in your publication. A working zenodo badge will be added above once the first version is released.

@software{,
  author       = {Tiger HLM development team},
  title        = {},
  month        = ,
  year         = ,
  publisher    = {},
  version      = {},
  doi          = {},
  url          = {}
}

Name		Name	Last commit message	Last commit date
Latest commit History 91 Commits
data		data
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Tiger HLM Runoff (GPU)

Directory Structure

Building & Running

Optional Build Modes

6. Splitting data

Citation

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

PrincetonUniversity/Tiger_HLM_GPU

Folders and files

Latest commit

History

Repository files navigation

Tiger HLM Runoff (GPU)

Directory Structure

Building & Running

Optional Build Modes

6. Splitting data

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages