Create a tutorial on gap-free Indian Ocean gridded data with CNNs. This will build on work started during GeoHackWeek 2024. We will try to get a tutorial for U-Net gap-filling working and add to https://ocean-satellite-tools.github.io/mind-the-chl-gap/intro.html. We also hope to get other algorithms working (DINCAE and DINEOF) or at least describe them.
The basic approach is as follows:

```mermaid
graph LR
A[netcdf/Zarr w time, lat, lon] --> G{to xarray}
G --> C[standardized Zarr w masks and season]
C --> D{CNN or U-Net model}
D --> E[Predict: xarray with gaps filled]
```
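A minimal sketch of the standardization step is shown below. The file paths, the chlorophyll variable name `CHL`, and the daily datetime time axis are placeholder assumptions, not the project's actual settings.

```python
import numpy as np
import xarray as xr

# Open the source data lazily (netCDF or Zarr) as an xarray Dataset.
ds = xr.open_dataset("input.zarr", engine="zarr")  # dims: time, lat, lon

# Gap mask: 1 where chlorophyll is observed, 0 where clouds leave a gap.
# "CHL" is a placeholder variable name.
ds["gap_mask"] = xr.where(np.isfinite(ds["CHL"]), 1, 0).astype("int8")

# Season as an integer 0-3 (DJF=0, MAM=1, JJA=2, SON=3) derived from the month.
ds["season"] = (ds["time.month"] % 12) // 3

# Write the standardized dataset to a Zarr store for training.
ds.to_zarr("standardized.zarr", mode="w")
```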
Functions are in the `mindthegap` directory.

```python
import mindthegap as mtg
```
| Name | Role |
|---|---|
| Eli Holmes | Project Facilitator |
| Bruna Cândido | Fellow |
| Trina Xavier | Participant |
| Lilac Hong | Participant |
- Initial idea: Create a tutorial on gap-free Indian Ocean gridded data with U-Net method
- Pitch slide
- Slack channel: ohw25_proj_gap
- repo: https://github.com/oceanhackweek/ohw25_proj_gap
- Final presentation
The ocean covers nearly 70% of Earth's surface, and chlorophyll is a widely used indicator of plankton abundance, making it a key measure of marine productivity and ecosystem health. Estimating chlorophyll concentrations allows researchers to assess phytoplankton biomass, which supports oceanic food webs and contributes to global carbon cycling. Remote sensing with ocean-color instruments enables large-scale monitoring of chlorophyll-a by detecting the light reflected by phytoplankton. However, cloud cover remains a significant challenge: it obstructs surface observations and creates gaps in chlorophyll-a data. These gaps limit our ability to monitor marine productivity accurately and to quantify the contribution of plankton to the global carbon cycle.
Contribute to the "mind-the-chl-gap" project and create a tutorial on gap-free Indian Ocean gridded data with the U-Net method. For OceanHackWeek 2025, we aimed to extend the existing work by exploring different CNN architectures and experimenting with alternative gap-filling tools, such as segmentation_models_pytorch and DINCAE.
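One possible starting point for the segmentation_models_pytorch experiments is sketched below; the encoder choice, channel counts, and patch size are illustrative assumptions rather than settings from the project.

```python
import segmentation_models_pytorch as smp
import torch

# U-Net for gap-filling framed as pixel-wise regression: inputs could be,
# e.g., gappy chlorophyll, a gap mask, and SST (3 channels); the single
# output channel is the reconstructed chlorophyll field.
model = smp.Unet(
    encoder_name="resnet18",   # illustrative encoder choice
    encoder_weights=None,      # train from scratch on ocean data
    in_channels=3,             # placeholder: depends on the predictors used
    classes=1,                 # one output channel (chlorophyll)
)

# A forward pass expects (batch, channels, height, width) tensors.
x = torch.randn(4, 3, 128, 128)
y_hat = model(x)               # shape: (4, 1, 128, 128)
```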
```python
import xarray as xr

# Open the Indian Ocean Zarr store on Google Cloud Storage with
# anonymous (public) read access.
dataset = xr.open_dataset(
    "gcs://nmfs_odp_nwfsc/CB/mind_the_chl_gap/IO.zarr",
    engine="zarr",
    backend_kwargs={"storage_options": {"token": "anon"}},
    consolidated=True,
)
dataset
```
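Continuing from the block above, the dataset can be inspected and subset lazily before anything is downloaded. The variable name `CHL` below is a placeholder; check `dataset.data_vars` for the actual names, and match the latitude slice order to the store's coordinate ordering.

```python
# List the variables available in the Zarr store.
print(dataset.data_vars)

# Lazily select one year and an Indian Ocean box; data are only read
# when values are needed. "CHL" is a placeholder variable name.
subset = dataset["CHL"].sel(
    time=slice("2020-01-01", "2020-12-31"),
    lat=slice(-10, 10),   # flip to slice(10, -10) if lat is stored descending
    lon=slice(50, 70),
)
subset.isel(time=0).plot()  # quick look at a single day (requires matplotlib)
```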
```mermaid
flowchart TD
A[Zarr data] --> B[Data Preprocessing]
B --> C[Model Fit]
C --> D[Result Visualization]
```
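A minimal sketch of the model-fit step, assuming a data loader that already yields (inputs, target, observation-mask) patch tensors; the loss is computed only on observed pixels so cloud gaps do not penalize the model. The loader, model, and hyperparameters are illustrative.

```python
import torch
from torch import nn

def fit(model, loader, epochs=10, lr=1e-3):
    """Train any CNN/U-Net that maps input patches to reconstructed fields."""
    optimizer = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = nn.MSELoss(reduction="none")
    for epoch in range(epochs):
        for inputs, target, obs_mask in loader:
            pred = model(inputs)
            # Score the reconstruction only where observations exist,
            # so cloud gaps do not contribute to the loss.
            per_pixel = loss_fn(pred, target)
            loss = (per_pixel * obs_mask).sum() / obs_mask.sum()
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
        print(f"epoch {epoch}: loss {loss.item():.4f}")
    return model
```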
oceanhackweek.org/ohw25_proj_gap/
- Working with outdated packages can be quite challenging.
- Existing frameworks (e.g., DINCAE) can serve as inspiration but need to be adapted to the specific context.
- Pay attention to memory efficiency — document how much memory is required to run your code and data.
- Collaboration and thorough documentation help improve workflow efficiency.
- Avoid using `to_numpy()` on the full dataset (time, lat, lon, var). Instead, stream patches directly from the Zarr files in batches or use dask (see the sketch after this list).
- Xarray is powerful, with advanced options available in icechunk and cubed.
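A minimal sketch of that streaming idea, assuming a chunked Zarr store and a placeholder variable name `CHL`:

```python
import xarray as xr

# Open lazily with dask-backed chunks; one chunk per time step here.
ds = xr.open_zarr("standardized.zarr", chunks={"time": 1})

def iter_patches(da, patch=64, batch=32):
    """Yield small (lat, lon) patches one batch at a time instead of
    calling to_numpy() on the whole (time, lat, lon) array."""
    batch_buf = []
    for t in range(da.sizes["time"]):
        for i in range(0, da.sizes["lat"] - patch + 1, patch):
            for j in range(0, da.sizes["lon"] - patch + 1, patch):
                # .values here loads only this small patch into memory.
                batch_buf.append(
                    da.isel(time=t, lat=slice(i, i + patch), lon=slice(j, j + patch)).values
                )
                if len(batch_buf) == batch:
                    yield batch_buf
                    batch_buf = []
    if batch_buf:
        yield batch_buf

# "CHL" is a placeholder; use the variable names in your store.
for patches in iter_patches(ds["CHL"]):
    ...  # feed `patches` to the model as one training batch
```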
Create the template in the `book` directory:

```bash
pip install -U jupyter-book
jupyter-book create book
```

Build and push to GitHub. Make sure you are in the `book` directory:

```bash
jupyter-book build .
ghp-import -n -p -f _build/html
```