Skip to content

Project uses NASA’s PACE satellite ocean color data to link plankton dynamics with fish occurrence, revealing how space-based observations can predict patterns in marine fisheries..

License

Notifications You must be signed in to change notification settings

oceanhackweek/ohw25_proj_fishpace

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

76 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

FisHy PACE: Predicting fish distributions from phytoplankton using hyperspectral satellite data

Project Description

Our goal is to show how the new PACE (Plankton, Aerosol, Cloud, ocean Ecosystem) dataset can enhance our ability to predict species distributions.

Collaborators

Name Role Affiliation Github Email
Jon Peake Project Facilitator/Mentor NOAA Fisheries Open Science jonpeake [email protected]
Sam Alaimo Participant Rutgers University salaimo26 [email protected]
Israt Jahan Mili Participant University of Massachusetts--Dartmouth israt-mili [email protected]
Alberto Rivera Participant Scripps Institution of Oceanography buzoAlberto [email protected]
Isidora Rojas Participant Scripps Institution of Oceanography isidora-rojas [email protected]
Max Titcomb Participant Scripps Institution of Oceanography maxtitcomb [email protected]

Planning

  • Initial idea: Understand potential associations between phytoplankton species or abundance and adult fish functional groups or larvae respectively
  • Final idea: Predict adult fish distributions from PACE chlorophyll and absorption data and compare to predictions from MODIS chlorophyll data
  • Ideation Presentation: Link
  • Slack channel: ohw25_proj_fishy
  • Final presentation: Link

Background

PACE (Plankton, Aerosol, Cloud, ocean Ecosystem) is NASA's newest earth-observing satellite that collects hyperspectral measurements across a broad range of visible wavelengths. The data PACE collects allows us to better resolve absorbance spectra at any given point in the ocean than previous earth-observing satellites (i.e., MODIS). This includes better estimates of Chlorophyll-A, which is highly correlated with primary production. Chlorophyll-A concentrations derived from MODIS have been previously shown to be important predictors of fish distributions and abundance. We want to investigate how well PACE Chlorophyll-A data can predict fish distributions compared to MODIS.

Goals

  1. Establish and proof of concept between the ROMS Chlorophyll-A model and CalCOFI datasets

  2. Create species distribution models using Chlorophyll-A data from PACE across adult stages of fish.

  3. Compare the predictive ability of Chlorophyll-A from PACE.

  4. Assess differences in effect sizes between PACE Chlorophyll-A and absorbance to determine whether absorbance is a better predictor of fish occurrence than Chlorophyll

  5. (If time permits) Assess differences in Chlorophyll-A effect size between the Pacific and Atlantic datasets

Datasets

  • PACE Chlorophyll-A

  • Federal Bottom Trawl Survey (NW and NE)

  • CalCOFI (West Coast Larval Dataset)

  • EcoMON (East Coast Larval Dataset)

Workflow/Roadmap

1. Data Collection and Assimilation

First, we pulled data from the bottom trawl fish datasets (NEFSC, NWFSC) and the larval fish datasets (CalCOFI, EcoMON) for the east and west coasts. We identified the area and temporal extent/resolution of each dataset to determine the spatiotemporal limits of our project. We then pulled Chlorophyll-A data from PACE and MODIS, specifying the boundaries and temporal extent for the area and years of interest. We additionally pulled absorbance data from the PACE dataset and environmental data taken at time of sample from the trawl datasets.

2. Data preparation

Trawl Data: Trawl data was prepared by subsetting only the fish species that were caught in the surveys. We focused on year 2024 to match the PACE data range.

East Coast: Since the biological and environmental data is in separate .csv files, these two were merged together with Latitude, Longitude, Surface and Bottom Temperature, Surface and Bottom Salinity and Average Depth. selected. The final data frame consisted of rows corresponding to each individual trawl ID and columns corresponding to a species caught in the trawl, and the environmental data associated with each respective trawl ID.

West Coast: Trawl data was derived from the Northwest Fisheries Science Center (NWFSC) Survey package here: https://pfmc-assessments.github.io/nwfscSurvey/index.html

survey_sites_map (1)

PACE Data: https://pace.gsfc.nasa.gov/

3. Model training

Response & covariate. For each species with sufficient data, I modeled raw CPUE (individuals per ha) from the NWFSC West Coast trawl survey as a function of PACE chlorophyll-a:

$$ \log \big( \mathbb{E}[\text{CPUE}] \big) = \beta_0 + \beta_1 \cdot \text{chlor a standardized} + \text{spatial random field} $$

  • Response: derivation of catch per unit effort in numbers of individuals per hectare, estimated as expanded haul catch in numbers divided by the area swept by the net.
  • Family / link: Tweedie with log link (handles many zeros + right-skewed positive values).
  • Covariate: chlor_a_standardized (PACE Chl-a, z-scored).
  • Species inclusion: kept species present in ≥ 50 trawls with non-zero catch.
  • Spatial structure: barrier mesh over coastline (North America land polygons projected to UTM 10N and scaled to km), with spatial = "on", spatiotemporal = "off".
  • Fitting: sdmTMB with sanity() gating; species failing sanity checks were skipped.
  • Train/test split: random 80/20 by trawl (slice_sample(prop = 0.2) for test set).
  • Prediction grid: satellite grid with UTM km coordinates (pace_grid_no_na) for mapping predictions.

This setup let me estimate a single coefficient per species for the marginal effect of PACE Chl-a on CPUE while accounting for spatial autocorrelation.

4. Model prediction and visualization

For each species that passed sanity checks I generated a standardized four-panel figure: • A — Observed CPUE map (training): bubble map of observed CPUE over land basemap. • B — Predicted CPUE raster (grid): model predictions on the PACE grid (log-scaled fill) with coastline mask. • C — Spatial residuals (held-out): points colored by Pred − Obs and sized by absolute error to show regional over/under-prediction patterns. • D — Bars (held-out): side-by-side Predicted vs Observed CPUE ordered south→north to visualize bias along latitude.

arrowtooth_flounder_cpue_numbers_per_ha_der_final_plot (1)

I also extracted fixed-effects coefficients and 90% CIs for chlor_a_standardized across all species and compiled them into a single table. Using life-history joins, I produced summary coefficient plots colored by dominant movement and by adult prey category. resident_mobile_chla

trophic_level_chla

Results/Findings

Most species: Chl-a effect on adult CPUE is small/uncertain (90% CIs often cross 0). Some negatives: A minority show negative associations with Chl-a. Validation: Held-out bars/residual maps show modest skill with regional bias—Chl-a alone doesn’t explain adult demersal CPUE.

Lessons Learned & Next Steps

Future work will integrate additional PACE products (e.g., attenuation, absorption, backscatter) and external plankton community data from MOANA ERDDAP to test whether optical properties or plankton group composition improve predictions. Exploring time-varying coefficients and spatiotemporal models will help assess how relationships shift across seasons and years, providing a clearer picture of when and where PACE data best explain fish distributions.

References

Data

About

Project uses NASA’s PACE satellite ocean color data to link plankton dynamics with fish occurrence, revealing how space-based observations can predict patterns in marine fisheries..

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •