Revamp obs staging and analysis stats job #4306

DavidNew-NOAA · 2025-12-09T14:34:48Z

Description

This PR makes changes in two areas of GW code for JEDI jobs.

First, it changes how observations are handled in ush/python/pygfs/jedi/jedi.py:

It creates a stage_observations() method for the Jedi class that stages observations for analysis jobs, rather than relying on the task config YAML in GDASApp for staging. This change is justified by the fact that obs staging is essentially the same across all analysis tasks.
Methods for staging, extracting, and tarring bias corrections, and for saving obs diags and radiative bias corrections, are moved from the Analysis class into the Jedi class for the following reason:
The paths, file prefixes, and file suffixes used in these methods for obs, obs diags, and bias corrections are taken from the JCB config dictionary in the Jedi class. This ensures that the file structure and naming for obs and their statistics are consistent between how they are staged in GW and how they are staged and saved by JEDI applications, which avoids naming conflicts.
The JCB config dictionary for a Jedi object is created by the class constructor rather than the initialize method. This way, task_config doesn't need to be passed to both the constructor and the initialize method. One benefit is that task_config is dumped by the logger fewer times.
Other minor changes are made to the Jedi class code such as hardening and more descriptive method/variable naming.
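
A minimal sketch of the idea behind the Jedi class changes. Everything in it is an assumption made for illustration: `JediSketch`, the config keys (`obs_dir`, `obs_prefix`, `obs_suffix`, `observers`), and the choice to skip missing obs files are hypothetical and are not the pygfs implementation or the real JCB keys. It only shows the config being built once in the constructor and the staged file names being derived from it:

```python
import shutil
from pathlib import Path


class JediSketch:
    """Illustrative stand-in for the Jedi class; not the pygfs implementation."""

    def __init__(self, task_config: dict):
        # Build the JCB-style config once, in the constructor, so later
        # methods do not need task_config passed in again.
        # All keys below are hypothetical.
        self.jcb_config = {
            "obs_dir": Path(task_config["DATA"]) / "obs",
            "obs_prefix": task_config["OPREFIX"],
            "obs_suffix": ".nc",
            "observers": list(task_config["observers"]),
        }

    def obs_filename(self, observer: str) -> str:
        """Build a file name from the same config the application would read."""
        cfg = self.jcb_config
        return f"{cfg['obs_prefix']}{observer}{cfg['obs_suffix']}"

    def stage_observations(self, com_obs: Path) -> list:
        """Copy each available obs file from com_obs into the run directory."""
        cfg = self.jcb_config
        cfg["obs_dir"].mkdir(parents=True, exist_ok=True)
        staged = []
        for observer in cfg["observers"]:
            name = self.obs_filename(observer)
            src = Path(com_obs) / name
            # Assumption for this sketch: missing obs are skipped, not fatal.
            if src.is_file():
                shutil.copy(src, cfg["obs_dir"] / name)
                staged.append(cfg["obs_dir"] / name)
        return staged
```

Because staging and naming both read from one dictionary, a change to a prefix or suffix only has to be made in one place.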

Second, ush/python/pygfs/task/analysis_stats.py is refactored in the following ways:

AnalysisStats class now inherits from Analysis rather than Task. This allows it to inherit parameters like APREFIX, GPREFIX, etc.
Changes are made so that task_config is never modified after the class constructor is run, consistent now with all other tasks.
The "base config" and "JEDI config" YAMLs are consolidated into a single master config YAML (like all other tasks now), and any staging/saving that was carried out in the original Python code is now invoked by the FileHandler with that master YAML (data_in and data_out keys).
The run directory is reorganized by analysis type with subdirectories for the inputs and outputs.
Input and output paths for staging/saving are taken from the JCB config dictionary of the relevant Jedi object, to ensure that file paths and naming are consistent between the GW code and JEDI application configuration YAMLs.
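
The refactor above can be illustrated roughly as follows. This is a sketch under stated assumptions: `Analysis`, `AnalysisStats`, and `sync` are simplified stand-ins (the real classes live in pygfs, and the real staging goes through the FileHandler), the prefix formulas assume a 6-hour cycle, and the layout of the master config is hypothetical apart from the `data_in` and `data_out` keys named in the description.

```python
import shutil
from pathlib import Path
from types import MappingProxyType


def sync(spec: dict) -> None:
    """Stand-in for a FileHandler-style sync: make directories, then copy files."""
    for directory in spec.get("mkdir", []):
        Path(directory).mkdir(parents=True, exist_ok=True)
    for src, dest in spec.get("copy", []):
        shutil.copy(src, dest)


class Analysis:
    """Simplified parent that defines shared parameters such as APREFIX."""

    def __init__(self, config: dict):
        run, cyc = config["RUN"], config["cyc"]
        # Hypothetical prefix formulas, assuming a 6-hour cycle.
        local = {
            "APREFIX": f"{run}.t{cyc:02d}z.",
            "GPREFIX": f"{run}.t{(cyc - 6) % 24:02d}z.",
        }
        # Read-only view: task_config cannot be modified after construction.
        self.task_config = MappingProxyType({**config, **local})


class AnalysisStats(Analysis):
    """Inherits APREFIX/GPREFIX from Analysis instead of recomputing them."""

    def initialize(self, master: dict) -> None:
        # Stage inputs listed under the master config's data_in key.
        sync(master["data_in"])

    def finalize(self, master: dict) -> None:
        # Save outputs listed under the master config's data_out key.
        sync(master["data_out"])
```

Wrapping the config in `MappingProxyType` is one way to make "never modified after the constructor" enforceable in a sketch; item assignment on it raises `TypeError`.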

Resolves #4224
Resolves #4228

Type of change

Bug fix (fixes something broken)
New feature (adds functionality)
Maintenance (code refactor, clean-up, new CI test, etc.)

Change characteristics

Is this change expected to change outputs (e.g. value changes to existing outputs, new files stored in COM, files removed from COM, filename changes, additions/subtractions to archives)? YES/NO (If YES, please indicate to which system(s))
- GFS
- GEFS
- SFS
- GCAFS
Is this a breaking change (a change in existing functionality)? YES
Does this change require a documentation update? NO
Does this change require an update to any of the following submodules? YES
- EMC verif-global
- GDAS
- GFS-utils
- GSI
- GSI-monitor
- GSI-utils
- UFS-utils
- UFS-weather-model
- wxflow

How has this been tested?

Clone, build, and full CI suite on Hera

Checklist

CoryMartin-NOAA · 2025-12-09T17:26:20Z

dev/jobs/JGLOBAL_ANALYSIS_STATS

    COMOUT_AERO_ANLMON:COM_CHEM_ANLMON_TMPL \
    COMOUT_SNOW_ANLMON:COM_SNOW_ANLMON_TMPL

+mkdir -m 755 -p "${COMOUT_ATMOS_ANALYSIS}"


we should only mkdir these if these components are active in the experiment

CoryMartin-NOAA · 2025-12-09T17:27:17Z

parm/archive/enkf.yaml.j2

                                        "correction_increment.yaml",
                                        "ensemble_recenter.yaml"] %}
            {% else %}
-                {% set da_stat_files = ["stat.atm.tar"]%}


these are probably the old GSI files and not IODA, right?

CoryMartin-NOAA · 2025-12-09T17:28:12Z

parm/archive/enkf.yaml.j2

        {% endfor %}

        {% if DO_JEDISNOWDA %}
-        - "{{ COMIN_SNOW_ANALYSIS_ENSSTAT | relpath(ROTDIR) }}/{{ head }}snow_analysis.ioda_hofx.ensmean.tar"


I vaguely remember @aerorahul mentioning we should drop the gzipping

CoryMartin-NOAA · 2025-12-09T17:28:31Z

parm/archive/gfs_arcdir.yaml.j2

                                    ARCDIR ~ "/pgbanl." ~ RUN ~ "." ~ cycle_YMDH ~ ".grib2"]) %}

    {% if DO_JEDIATMVAR == True %}
-        {% do det_anl_files.append([COMIN_ATMOS_ANALYSIS ~ "/" ~ head ~ "stat.atm.tar",


same as before, this might be GSI not JEDI/IODA

DavidNew-NOAA added 29 commits November 5, 2025 21:28

Initial commit

78a0279

Update gdas hash

8fdd557

Save changes

d5c4519

Debug

f32e97f

mkdir obs, diags, and bc in stage_obs()

0b62b7e

Attempt to fix weird pynorms

a8fc75a

pynorms?

b061def

Tinkering

a771101

Update

f623e6d

Clean up

ce8a749

Slight name change

2524219

Debug

a690382

Debug

0185dc6

Merge branch 'develop' into feature/stage-obs

df6a972

pynorms

9c83b2f

Update

a425b44

Update

619bb93

debug

68f4390

Merge remote-tracking branch 'origin/develop' into feature/stage-obs

cb40af6

Debug and new features

b19b8d3

Missed files in last commit

e62a3f9

Move diag files and bias corrections to come with JEDI class methods

39c4e46

Some logs, etc

f305ccc

Minor change

37e8d0e

Debug

e46c347

Missing files

f8486f9

Merge branch 'develop' into feature/stage-obs

cc8f48e

Revamp somethings

8f42352

Merge branch 'develop' into feature/stage-obs

6289e2b

DavidNew-NOAA requested a review from jiaruidong2017 as a code owner December 9, 2025 14:34

DavidNew-NOAA requested review from AndrewEichmann-NOAA, CoryMartin-NOAA, DavidHuber-NOAA, RussTreadon-NOAA, aerorahul and guillaumevernieres as code owners December 9, 2025 14:34

DavidNew-NOAA marked this pull request as draft December 9, 2025 14:35

DavidNew-NOAA added 2 commits December 9, 2025 17:08

debug

35b2088

Merge branch 'develop' into feature/stage-obs

cd8e70e

CoryMartin-NOAA requested a review from kevindougherty-noaa December 9, 2025 17:25

CoryMartin-NOAA reviewed Dec 9, 2025
