
Conversation

gspetro-NOAA (Collaborator) commented Oct 8, 2025

Commit Queue Requirements:

  • This PR addresses a relevant WM issue (if not, create an issue).
  • All subcomponent pull requests (if any) have been reviewed by their code managers.
  • Run the full Intel+GNU RT suite (compared to current baselines), preferably on Ursa (Derecho or Hercules are acceptable alternatives). Exceptions: documentation-only PRs, CI-only PRs, etc.
    • Commit log file w/full results from RT suite run (if applicable).
    • Verify that test_changes.list indicates which tests, if any, are changed by this PR. Commit test_changes.list, even if it is empty.
  • Fill out all sections of this template.

Description:

This PR is currently being used to test a GitHub Actions workflow that will hopefully resolve Issue #2527. Once the "Regression Resource Check / write-results (pull_request)" check at the bottom of the PR has passed, the scorecard can be viewed by clicking on it. Then, on the left-hand side of the page that opens, click Summary, scroll down, and click "Runtime Results Summary" and/or "Memory Results Summary." See here, for example.

The scorecard currently:

  • Extracts runtime/memory values from the RT logs at the HEAD of the current PR.
  • Extracts the last 10 commits from the develop branch to calculate the mean and standard deviation of runtime and memory per test.
  • Compares the runtime/memory at the HEAD of the current PR against the values from the last two commits to develop.
    • For a specific test on a given machine (as sketched below):
      • ✅ indicates normal runtime/memory.
      • ⚠️ indicates that the runtime/memory value is greater than two standard deviations above the mean.
      • ❌ indicates that runtime/memory has been greater than two standard deviations above the mean for the past two PRs.
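
For concreteness, the flagging rule amounts to a simple two-sigma check per (test, machine) pair. The sketch below is illustrative only, not the workflow's actual code; `classify` and its arguments are hypothetical names:

```python
import statistics

def classify(history, current, previous):
    """Flag a test's current value against its historical distribution.

    history  -- runtime or memory values from recent develop commits
    current  -- value at the HEAD of this PR
    previous -- value from the prior commit to develop

    Returns "pass", "warn" (current > mean + 2*sigma), or
    "fail" (both current and previous exceed the threshold).
    """
    mean = statistics.mean(history)
    sigma = statistics.stdev(history)
    threshold = mean + 2 * sigma
    if current > threshold and previous > threshold:
        return "fail"   # rendered as ❌
    if current > threshold:
        return "warn"   # rendered as ⚠️
    return "pass"       # rendered as ✅

# Example: a runtime that jumped well past mean + 2*sigma
print(classify([61.0, 59.5, 60.2, 60.8, 59.9], current=75.0, previous=60.1))  # warn
```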

In progress:

  • Caching for historical data --> The get_data task takes a few minutes to run when extracting 30+ commits (rather than the 10 it currently uses), but more commits mean less variance in the mean/std values. The plan is for the workflow to extract the historical data once, cache it, and reference the cache on future runs to avoid repeating that step (see the sketch after this list).
  • Reporting only the tests that have warnings/failures in each row? This is especially important for memory, where most tests seem to be in the normal range most of the time.
  • Testing to ensure that values are as expected.
  • Refactoring --> introduce better logging messages, reduce code duplication, increase clarity, improve documentation, etc.
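
A minimal sketch of the caching idea, assuming the per-test stats are serialized to a JSON file (in GitHub Actions, that file would typically be persisted between runs with the stock actions/cache action); `load_or_extract`, `extract_fn`, and the file name are hypothetical:

```python
import json
from pathlib import Path

CACHE = Path("historical_stats.json")  # hypothetical cache file name

def load_or_extract(extract_fn, commits):
    """Reuse previously extracted per-test stats when the cache is warm.

    extract_fn -- hypothetical callable that parses the RT logs for the
                  given develop commits and returns {test: [values, ...]}
    """
    if CACHE.exists():
        return json.loads(CACHE.read_text())
    data = extract_fn(commits)          # the slow step: parsing 30+ logs
    CACHE.write_text(json.dumps(data))  # persisted between runs by the cache
    return data
```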

Commit Message:

* UFSWM - Create scorecard for runtime/memory metrics by machine

Priority:

  • Critical Bugfix: Reason
  • High: Reason
  • Normal

Git Tracking

UFSWM:

Sub component Pull Requests:

  • None

UFSWM Blocking Dependencies:

  • Blocked by #
  • None

Documentation:

  • Documentation update required.
    • Relevant updates are included with this PR.
    • A WM issue has been opened to track the need for a documentation update; a person responsible for submitting the update has been assigned to the issue (link issue).
  • Documentation update NOT required.
    • Explanation: This is CI/CD testing targeted toward CMs, not users.

Changes

Regression Test Changes (Please commit test_changes.list):

  • PR Adds New Tests/Baselines.
  • PR Updates/Changes Baselines.
  • No Baseline Changes.

Input data Changes:

  • None.
  • New input data.
  • Updated input data.

Library Changes/Upgrades:

  • Required
    • Library names w/versions:
    • Git Stack Issue (JCSDA/spack-stack#)
  • No Updates

Testing Log:

  • RDHPCS
    • Hera
    • Orion
    • Hercules
    • GaeaC6
    • Derecho
    • Ursa
  • WCOSS2
    • Dogwood/Cactus
    • Acorn
  • CI
  • opnReqTest (complete task if unnecessary)

gspetro-NOAA and others added 30 commits September 18, 2025 07:32
…d build env to remove gnu from stack (was ufs-community#2842) (ufs-community#2867)

* UFSWM - update ufs_noaacloud.intel.lua module file
* UFSWM - replace icplocn2atm with use_oceanuv in scripts and tests
  * CMEPS - update CCPP metadata and type defs for use_oceanuv
  * FV3 - 
    * ccpp-physics - replace instances of icplocn2atm with use_oceanuv
    * atmos_cubed_sphere - replace instances of icplocn2atm with use_oceanuv
  * NOAHMP - replace icplocn2atm with use_oceanuv
gspetro-NOAA (Collaborator, Author) commented Oct 13, 2025

@DeniseWorthen I've updated this so that only warning/failing tests are reported. At the bottom, I show the number of tests passing on each platform, but that could easily be inverted to show how many are warning/failing for runtime/memory on each platform. I could also report percentages or decimal values (0 to 1) if preferred. In theory, I could add two rows, one with warnings and one with failures. There are lots of options, so I'd like to hear what you think would be most useful. I can stick to your original idea if that's what you prefer, but I wanted to propose options! Current output here.

I also added a column that shows the number of platforms on which a test is passing. A row of mostly red would also be cause for concern, since it suggests an issue with the specific test rather than with a particular platform.
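
For reference, that per-test pass count is a simple reduction over the results matrix. The sketch below is illustrative only; the function name and the results map are made up:

```python
def platform_pass_counts(results):
    """Count passing platforms per test from a {test: {machine: status}} map.

    A row that fails on most machines points at the test itself rather
    than at any one platform.
    """
    return {
        test: sum(1 for status in by_machine.values() if status == "pass")
        for test, by_machine in results.items()
    }

results = {
    "cpld_control_p8": {"hera": "pass", "ursa": "pass", "derecho": "warn"},
    "regional_atm":    {"hera": "fail", "ursa": "fail", "derecho": "fail"},
}
print(platform_pass_counts(results))  # {'cpld_control_p8': 2, 'regional_atm': 0}
```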

For Jong's plots, I believe they are only for Ursa, and it would be a lot of plots if we did one for each machine. Should we just use Ursa as a reference machine for the plots? Or do you think it would be useful to have plots for every test on every machine?


Labels

No Baseline Change

Projects

Status: Draft

Development

Successfully merging this pull request may close these issues.

track time/memory use statistics reported in RT logs
