Update `benchmark_step.jl` for CUDA benchmarking with useful kernel names #4055

petebachant · 2025-10-14T16:58:48Z

This uses CliMA/ClimaCore.jl#2376 to provide more useful CUDA kernel names in benchmarks.

TODO

Can we format the kernel names with anything other than underscores as separators? --> No, all non-alphanumeric characters get converted to underscores.
Do we want to do this on the benchmarks that use benchmark.jl, not benchmark_step.jl? --> we could, but probably not useful for CI yet.
Update Buildkite pipeline to use this feature
Switch to non-dev ClimaCore

perf/benchmark_step.jl

petebachant · 2025-10-14T20:48:14Z

.buildkite/pipeline.yml


  - group: "Reproducibility infrastructure"
    steps:
-


These changes were made by a YAML auto-formatter in VS Code. Is there a style guide I might be breaking here?

I'm not sure... this is something I have been wondering as well. I considered following this example, which is used in Buildkite's docs.

perf/benchmark_step.jl

…/gpu-perf-2

petebachant · 2025-11-04T16:27:22Z

@dennisYatunin @imreddyTeja thoughts on the timeouts here? Should I increase the limit or disable kernel renaming?

imreddyTeja · 2025-11-04T18:15:58Z

@dennisYatunin @imreddyTeja thoughts on the timeouts here? Should I increase the limit or disable kernel renaming?

What is the advantage of using kernel renaming in climaatmos-ci for buildkite steps that don't profile? Is the idea to run the profiler at the end of each simulation?

Comparing the buildkite for this PR to the main's last buildkite run shows a ~40% slowdown, but everything after the first step doesn't seem to be affected. I'm not sure if that cost is worth it at the moment.

petebachant · 2025-11-04T19:15:12Z

What is the advantage of using kernel renaming in climaatmos-ci for buildkite steps that don't profile? Is the idea to run the profiler at the end of each simulation?

I believe that was Dennis's vision, and then we'd process and summarize all of the profiling results together in a later step.

This reverts commit 427b8db.

petebachant · 2025-11-12T16:01:22Z

Alright, so now that CliMA/ClimaCore.jl#2376 is merged, do I need to make a ClimaCore release, or should I simply update the .buildkite project to refer to a specific ClimaCore Git commit?

I suppose I could revert any Buildkite changes here and just keep the updates to the benchmarking script.

imreddyTeja · 2025-11-13T00:38:21Z

.buildkite/pipeline.yml

  - group: "Benchmarks"
    steps:
-
      - label: ":computer: Benchmark: CPU baroclinic wave moist"


I think this label is incorrect

imreddyTeja · 2025-11-13T00:41:54Z

perf/benchmark_step.jl

 redirect_stderr(IOContext(stderr, :stacktrace_types_limited => Ref(false)))
 import ClimaComms
 ClimaComms.@import_required_backends
+import ClimaCore


Is this import used?

Good catch. It's no longer used. ClimaComms was also imported twice. We really need to figure out how to get linting working!

imreddyTeja · 2025-11-13T00:53:00Z

Alright, so now that CliMA/ClimaCore.jl#2376 is merged, do I need to make a ClimaCore release, or should I simply update the .buildkite project to refer to a specific ClimaCore Git commit?

I suppose I could revert any Buildkite changes here and just keep the updates to the benchmarking script.

I think we should make a ClimaCore release. It would be nice if we updated the prettytables compat before releasing.
After the release and updating the manifest, I think this PR is good to go.

…/gpu-perf-2

petebachant and others added 4 commits October 6, 2025 10:47

Update benchmark_step.jl for CUDA profiling

abe1f57

Fix external profiler determination

cbad8ac

Get kernel naming option from ClimaCore

df5f349

Control kernel naming via env var

606f584

petebachant marked this pull request as draft October 14, 2025 16:58

petebachant added 5 commits October 14, 2025 10:07

Use dev version of ClimaCore

f369d6c

Short-circuit GPU benchmark based on device

e4ce7a2

Rename kernels in buildkite

a521070

Autoformat .buildkite/pipeline.yml

04e0454

Improve logging

8a931f5

petebachant commented Oct 14, 2025

View reviewed changes

perf/benchmark_step.jl Outdated Show resolved Hide resolved

petebachant commented Oct 14, 2025

View reviewed changes

imreddyTeja reviewed Oct 14, 2025

View reviewed changes

perf/benchmark_step.jl Show resolved Hide resolved

imreddyTeja reviewed Oct 15, 2025

View reviewed changes

perf/benchmark_step.jl Outdated Show resolved Hide resolved

petebachant and others added 13 commits October 17, 2025 09:53

Always import CUDA

215255f

Name kernels from stack trace in benchmark GPU default

0b362e7

Merge branch 'main' of https://github.com/CliMA/ClimaAtmos.jl into pb…

cba2694

…/gpu-perf-2

Set stacktrace-based kernel names before compiling

6620982

Print internal profling result in benchmark_step.jl

3dcca85

Relocate function so it can be called

b228a35

Update ClimaCore dev dep

4439d43

Update ClimaCore

f36f405

Trigger build

a09bbc0

Merge main

317dca8

Fix url

1153ca3

Widen display size for CUDA profiling results

7aa7dea

Narrow print

f4429ff

petebachant marked this pull request as ready for review October 22, 2025 18:30

petebachant requested review from daverumph and dennisYatunin October 22, 2025 18:31

petebachant and others added 5 commits October 27, 2025 15:47

Update ClimaCore

63e213d

Set kernel naming from stack trace enabled for entire buildkite pipeline

512471e

Update ClimaCore

175164c

Update ClimaCore

7dd4645

Merge branch 'main' of https://github.com/CliMA/ClimaAtmos.jl into pb…

88cc058

…/gpu-perf-2

petebachant and others added 14 commits November 4, 2025 11:45

Update ClimaCore

7e959cd

Update ClimaCore and only rename kernels in specific benchmarks

77ef5e8

Update ClimaAtmos

d7befca

Update ClimaCore

5559604

Update comment

adde261

Update comment

427b8db

Revert "Update comment"

ad20c71

This reverts commit 427b8db.

Merge branch 'main' of github.com:CliMA/ClimaAtmos.jl into pb/gpu-perf-2

39c6438

Switch back to function redef method

395a0f0

Update ClimaCore

e92a8a5

Update ClimaCore

b6ca744

Update ClimaCore

df705c4

More env var reading back into core

5adf780

Update ClimaCore

d7c6177

imreddyTeja reviewed Nov 13, 2025

View reviewed changes

petebachant and others added 4 commits November 13, 2025 07:22

Clean up imports

1f431bc

Merge branch 'main' of https://github.com/CliMA/ClimaAtmos.jl into pb…

d847d5d

…/gpu-perf-2

Merge branch 'main' of github.com:CliMA/ClimaAtmos.jl into pb/gpu-perf-2

5fed428

Update ClimaCore

8616282

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update `benchmark_step.jl` for CUDA benchmarking with useful kernel names #4055

Update `benchmark_step.jl` for CUDA benchmarking with useful kernel names #4055

Uh oh!

petebachant commented Oct 14, 2025 •

edited

Loading

Uh oh!

Uh oh!

petebachant Oct 14, 2025

Uh oh!

imreddyTeja Oct 14, 2025

Uh oh!

Uh oh!

Uh oh!

petebachant commented Nov 4, 2025

Uh oh!

imreddyTeja commented Nov 4, 2025

Uh oh!

petebachant commented Nov 4, 2025

Uh oh!

petebachant commented Nov 12, 2025

Uh oh!

imreddyTeja Nov 13, 2025

Uh oh!

imreddyTeja Nov 13, 2025

Uh oh!

petebachant Nov 13, 2025

Uh oh!

imreddyTeja commented Nov 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Update benchmark_step.jl for CUDA benchmarking with useful kernel names #4055

Are you sure you want to change the base?

Update benchmark_step.jl for CUDA benchmarking with useful kernel names #4055

Uh oh!

Conversation

petebachant commented Oct 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

TODO

Uh oh!

Uh oh!

petebachant Oct 14, 2025

Choose a reason for hiding this comment

Uh oh!

imreddyTeja Oct 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

petebachant commented Nov 4, 2025

Uh oh!

imreddyTeja commented Nov 4, 2025

Uh oh!

petebachant commented Nov 4, 2025

Uh oh!

petebachant commented Nov 12, 2025

Uh oh!

imreddyTeja Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

imreddyTeja Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

petebachant Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

imreddyTeja commented Nov 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Update `benchmark_step.jl` for CUDA benchmarking with useful kernel names #4055

Update `benchmark_step.jl` for CUDA benchmarking with useful kernel names #4055

petebachant commented Oct 14, 2025 •

edited

Loading