add benchmark script and testing #158

JoshuaLampert · 2024-10-28T12:02:02Z

I recently came across AirspeedVelocity.jl and think it is a nice tool to run benchmarks for changes in a PR. The GitHub Action file is (almost) a copy of https://github.com/SymbolicML/DynamicQuantities.jl/blob/21b7468801c773c5072c6db358f2fddcb8529ff9/.github/workflows/benchmark_pr.yml.
For the benchmark test I included some elixirs, which should cover most of the features.

codecov · 2024-10-28T12:25:55Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 97.97%. Comparing base (2061e53) to head (820f2b2).
Report is 2 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #158   +/-   ##
=======================================
  Coverage   97.97%   97.97%           
=======================================
  Files          19       19           
  Lines        1776     1776           
=======================================
  Hits         1740     1740           
  Misses         36       36

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

JoshuaLampert · 2024-10-28T12:43:45Z

Looks like some test values need to be adjusted. Tests were also failing locally. I used the values I now got locally.

JoshuaLampert · 2024-10-28T13:39:23Z

I'm not sure why there is the UndefVarError: sol not defined error in the new benchmark Action. Looks like some namespace issue. I guess that's related to the fact that AirspeedVelocity.jl creates a module internally, but I don't know how to fix that. Locally, it runs fine.

ranocha

Sounds like a nice idea to try out - if it does not cause too much work/trouble to set it up

JoshuaLampert · 2024-10-29T09:02:58Z

I have set it up in another repo, which worked easily and nicely. But I'm not sure what to do with the error here.

benchmark/benchmarks.jl

github-actions · 2024-10-29T09:20:25Z

Benchmark Results

	main	`820f2b2`...	main/820f2b2d99ae92...
bbm_1d/bbm_1d_basic.jl	13.9 ± 0.29 μs	13.8 ± 0.29 μs	1.01
bbm_1d/bbm_1d_fourier.jl	0.528 ± 0.0039 ms	0.216 ± 0.015 ms	2.45
bbm_bbm_1d/bbm_bbm_1d_basic_reflecting.jl	0.114 ± 0.0034 ms	0.121 ± 0.0034 ms	0.944
bbm_bbm_1d/bbm_bbm_1d_dg.jl	0.0344 ± 0.00048 ms	0.0341 ± 0.00047 ms	1.01
bbm_bbm_1d/bbm_bbm_1d_relaxation.jl	27.6 ± 0.46 μs	27.4 ± 0.41 μs	1.01
bbm_bbm_1d/bbm_bbm_1d_upwind_relaxation.jl	0.0485 ± 0.00054 ms	0.0485 ± 0.00091 ms	1
hyperbolic_serre_green_naghdi_1d/hyperbolic_serre_green_naghdi_dingemans.jl	4.1 ± 0.012 μs	4.1 ± 0.013 μs	1
serre_green_naghdi_1d/serre_green_naghdi_well_balanced.jl	0.198 ± 0.0074 ms	0.201 ± 0.0062 ms	0.985
svaerd_kalisch_1d/svaerd_kalisch_1d_dingemans_relaxation.jl	0.145 ± 0.0029 ms	0.151 ± 0.0037 ms	0.958
time_to_load	1.98 ± 0.017 s	1.99 ± 0.024 s	0.996

Benchmark Plots

A plot of the benchmark results have been uploaded as an artifact to the workflow run for this PR.

JoshuaLampert · 2024-10-29T09:21:37Z

Seems like trixi_include is the problem. With include instead it works. But then we cannot set a smaller time span for the ODE.

JoshuaLampert · 2024-10-29T10:11:31Z

This seems fine now. The plots can be found here.
It's not optimal that all elixirs are run for the whole time span (because trixi_include creates namespace issues in combination with AirspeedVelocity.jl), but with a total runtime of less than 10 minutes I think it's still reasonable.

ranocha · 2024-10-29T11:03:52Z

Seems like trixi_include is the problem. With include instead it works. But then we cannot set a smaller time span for the ODE.

trixi_include([mod::Module=Main,] elixir::AbstractString; kwargs...) uses Main as module whereas include uses the current module. Can you use @__MODULE__ as first argument to trixi_include?

JoshuaLampert · 2024-10-29T11:08:04Z

Seems like trixi_include is the problem. With include instead it works. But then we cannot set a smaller time span for the ODE.

trixi_include([mod::Module=Main,] elixir::AbstractString; kwargs...) uses Main as module whereas include uses the current module. Can you use @MODULE as first argument to trixi_include?

Ah, I see. Thanks! Let' see.

ranocha

Looks reasonable to me. Benchmarks on GitHub runners can be quite noisy but let's see 👍

.github/workflows/benchmark.yml

test/test_serre_green_naghdi_1d.jl

ranocha · 2024-10-29T11:11:43Z

This seems fine now. The plots can be found here. It's not optimal that all elixirs are run for the whole time span (because trixi_include creates namespace issues in combination with AirspeedVelocity.jl), but with a total runtime of less than 10 minutes I think it's still reasonable.

Is there some way to display the plots in a PR comment like the table above?

JoshuaLampert · 2024-10-29T12:11:29Z

This seems fine now. The plots can be found here. It's not optimal that all elixirs are run for the whole time span (because trixi_include creates namespace issues in combination with AirspeedVelocity.jl), but with a total runtime of less than 10 minutes I think it's still reasonable.

Is there some way to display the plots in a PR comment like the table above?

Good question. I agree that this would be nice. Let me see if I can wangle that.

Co-authored-by: Hendrik Ranocha <[email protected]>

JoshuaLampert · 2024-10-29T12:55:32Z

This seems fine now. The plots can be found here. It's not optimal that all elixirs are run for the whole time span (because trixi_include creates namespace issues in combination with AirspeedVelocity.jl), but with a total runtime of less than 10 minutes I think it's still reasonable.

Is there some way to display the plots in a PR comment like the table above?

Good question. I agree that this would be nice. Let me see if I can wangle that.

I think it's not so easy. We would need to upload it somewhere else and then put the link into the comment, cf. peter-evans/create-or-update-comment#68. IMHO, that's a bit too sophisticated. The numbers in the table also show most relevant information. EDIT: We can at least link to the artifact, which saves some clicks, but having the plot inside the comment is not so easy, I think. Artifacts are always uploaded as zip, see https://github.com/actions/upload-artifact?tab=readme-ov-file#zip-archives.
What makes this even harder in the general case is the fact that multiple plots are generated by AirspeedVelocity.jl if there are a lot of benchmarks tests (the default is 10 benchmark tests per image, i.e. we have only one image because we currently have 10 benchmark tests).

ranocha

I see. Thanks!

JoshuaLampert added 5 commits October 28, 2024 12:45

add benchmark script and testing

a0c35a0

add compat bounds

02af774

update CompatHelper

407556b

ignore benchmark for CI

de9c1c9

format

eba7cb6

adjust test values

a4d3a6d

JoshuaLampert added performance testing labels Oct 28, 2024

soften tolerances

7d53417

ranocha reviewed Oct 29, 2024

View reviewed changes

WIP: try include instead of trixi_include

ab2d2b0

JoshuaLampert marked this pull request as draft October 29, 2024 09:08

github-actions bot reviewed Oct 29, 2024

View reviewed changes

benchmark/benchmarks.jl Outdated Show resolved Hide resolved

JoshuaLampert added 6 commits October 29, 2024 10:29

uncomment all elixirs again

9dfcc7f

fix typo

95d2181

instantiate

c918123

don't clutter terminal

e43fcd7

don't activate in benchmarks.jl

9698c9c

add README

ec092c8

JoshuaLampert marked this pull request as ready for review October 29, 2024 10:08

JoshuaLampert requested a review from ranocha October 29, 2024 10:12

move develop after activate

61981f0

use @__MODULE__ in trixi_include

31f1956

ranocha previously approved these changes Oct 29, 2024

View reviewed changes

.github/workflows/benchmark.yml Outdated Show resolved Hide resolved

test/test_serre_green_naghdi_1d.jl Outdated Show resolved Hide resolved

test to include plot in comment

a2f47b3

JoshuaLampert dismissed ranocha’s stale review via a2f47b3 October 29, 2024 12:29

JoshuaLampert and others added 2 commits October 29, 2024 13:29

Apply suggestions from code review

5991ec2

Co-authored-by: Hendrik Ranocha <[email protected]>

use braces

cc4397b

JoshuaLampert added 2 commits October 29, 2024 13:56

return to original version

bfac9c2

try embedding link to artifact

820f2b2

JoshuaLampert requested a review from ranocha October 29, 2024 13:26

ranocha approved these changes Oct 29, 2024

View reviewed changes

JoshuaLampert merged commit 17a3d77 into main Oct 29, 2024
18 checks passed

JoshuaLampert deleted the benchmark branch October 29, 2024 15:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add benchmark script and testing #158

add benchmark script and testing #158

JoshuaLampert commented Oct 28, 2024

codecov bot commented Oct 28, 2024 •

edited

Loading

JoshuaLampert commented Oct 28, 2024

JoshuaLampert commented Oct 28, 2024

ranocha left a comment

JoshuaLampert commented Oct 29, 2024

github-actions bot commented Oct 29, 2024 •

edited

Loading

JoshuaLampert commented Oct 29, 2024

JoshuaLampert commented Oct 29, 2024

ranocha commented Oct 29, 2024

JoshuaLampert commented Oct 29, 2024

ranocha left a comment

ranocha commented Oct 29, 2024

JoshuaLampert commented Oct 29, 2024

JoshuaLampert commented Oct 29, 2024 •

edited

Loading

ranocha left a comment

add benchmark script and testing #158

add benchmark script and testing #158

Conversation

JoshuaLampert commented Oct 28, 2024

codecov bot commented Oct 28, 2024 • edited Loading

Codecov Report

JoshuaLampert commented Oct 28, 2024

JoshuaLampert commented Oct 28, 2024

ranocha left a comment

Choose a reason for hiding this comment

JoshuaLampert commented Oct 29, 2024

github-actions bot commented Oct 29, 2024 • edited Loading

Benchmark Results

Benchmark Plots

JoshuaLampert commented Oct 29, 2024

JoshuaLampert commented Oct 29, 2024

ranocha commented Oct 29, 2024

JoshuaLampert commented Oct 29, 2024

ranocha left a comment

Choose a reason for hiding this comment

ranocha commented Oct 29, 2024

JoshuaLampert commented Oct 29, 2024

JoshuaLampert commented Oct 29, 2024 • edited Loading

ranocha left a comment

Choose a reason for hiding this comment

codecov bot commented Oct 28, 2024 •

edited

Loading

github-actions bot commented Oct 29, 2024 •

edited

Loading

JoshuaLampert commented Oct 29, 2024 •

edited

Loading