This issue formalizes the suggestion from a comment on PR #691. Adding a GitHub Actions workflow or Prow Job that compares benchmark results against main would help catch performance issues sooner, like the one tracked in #720.
@ahrtr, a few questions to finalize the scope of this task:
- What would be a good set of benchmark parameters to run?
- Should this run on GitHub Actions or as a Prow Job?
- What would be the priority for this task?