Usage Scenario Variables and ScenarioRunner templates #1172

ArneTR · 2025-05-01T09:43:17Z

Adds measurement templates for beginners and quick measurements with GMT.

YOu can now do:

Quickly measure a website
- ./run-template website "https://www.google.de"
Quickly measure an AI prompt (will default to gemma3:1b)
- ./run-template ai "How cool is the GMT?"

This makes the tool more beginner friendly, but also abstracts away some of the pitfalls that are happing with measuring these two cases for instance that for both you need to have sensible providers active and for websites the resolution should be at least 1/10 of the page load time (typically 10 ms)

We will iterate on these modes and add warnings / guard-clauses as needed, but will have them as bare like this for the moment.

Greptile Summary

Added measurement templates and usage scenario variables to enable quick website and AI measurements, with significant updates to the ScenarioRunner and API endpoints.

Added usage_scenario_variables JSONB column to runs, jobs, and watchlist tables with proper constraints
Introduced /v2/jobs and /v2/runs endpoints, deprecating v1 versions with backward compatibility
Added validation and error handling for usage scenario variables in ScenarioRunner
Added test coverage for variable handling, template functionality and API changes
Security concern: Potential shell injection risk in Docker command execution with shell=True

ArneTR · 2025-05-01T09:44:36Z

@ribalba How do you like these modes? Do they fulfill your idea of an easy mode?

I am keeping the templates in the GMT repository and thus they are automatically versioned. To not muddy the overview of which version they have been changed in the GMT now overloads the commit-hash with the actual folder of the templates. Thus a change to the AI template will still keep the same commit_hash for the website template and vice versa.

* main: Root DIR of GMT was not accurate Renamed Runner to ScenarioRunner and moved to lib/ Software add now returns job_id on insert; Jobs API now allows filter for job_id [skip ci] (#1170)

greptile-apps

_{23 file(s) reviewed, 11 comment(s)}
_{Edit PR Review Bot Settings | Greptile}

lib/scenario_runner.py

runner.py

templates/website/Dockerfile

templates/website/squid.conf.intercept-cache

templates/website/README.md

templates/website/visit.py

tests/test_config_opts.py

github-actions · 2025-05-01T14:30:19Z

Old Energy Estimation

Eco CI Output:

Label	🖥 avg. CPU utilization [%]	🔋 Total Energy [Joules]	🔌 avg. Power [Watts]	Duration [Seconds]
Measurement #1	28.5396	3227.83	4.18	771.37
---	---	---	---	---
Total Run	28.54	3227.83	4.18	771.37
---	---	---	---	---
Additional overhead from Eco CI	N/A	9.11	4.00	2.28

🌳 CO2 Data:
City: Chicago, Lat: 41.8835, Lon: -87.6305
IP: 172.183.175.195
CO₂ from energy is: 1.191069270 g
CO₂ from manufacturing (embodied carbon) is: 0.220082512 g
Carbon Intensity for this location: 369 gCO₂eq/kWh
SCI: 1.411152 gCO₂eq / pipeline run emitted

Total cost of whole PR so far:

…MLEscape on insert

…un* endpoint

ArneTR · 2025-05-02T05:21:49Z

@greptileai

greptile-apps

_{29 file(s) reviewed, 10 comment(s)}
_{Edit PR Review Bot Settings | Greptile}

api/scenario_runner.py

api/api_helpers.py

frontend/js/helpers/runs.js

runner.py

templates/website/squid.conf.intercept-cache

templates/website/visit.py

tests/test_runner.py

ArneTR · 2025-05-02T09:05:31Z

@greptileai

greptile-apps

LGTM

_{24 file(s) reviewed, no comment(s)}
_{Edit PR Review Bot Settings | Greptile}

github-actions · 2025-05-02T09:26:29Z

Eco CI Output:

Label	🖥 avg. CPU utilization [%]	🔋 Total Energy [Joules]	🔌 avg. Power [Watts]	Duration [Seconds]
Measurement #1	23.597	3801.55	3.82	995.44
---	---	---	---	---
Total Run	23.60	3801.55	3.82	995.44
---	---	---	---	---
Additional overhead from Eco CI	N/A	11.63	4.07	2.86

🌳 CO2 Data:
City: Chicago, Lat: 41.8835, Lon: -87.6305
IP: 20.88.39.126
CO₂ from energy is: 1.532024650 g
CO₂ from manufacturing (embodied carbon) is: 0.284012777 g
Carbon Intensity for this location: 403 gCO₂eq/kWh
SCI: 1.816037 gCO₂eq / pipeline run emitted

Total cost of whole PR so far:

* main: (27 commits) Bump orjson from 3.10.16 to 3.10.18 (#1168) Bump pydantic from 2.11.3 to 2.11.4 (#1169) Bump psycopg[binary] from 3.2.6 to 3.2.7 (#1171) Resource limits now also from services key; Added tests (#1173) (fix): Cron job queue check logic was reversed Updated ee Moving [system] and [machine] to upper case Shortened model Watchlist must insert usage_scenario_variables Comment for MCP Job ID now a field to filter by Usage Scenario Variables and ScenarioRunner templates (#1172) Disabled providers are now removed also from DB entry not only from effective measurement (fix): Entries for config options that were null where not correctly showing Made temperatur error more helpful Typo Root DIR of GMT was not accurate Renamed Runner to ScenarioRunner and moved to lib/ Software add now returns job_id on insert; Jobs API now allows filter for job_id [skip ci] (#1170) (test-fix): New wording ...

ArneTR added 5 commits May 1, 2025 09:17

Renamed Runner to ScenarioRunner and moved to lib/

a1ce10e

Root DIR of GMT was not accurate

fa0e33d

Added scenario runner AI and Website templates

73167fe

Stray newline

04ea245

Abstracted temp file creation for quick measurement modes

4c9c3c0

ArneTR mentioned this pull request May 1, 2025

Adding a no-backend option for checking a site #792

Open

Merge branch 'main' into scenario-runner-templates [skip ci]

1850cc9

* main: Root DIR of GMT was not accurate Renamed Runner to ScenarioRunner and moved to lib/ Software add now returns job_id on insert; Jobs API now allows filter for job_id [skip ci] (#1170)

greptile-apps bot reviewed May 1, 2025

View reviewed changes

ArneTR added 3 commits May 1, 2025 11:55

Some smaller styling and pythonic fixes [skip ci]

c256831

Memory leak prevention [skip ci]

2846052

Switched to variable replacement mechanism in runner.py directly

ad61ac9

ArneTR changed the title ~~Scenario runner templates~~ Usage Scenario Variables and ScenarioRunner templates May 1, 2025

Made template var matching stricter to __GMT_VAR_ [skip ci]

5f62f79

ArneTR added 4 commits May 2, 2025 06:33

Adding usage_scenario_variables adding to ScenarioRunner; Removing HT…

a0ce1a5

…MLEscape on insert

Removed escaping on insert

126fe08

Adding usage scenario variables to API incl. compare; Moving to /v2/r…

c41e9c3

…un* endpoint

Deprecating old /v1/run* endpoints

4831baf

greptile-apps bot reviewed May 2, 2025

View reviewed changes

ArneTR added 9 commits May 2, 2025 08:22

Updated demo data with new usage_scenario_variables column

bbd0002

Added usage_scenario_variables to test diff

e91980f

Runner could not handle not supplied variables

ae670b8

(fix): Compare mode key must be forced to string everywhere

bb75a8b

Added test case for Usage Scenario Variables compare mode

16e7275

Typos

a856d63

Test fix

eb87719

Test for Usage Scenario Variables frontend

1476354

Jobs endpint v2, more tests and some fixes

df241f8

Added cluster/job run test

26f158c

ArneTR added 2 commits May 2, 2025 11:07

TEst fix [skip ci]

d6526fc

Merge branch 'main' into scenario-runner-templates

770b0f9

greptile-apps bot reviewed May 2, 2025

View reviewed changes

Jobs API display fix [skip ci]

a26e969

Made the model also a variable for template runs; GPU Template added

ffbb0ee

ArneTR merged commit 4f2a73b into main May 2, 2025
1 check failed

ArneTR deleted the scenario-runner-templates branch May 2, 2025 10:26

Usage Scenario Variables and ScenarioRunner templates #1172

Usage Scenario Variables and ScenarioRunner templates #1172

Uh oh!

Conversation

ArneTR commented May 1, 2025 • edited by greptile-apps bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Uh oh!

ArneTR commented May 1, 2025

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented May 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ArneTR commented May 2, 2025

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ArneTR commented May 2, 2025

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented May 2, 2025

Uh oh!

Uh oh!

Uh oh!

ArneTR commented May 1, 2025 •

edited by greptile-apps bot

Loading

github-actions bot commented May 1, 2025 •

edited

Loading