Actions: EleutherAI/lm-evaluation-harness

Showing runs from all workflows
4,066 workflow runs

Fix mmlu_continuation subgroup names to fit Readme and other variants
Unit Tests #5009: Pull request #3137 opened by lamalunderscore
July 11, 2025 14:30 Action required lamalunderscore:main
Add tasklist
Unit Tests #5008: Pull request #3133 synchronize by baberabb
July 11, 2025 14:26 6m 25s tasklist
Add tasklist
Tasks Modified #5036: Pull request #3133 synchronize by baberabb
July 11, 2025 14:26 2m 10s tasklist
Add tasklist
Unit Tests #5004: Pull request #3133 opened by baberabb
July 11, 2025 11:01 4m 43s tasklist
Add tasklist
Tasks Modified #5032: Pull request #3133 opened by baberabb
July 11, 2025 11:01 1m 37s tasklist
when using vllm with lora, it will have some mistakes, now i fix it.
Unit Tests #5003: Pull request #3132 opened by Jacky-MYQ
July 11, 2025 10:26 Action required Jacky-MYQ:main
when using vllm with lora, it will have some mistakes, now i fix it.
Tasks Modified #5031: Pull request #3132 opened by Jacky-MYQ
July 11, 2025 10:26 Action required Jacky-MYQ:main
FixBug: Fix the wrong configs for gpqa_cot_n_shot
Tasks Modified #5030: Pull request #3131 synchronize by Summer-Summer
July 11, 2025 10:14 Action required Summer-Summer:fix-gpqa
FixBug: Fix the wrong configs for gpqa_cot_n_shot
Unit Tests #5002: Pull request #3131 synchronize by Summer-Summer
July 11, 2025 10:14 Action required Summer-Summer:fix-gpqa
FixBug: Fix the wrong configs for gpqa_cot_n_shot
Tasks Modified #5029: Pull request #3131 opened by Summer-Summer
July 11, 2025 02:21 1m 57s Summer-Summer:fix-gpqa
add kwargs passing into filters
Tasks Modified #5028: Pull request #3036 synchronize by artemorloff
July 10, 2025 19:55 1m 36s artemorloff:inference_filters
add kwargs passing into filters
Unit Tests #5000: Pull request #3036 synchronize by artemorloff
July 10, 2025 19:55 4m 51s artemorloff:inference_filters
fix: remove warning (#3128)
Tasks Modified #5027: Commit fcddf19 pushed by baberabb
July 10, 2025 14:53 14s main
fix: remove warning (#3128)
Unit Tests #4999: Commit fcddf19 pushed by baberabb
July 10, 2025 14:53 5m 3s main
fix: remove warning
Unit Tests #4998: Pull request #3128 opened by baberabb
July 10, 2025 14:46 4m 54s ll
fix: remove warning
Tasks Modified #5026: Pull request #3128 opened by baberabb
July 10, 2025 14:46 11s ll
warning for "chat" pretrained; disable buggy evalita configs (#3127)
Unit Tests #4997: Commit f3a0b55 pushed by baberabb
July 10, 2025 14:44 5m 12s main
warning for "chat" pretrained; disable buggy evalita configs (#3127)
Tasks Modified #5025: Commit f3a0b55 pushed by baberabb
July 10, 2025 14:44 1m 51s main
warning for "chat" pretrained; disable buggy evalita configs
Tasks Modified #5024: Pull request #3127 synchronize by baberabb
July 10, 2025 14:30 1m 57s war