Skip to content

Actions: EleutherAI/lm-evaluation-harness

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
5,685 workflow runs
5,685 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

add llama3 tasks
Unit Tests #4081: Pull request #2556 synchronize by baberabb
January 22, 2025 00:16 In progress llama
January 22, 2025 00:16 In progress
add llama3 tasks
Tasks Modified #4109: Pull request #2556 synchronize by baberabb
January 22, 2025 00:16 2m 27s llama
January 22, 2025 00:16 2m 27s
add llama3 tasks
Unit Tests #4080: Pull request #2556 synchronize by baberabb
January 21, 2025 23:44 6m 45s llama
January 21, 2025 23:44 6m 45s
add llama3 tasks
Tasks Modified #4108: Pull request #2556 synchronize by baberabb
January 21, 2025 23:44 2m 5s llama
January 21, 2025 23:44 2m 5s
add llama3 tasks
Tasks Modified #4107: Pull request #2556 synchronize by baberabb
January 21, 2025 23:38 3m 1s llama
January 21, 2025 23:38 3m 1s
add llama3 tasks
Unit Tests #4079: Pull request #2556 synchronize by baberabb
January 21, 2025 23:38 6m 59s llama
January 21, 2025 23:38 6m 59s
add llama3 tasks
Unit Tests #4078: Pull request #2556 synchronize by baberabb
January 21, 2025 22:18 6m 57s llama
January 21, 2025 22:18 6m 57s
add llama3 tasks
Tasks Modified #4106: Pull request #2556 synchronize by baberabb
January 21, 2025 22:18 2m 23s llama
January 21, 2025 22:18 2m 23s
add llama3 tasks
Unit Tests #4077: Pull request #2556 synchronize by baberabb
January 21, 2025 22:08 7m 9s llama
January 21, 2025 22:08 7m 9s
add llama3 tasks
Tasks Modified #4105: Pull request #2556 synchronize by baberabb
January 21, 2025 22:08 2m 4s llama
January 21, 2025 22:08 2m 4s
add llama3 tasks
Unit Tests #4076: Pull request #2556 synchronize by baberabb
January 21, 2025 22:06 7m 44s llama
January 21, 2025 22:06 7m 44s
add llama3 tasks
Tasks Modified #4104: Pull request #2556 synchronize by baberabb
January 21, 2025 22:06 1m 49s llama
January 21, 2025 22:06 1m 49s
add llama3 tasks
Tasks Modified #4103: Pull request #2556 synchronize by baberabb
January 21, 2025 22:06 1m 50s llama
January 21, 2025 22:06 1m 50s
add llama3 tasks
Unit Tests #4075: Pull request #2556 synchronize by baberabb
January 21, 2025 22:06 7m 38s llama
January 21, 2025 22:06 7m 38s
add llama3 tasks
Unit Tests #4074: Pull request #2556 synchronize by baberabb
January 21, 2025 22:00 7m 31s llama
January 21, 2025 22:00 7m 31s
add llama3 tasks
Tasks Modified #4102: Pull request #2556 synchronize by baberabb
January 21, 2025 22:00 1m 51s llama
January 21, 2025 22:00 1m 51s
Easily evaluate models steered by SAEs
Tasks Modified #4101: Pull request #2641 synchronize by AMindToThink
January 21, 2025 20:57 Action required AMindToThink:sae_steered
January 21, 2025 20:57 Action required
Easily evaluate models steered by SAEs
Unit Tests #4073: Pull request #2641 synchronize by AMindToThink
January 21, 2025 20:57 Action required AMindToThink:sae_steered
January 21, 2025 20:57 Action required
add llama3 tasks
Unit Tests #4072: Pull request #2556 synchronize by baberabb
January 21, 2025 17:27 7m 18s llama
January 21, 2025 17:27 7m 18s
add llama3 tasks
Tasks Modified #4100: Pull request #2556 synchronize by baberabb
January 21, 2025 17:27 2m 26s llama
January 21, 2025 17:27 2m 26s
add llama3 tasks
Tasks Modified #4099: Pull request #2556 synchronize by baberabb
January 21, 2025 17:22 2m 3s llama
January 21, 2025 17:22 2m 3s
add llama3 tasks
Unit Tests #4071: Pull request #2556 synchronize by baberabb
January 21, 2025 17:22 7m 9s llama
January 21, 2025 17:22 7m 9s
Fix max_tokens handling in vllm_vlms.py (#2637)
Unit Tests #4070: Commit 370e2f9 pushed by baberabb
January 21, 2025 16:55 6m 59s main
January 21, 2025 16:55 6m 59s
Fix max_tokens handling in vllm_vlms.py (#2637)
Tasks Modified #4098: Commit 370e2f9 pushed by baberabb
January 21, 2025 16:55 12s main
January 21, 2025 16:55 12s
aggregate by group (total and categories) (#2643)
Tasks Modified #4097: Commit b2c090c pushed by baberabb
January 21, 2025 16:48 7m 16s main
January 21, 2025 16:48 7m 16s