Skip to content

Pull requests: mlfoundations/evalchemy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add debug mode to 5 benchmarks
#135 opened Jul 1, 2025 by dkimds Loading…
Register OpenLM models at startup
#133 opened Jun 27, 2025 by reinhardh Loading…
LCB official scorer instead of skythoughts
#130 opened Jun 13, 2025 by slimfrkha Loading…
update lm-eval-harness version
#124 opened May 12, 2025 by jannalulu Loading…
Jean/update curator
#115 opened Apr 15, 2025 by jmercat Loading…
Adding Arena Hard Auto
#65 opened Jan 28, 2025 by asuvarna31 Loading…
Multi-node
#29 opened Nov 22, 2024 by jmercat Loading…
ProTip! Add no:assignee to see everything that’s not assigned.