-
Notifications
You must be signed in to change notification settings - Fork 55
Pull requests: mlfoundations/evalchemy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: add type check and list conversion for LiveCodeBenchs
#137
opened Jul 1, 2025 by
dkimds
Loading…
Optimize Evaluation Workflow for Better Batching and Model Reuse For benchmarks with n_repeat > 1
#125
opened May 27, 2025 by
ihebchaa
Loading…
Fix: truncate model identifier in case model name is too long
#120
opened May 2, 2025 by
younesbelkada
Loading…
Support for Big Bench Extra Hard (General-purpose reasoning eval)
#92
opened Mar 8, 2025 by
Hritikbansal
Loading…
ProTip!
Add no:assignee to see everything that’s not assigned.