[MMLU redux] Do not use samples which do not have error_type="ok" (…
#5532
| Job | Run time |
|---|---|
| 25s | |
| 5m 5s | |
| 4m 27s | |
| 4m 58s | |
| 14m 55s |
error_type="ok" (…
#5532
| Job | Run time |
|---|---|
| 25s | |
| 5m 5s | |
| 4m 27s | |
| 4m 58s | |
| 14m 55s |