Skip to content

Commit 9ef2090

Browse files
committed
x
Signed-off-by: SumanthRH <[email protected]>
1 parent 5433333 commit 9ef2090

File tree

3 files changed

+18
-9
lines changed

3 files changed

+18
-9
lines changed

skythought/evals/tasks/aime/aime25.yaml renamed to skythought/evals/tasks/aime/aime25_1.yaml

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,7 @@
11
handler: aime
22
dataset_path: opencompass/AIME2025
3-
dataset_split: train
3+
dataset_subset: AIME2025-I
4+
dataset_split: test
45
question_key: question
56
answer_key: answer
67
templating_parameters:
Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,9 @@
1+
handler: aime
2+
dataset_path: opencompass/AIME2025
3+
dataset_subset: AIME2025-II
4+
dataset_split: test
5+
question_key: question
6+
answer_key: answer
7+
templating_parameters:
8+
template: "{prompt}\nReturn your final response within \\boxed{{}}"
9+

skythought/evals/tasks/apps/apps_handler.py

Lines changed: 7 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -36,14 +36,6 @@ def generate_prompt(self, problem):
3636
def check_correctness(self, problem, generation):
3737
TIMEOUT = 10
3838

39-
def _temp_run(problem, generation, debug, result):
40-
try:
41-
result.append(
42-
apps_run_test(problem=problem, test=generation, debug=debug)
43-
)
44-
except Exception:
45-
pass
46-
4739
manager = Manager()
4840
result = manager.list()
4941
p = multiprocessing.Process(
@@ -100,3 +92,10 @@ def load_and_filter_dataset(
10092
train_data = train_data.to_pandas()
10193

10294
return train_data.iloc[start:end] if end > 0 else train_data.iloc[start:]
95+
96+
97+
def _temp_run(problem, generation, debug, result):
98+
try:
99+
result.append(apps_run_test(problem=problem, test=generation, debug=debug))
100+
except Exception:
101+
pass

0 commit comments

Comments
 (0)