Fix get_problem_ids to respect instance_ids parameter in SWE environments #179

Copilot · 2025-07-10T13:17:50Z

The get_problem_ids method in SWESmithEnv and SWEBenchEnv was not respecting the instance_ids parameter when returning all available problems. This caused issues when users tried to filter tasks using the CLI parameter -p base.env_kwargs.instance_ids=["task1", "task2"].

Problem

When running scripts/run.py with instance filtering:

python scripts/run.py config.yaml -p base.env_kwargs.instance_ids='["task1", "task2"]'

The environment would correctly load only the specified tasks in the dataset, but get_problem_ids("all") would ignore the instance_ids parameter and return all available problems instead of just the filtered ones. This caused the script to attempt running all problems, leading to errors when accessing tasks that weren't loaded.

Solution

Modified the get_problem_ids method in both environments to filter results by instance_ids when provided:

def get_problem_ids(self, split_or_problem_id):
    if split_or_problem_id == "all":
        all_tasks = sorted(k for k in self.dataset.keys() if k not in self.excluded_ids)
        # If instance_ids is provided, filter to only include those tasks
        if self.instance_ids is not None:
            all_tasks = [task for task in all_tasks if task in self.instance_ids]
        return all_tasks
    # ... rest of method unchanged

Changes

SWESmithEnv: Added instance_ids filtering while preserving exclusion logic
SWEBenchEnv: Added instance_ids filtering for consistent behavior
Tests: Added comprehensive test cases for both environments covering edge cases
Backward compatibility: When instance_ids=None, behavior remains unchanged

Testing

The fix handles various scenarios correctly:

Normal filtering: instance_ids=["task1", "task2"] returns only those tasks
Non-existent tasks: instance_ids=["task1", "missing"] returns ["task1"]
Excluded tasks: Respects exclusion rules in SWESmithEnv
Backward compatibility: instance_ids=None returns all available tasks

Fixes #178.

💬 Share your feedback on Copilot coding agent for the chance to win a $200 gift card! Click here to start the survey.

MarcCote · 2025-07-10T18:13:18Z

@copilot we want to remove instance_ids completley and only rely on "problems" to know which docker images to download.

Copilot · 2025-07-10T18:23:40Z

@copilot we want to remove instance_ids completley and only rely on "problems" to know which docker images to download.

I've removed the instance_ids parameter completely from both SWEBenchEnv and SWESmithEnv and replaced it with a problems parameter. This change ensures that docker images are downloaded only for the specific problems that will be run, rather than being filtered at the environment level through instance_ids. The change is in commit 3daffaa.

Co-authored-by: sordonia <[email protected]>

Co-authored-by: MarcCote <[email protected]>

MarcCote

I pushed changes that implement the fix properly. @copilot

matheper

LGTM

Copilot AI assigned Copilot and sordonia Jul 10, 2025

Copilot started work on behalf of sordonia July 10, 2025 13:17 View session

Copilot AI changed the title ~~[WIP] merge "base.problems" with "env_kwargs.instance_ids"~~ Fix get_problem_ids to respect instance_ids parameter in SWE environments Jul 10, 2025

Copilot AI requested a review from sordonia July 10, 2025 13:28

Copilot finished work on behalf of sordonia July 10, 2025 13:28

Copilot started work on behalf of MarcCote July 10, 2025 18:13 View session

Copilot AI requested a review from MarcCote July 10, 2025 18:28

Copilot finished work on behalf of MarcCote July 10, 2025 18:28

Copilot AI and others added 5 commits July 21, 2025 06:33

Initial plan

e0b951f

Fix get_problem_ids to respect instance_ids parameter

ee39d32

Co-authored-by: sordonia <[email protected]>

Remove instance_ids parameter and replace with problems parameter

ccc7776

Co-authored-by: MarcCote <[email protected]>

Refactor how load_dataset works and problems are being filtered

cded3a8

Fix tests and missing ones

4f0128f

MarcCote force-pushed the copilot/fix-178 branch from 3daffaa to 4f0128f Compare July 21, 2025 18:05

MarcCote reviewed Jul 21, 2025

View reviewed changes

MarcCote marked this pull request as ready for review July 21, 2025 18:08

sordonia approved these changes Jul 21, 2025

View reviewed changes

Merge branch 'main' into copilot/fix-178

8bef1a4

matheper approved these changes Jul 21, 2025

View reviewed changes

MarcCote merged commit 27efbe8 into main Jul 22, 2025
6 checks passed

MarcCote deleted the copilot/fix-178 branch July 22, 2025 12:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix get_problem_ids to respect instance_ids parameter in SWE environments #179

Fix get_problem_ids to respect instance_ids parameter in SWE environments #179

Uh oh!

Copilot AI commented Jul 10, 2025 •

edited

Loading

Uh oh!

MarcCote commented Jul 10, 2025

Uh oh!

Copilot AI commented Jul 10, 2025

Uh oh!

MarcCote left a comment

Uh oh!

matheper left a comment

Uh oh!

Uh oh!

Uh oh!

Fix get_problem_ids to respect instance_ids parameter in SWE environments #179

Fix get_problem_ids to respect instance_ids parameter in SWE environments #179

Uh oh!

Conversation

Copilot AI commented Jul 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Solution

Changes

Testing

Uh oh!

MarcCote commented Jul 10, 2025

Uh oh!

Copilot AI commented Jul 10, 2025

Uh oh!

MarcCote left a comment

Choose a reason for hiding this comment

Uh oh!

matheper left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI commented Jul 10, 2025 •

edited

Loading