feat: Allow prefetching dependencies using a Threadpool #175
base: main
Conversation
…a bottle-neck there
Encord Agents test report: 93 tests, 93 ✅, 1m 58s ⏱️. Results for commit 25f99c4.
with ThreadPoolExecutor() as executor:
    dependency_list = list(
        executor.map(
            lambda context: solve_dependencies(
                context=context, dependant=runner_agent.dependant, stack=stack
            ),
            batch,
        )
    )
I don't see why we'd want to wait for all tasks to be fetched before starting the compute? Can't we just call the agent when iterating the output of the executor.map call rather than wrapping it in list?
One could do. I was just replicating the customer's original behaviour, where all fetching happens ahead of task execution.
Notably, the customer explicitly wanted sequential inference, and if the map included the agent execution as well, we would lose that guarantee.
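For reference, lazy consumption would also preserve sequential inference, since executor.map yields results in submission order while the remaining fetches continue in the background. A minimal sketch, assuming the same batch, runner_agent, stack, and solve_dependencies as in the diff above; the runner_agent.callable invocation is hypothetical:

from concurrent.futures import ThreadPoolExecutor

def run_batch(batch, runner_agent, stack):
    with ThreadPoolExecutor() as executor:
        # Same fetch as in the diff, but consumed lazily instead of via list(...).
        results = executor.map(
            lambda context: solve_dependencies(
                context=context, dependant=runner_agent.dependant, stack=stack
            ),
            batch,
        )
        # map() yields in submission order, so agent execution below stays
        # strictly sequential while later fetches run concurrently.
        for dependencies in results:
            runner_agent.callable(**dependencies.values)  # hypothetical invocation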
        help="Max number of tasks to try to process per stage on a given run. If `None`, will attempt all",
    ),
] = None,
pre_fetch_factor: Annotated[
Not sure the name and functionality match here. A factor is something you multiply by; this seems to be an absolute number, at least based on the docstring.
Could be pre-fetch batch size. My view on "factor" was: if set to x, we perform a (grouped) dependency fetch N/x times rather than N times.
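To make the N/x reading concrete, here is a sketch of the grouping; the batched helper is hypothetical (Python 3.12's itertools.batched offers the same behaviour):

from itertools import islice
from typing import Iterable, Iterator, TypeVar

T = TypeVar("T")

def batched(tasks: Iterable[T], pre_fetch_factor: int) -> Iterator[list[T]]:
    # With N tasks and pre_fetch_factor = x, this yields roughly N / x
    # chunks, i.e. N / x grouped dependency fetches rather than N.
    iterator = iter(tasks)
    while chunk := list(islice(iterator, pre_fetch_factor)):
        yield chunk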
except Exception:
    print(f"[attempt {attempt+1}/{num_retries+1}] Agent failed with error: ")
    traceback.print_exc()

with ThreadPoolExecutor() as executor:
Should we not specify how many threads (perhaps even expose it as an argument)?
We could do. By default, ThreadPoolExecutor picks the worker count based on the number of CPUs. I didn't necessarily want to expand the interface too much, but I definitely see your point.
An endless amount of effort could be put into this. Ideally there would be separate fetch and execute queues, and ideally we could mark our dependencies as async so that while one fetch is waiting on I/O, another can proceed.
This is just one approach.
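A sketch combining both ideas, with hypothetical fetch_dependencies and run_agent stand-ins: an explicit max_workers argument for the fetch pool, and a bounded queue that separates fetching from strictly sequential execution:

import queue
import threading
from concurrent.futures import ThreadPoolExecutor

def fetch_dependencies(task):  # stand-in for the real dependency fetch
    return {"task": task}

def run_agent(deps):  # stand-in for the sequential agent execution
    print("running", deps)

def run_pipeline(tasks, max_workers: int | None = None, prefetch: int = 8) -> None:
    fetched: queue.Queue = queue.Queue(maxsize=prefetch)
    sentinel = object()

    def fetcher() -> None:
        # max_workers=None keeps ThreadPoolExecutor's default of
        # min(32, os.cpu_count() + 4) workers (Python 3.8+).
        with ThreadPoolExecutor(max_workers=max_workers) as executor:
            for deps in executor.map(fetch_dependencies, tasks):
                # The bounded put limits how many completed results can
                # pile up ahead of the sequential consumer.
                fetched.put(deps)
        fetched.put(sentinel)

    threading.Thread(target=fetcher, daemon=True).start()
    # The consumer side stays strictly sequential.
    while (deps := fetched.get()) is not sentinel:
        run_agent(deps)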