Fix LiteLLM split iteration in greedy_until to avoid duplicate API requests by dyurchenko98 · Pull Request #1177 · huggingface/lighteval

dyurchenko98 · 2026-03-05T14:56:08Z

Summary

This PR fixes a regression in LiteLLMClient.greedy_until where contexts were built from the full dataset inside the split loop, instead of the current split.

Root Cause

In the split loop in litellm_model.py, this line used full-dataset iteration:

contexts = [self.prompt_manager.prepare_prompt_api(doc) for doc in dataset]

Because of that, for S splits and N docs, LiteLLM sent ~N*S requests instead of N.

Changes

Updated split loop context construction to use split-local docs:

contexts = [self.prompt_manager.prepare_prompt_api(doc) for doc in split]

Added regression test:
tests/unit/models/endpoints/test_litellm_split_iteration.py
Verifies API call inputs are split-local and total sent requests equals total docs (no redundant requests).

Why this matters

Prevents redundant API calls and extra cost.
Ensures split-level params (generation_size, use_logits, num_samples, stop_sequences) are applied to the correct group.
Prevents silently dropped extra responses after get_original_order.

fix: fix litellm splits iteration issue

96b5f4b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix LiteLLM split iteration in greedy_until to avoid duplicate API requests#1177

Fix LiteLLM split iteration in greedy_until to avoid duplicate API requests#1177
dyurchenko98 wants to merge 1 commit intohuggingface:mainfrom
dyurchenko98:fix/litellm_split_iteration

dyurchenko98 commented Mar 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

dyurchenko98 commented Mar 5, 2026

Summary

Root Cause

Changes

Why this matters

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant