Integrate upstream logits processors #290

maxdebayser · 2025-07-08T19:47:54Z

At first it wasn't obvious whether the changes from PR vllm-project/vllm#16728 would be easy to integrate, so I initially added a PR that copies the sampler files from before that PR into vllm-spyre. It actually turns out to be easier than I thought, because the sampler code is not compiled to the AIU; only the model forward pass is.

Currently the MinP processor keeps one tensor for the CPU and one for the device. Since only the model forward pass runs on the AIU, both tensors end up on the CPU. This means there is an unnecessary copy from one to the other, but the result is still correct.

There is a future upstream PR that will generalize the Logits processor to other sampling parameters:

vllm-project/vllm#19912

github-actions · 2025-07-08T19:48:03Z

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

The changes introduced by PR vllm-project/vllm#16728 to the sampler architecture were incompatible with our spyre model runner. Initially, as a stopgap solution, I copied the old sampling classes into our vllm_spyre tree just so that we could keep working on the latest changes from main. Now this commit reverts that and makes the same logits processor logic work for the spyre input batch and model runner classes. The difference from the GPU model runner is that in spyre we don't condense the batch; instead we have a boolean mask that is used to calculate "dense" request indices. These indices must be used for the BatchUpdateBuilder because they are the right ones to slice the `logits` tensor that is passed to the Sampler.

Signed-off-by: Max de Bayser <[email protected]>
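To illustrate the "dense" request indices mentioned in the commit message, here is a minimal sketch in plain Python. It is not the vllm-spyre implementation: the function name `dense_indices` and the list-of-booleans mask are hypothetical stand-ins, assuming that each occupied batch slot maps to the row it would occupy in a `logits` tensor that contains only the active requests.

```python
from itertools import accumulate


def dense_indices(active_mask: list[bool]) -> dict[int, int]:
    """Map each occupied batch slot to its row in the dense logits tensor.

    `active_mask[i]` is True when slot `i` currently holds a request.
    (Hypothetical helper for illustration only.)
    """
    # Running count of occupied slots up to and including each position.
    counts = list(accumulate(int(m) for m in active_mask))
    # An occupied slot's dense index is the number of occupied slots before it.
    return {slot: counts[slot] - 1 for slot, m in enumerate(active_mask) if m}


# Slots 1 and 4 are empty, so the three active requests occupy rows 0..2.
mapping = dense_indices([True, False, True, True, False])
print(mapping)
```

With a mapping like this, a per-request update for slot 3 would be recorded against row 2, which is the row that request actually occupies in the sliced `logits`.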

joerunde · 2025-07-09T19:03:26Z

bot:test

prashantgupta24 · 2025-07-09T19:19:52Z

bot:test
MARKERS="cb and spyre"

maxdebayser · 2025-07-09T19:37:26Z

@joerunde, I've opened an issue for the follow-up PR that will add tests: #295

prashantgupta24 · 2025-07-09T20:13:20Z

bot:test
MARKERS="cb and spyre"

prashantgupta24 · 2025-07-09T20:14:45Z

bot:test

prashantgupta24 · 2025-07-09T20:55:43Z

bot:test
MARKERS="cb and spyre"

prashantgupta24 · 2025-07-09T21:01:53Z

bot:test

joerunde · 2025-07-09T21:24:51Z

bot:test

prashantgupta24 · 2025-07-09T21:44:33Z

bot:test
MARKERS="cb and spyre"

maxdebayser · 2025-07-14T19:44:25Z

bot:test

maxdebayser · 2025-07-14T20:06:05Z

For reference: test run 129 executed all tests successfully.

maxdebayser requested review from rafvasq, prashantgupta24, sducouedic, yannicks1, tdoublep and nikolaospapandreou as code owners July 8, 2025 19:47

maxdebayser force-pushed the logits_processors branch from ca0c89b to 052b28d Compare July 9, 2025 16:09

maxdebayser mentioned this pull request Jul 9, 2025

Add tests for logits processor correctness #295

Open

Merge branch 'main' into logits_processors

9a314b5

Merge branch 'main' into logits_processors

e11b8b0

Merge branch 'main' into logits_processors

5b39a57

prashantgupta24 approved these changes Jul 14, 2025

View reviewed changes

joerunde merged commit 5975e98 into main Jul 14, 2025
14 of 18 checks passed

joerunde deleted the logits_processors branch July 14, 2025 20:05
