Duplicate the SamplingMetadata class #278

maxdebayser · 2025-07-03T17:13:42Z

We were previously reusing the GPU SamplingMetadata class but there have been incompatible changes upstream (PR vllm-project/vllm#16728)

Since it's not clear for now whether we want, should or can reuse the LogitsProcessor implementation as is, I'm making temporarily making a copy of the old versions of the files that we need for the spyre backend.

This won't affect any features for now since the vllm change was an internal refactoring without UX impact.

Follow-up issue to give a definitive solution: https://github.ibm.com/ai-foundation/aiu-app-sw-tracker/issues/804

We were previously reusing the GPU SamplingMetadata class but there have been incompatible changes upstream (PR vllm-project/vllm#16728) Since it's not clear for now whether we want, should or can reuse the LogitsProcessor implementation as is, I'm making a copy of the old version of the class for the spyre backend. This won't affect any features for now since the vllm change was an internal refactoring without UX impact. Signed-off-by: Max de Bayser <[email protected]>

github-actions · 2025-07-03T17:13:49Z

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

Signed-off-by: Max de Bayser <[email protected]>

yannicks1

If this is the preferred way to tackle these breaking changes upstream, then it LGTM.

yannicks1 · 2025-07-04T14:38:26Z

vllm_spyre/v1/sample/metadata.py

+# This is a copy of the vLLM vllm file prior to PR
+# https://github.com/vllm-project/vllm/pull/16728


duplicate lines 3-4

joerunde · 2025-07-07T16:04:36Z

We were previously reusing the GPU SamplingMetadata class

I don't think that class is gpu-specific though, right? It's the sampling metadata used by all in-tree platforms supported by vllm

maxdebayser · 2025-07-07T16:30:29Z

I don't think that class is gpu-specific though, right? It's the sampling metadata used by all in-tree platforms supported by vllm

No, for example, in the TPU it's a different class: vllm/v1/sample/tpu/metadata.py

prashantgupta24

Any chance we can rebase this to get merged to vllm-main-updates instead of main?

maxdebayser · 2025-07-07T19:14:30Z

Any chance we can rebase this to get merged to vllm-main-updates instead of main?

Sorry, I'm not sure I understand. These commits here are already in vllm-main-updates, rght?

prashantgupta24 · 2025-07-07T19:17:08Z

Any chance we can rebase this to get merged to vllm-main-updates instead of main?

Sorry, I'm not sure I understand. These commits here are already in vllm-main-updates, rght?

You're right, I said that just in case we wanted to merge any new changes

Signed-off-by: Max de Bayser <[email protected]>

This reverts commit 785a5d5.

maxdebayser requested review from yannicks1, tdoublep, nikolaospapandreou and sducouedic as code owners July 3, 2025 17:13

maxdebayser added 5 commits July 3, 2025 14:24

fix linting

425c3d2

Signed-off-by: Max de Bayser <[email protected]>

Actually more classes need to be duplicated

05ea423

Signed-off-by: Max de Bayser <[email protected]>

import the right sampler

6e1f712

Signed-off-by: Max de Bayser <[email protected]>

fix tests

c614eb1

Signed-off-by: Max de Bayser <[email protected]>

fix tests

a3b37c4

Signed-off-by: Max de Bayser <[email protected]>

maxdebayser requested review from rafvasq and prashantgupta24 as code owners July 3, 2025 17:53

maxdebayser mentioned this pull request Jul 3, 2025

✨ Use shared CachedRequestData as vllm:main #273

Closed

prashantgupta24 mentioned this pull request Jul 4, 2025

vllm main updates #283

Merged

yannicks1 reviewed Jul 4, 2025

View reviewed changes

prashantgupta24 reviewed Jul 7, 2025

View reviewed changes

Remove duplicated comment

4e19fd0

Signed-off-by: Max de Bayser <[email protected]>

prashantgupta24 approved these changes Jul 8, 2025

View reviewed changes

joerunde enabled auto-merge (squash) July 8, 2025 17:12

github-actions bot added the ready label Jul 8, 2025

joerunde merged commit 785a5d5 into main Jul 8, 2025
17 of 19 checks passed

joerunde deleted the fix_sampling_metadata branch July 8, 2025 17:17

maxdebayser added a commit that referenced this pull request Jul 8, 2025

Revert "Duplicate the SamplingMetadata class (#278)"

eda6f58

This reverts commit 785a5d5.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Duplicate the SamplingMetadata class #278

Duplicate the SamplingMetadata class #278

Uh oh!

maxdebayser commented Jul 3, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Jul 3, 2025

Uh oh!

yannicks1 left a comment

Uh oh!

yannicks1 Jul 4, 2025

Uh oh!

joerunde commented Jul 7, 2025

Uh oh!

maxdebayser commented Jul 7, 2025

Uh oh!

prashantgupta24 left a comment

Uh oh!

maxdebayser commented Jul 7, 2025

Uh oh!

prashantgupta24 commented Jul 7, 2025

Uh oh!

Uh oh!

Uh oh!

		# This is a copy of the vLLM vllm file prior to PR
		# https://github.com/vllm-project/vllm/pull/16728

Duplicate the SamplingMetadata class #278

Duplicate the SamplingMetadata class #278

Uh oh!

Conversation

maxdebayser commented Jul 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jul 3, 2025

Uh oh!

yannicks1 left a comment

Choose a reason for hiding this comment

Uh oh!

yannicks1 Jul 4, 2025

Choose a reason for hiding this comment

Uh oh!

joerunde commented Jul 7, 2025

Uh oh!

maxdebayser commented Jul 7, 2025

Uh oh!

prashantgupta24 left a comment

Choose a reason for hiding this comment

Uh oh!

maxdebayser commented Jul 7, 2025

Uh oh!

prashantgupta24 commented Jul 7, 2025

Uh oh!

Uh oh!

Uh oh!

maxdebayser commented Jul 3, 2025 •

edited

Loading