✨ Use shared CachedRequestData as vllm:main #273


Closed
wants to merge 11 commits

Conversation

prashantgupta24
Collaborator

@prashantgupta24 commented Jul 1, 2025

Description

This is more complicated than I originally thought :)

Alright, all tests are now passing locally. There seems to be another breaking change in vllm:main that will have to be addressed to make the main tests pass; the default tests fail because this code is not yet backward compatible 😅

Related Issues

Fixes #271
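
For context, below is a minimal, self-contained sketch of the shape change this PR adapts to: vllm:main now hands the model runner one shared, batched `CachedRequestData` whose per-request fields are parallel lists (so `req_ids` is a list), rather than one cached-request object per request. The stand-in dataclass and its field names are assumptions for illustration only; the authoritative definition lives in vLLM's v1 scheduler output module and may have changed since this PR.

```python
from dataclasses import dataclass, field


@dataclass
class CachedRequestData:  # stand-in for the upstream vLLM class (fields assumed)
    req_ids: list[str] = field(default_factory=list)
    resumed_from_preemption: list[bool] = field(default_factory=list)
    num_computed_tokens: list[int] = field(default_factory=list)


def update_states(requests: dict[str, dict], cached: CachedRequestData) -> None:
    # req_ids is now a list; the per-request fields are parallel lists indexed
    # the same way, instead of one cached-request object per request.
    for idx, req_id in enumerate(cached.req_ids):
        state = requests[req_id]
        state["num_computed_tokens"] = cached.num_computed_tokens[idx]
        state["resumed_from_preemption"] = cached.resumed_from_preemption[idx]


if __name__ == "__main__":
    requests = {"req-0": {}, "req-1": {}}
    cached = CachedRequestData(
        req_ids=["req-0", "req-1"],
        resumed_from_preemption=[False, True],
        num_computed_tokens=[7, 3],
    )
    update_states(requests, cached)
    print(requests)
```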

Signed-off-by: Prashant Gupta <[email protected]>

github-actions bot commented Jul 1, 2025

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: make sure your code passes all the linting checks, otherwise your PR can't be merged. To do so, first install the linting requirements, then run `format.sh` and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

Signed-off-by: Prashant Gupta <[email protected]>
Signed-off-by: Prashant Gupta <[email protected]>
@prashantgupta24 changed the title from "🐛 req_ids is now a list in vllm:main" to "🐛 Use shared CachedRequestData as vllm:main" on Jul 1, 2025
Signed-off-by: Prashant Gupta <[email protected]>
Signed-off-by: Prashant Gupta <[email protected]>
@prashantgupta24 changed the title from "🐛 Use shared CachedRequestData as vllm:main" to "✨ Use shared CachedRequestData as vllm:main" on Jul 2, 2025
Signed-off-by: Prashant Gupta <[email protected]>
Signed-off-by: Prashant Gupta <[email protected]>
Signed-off-by: Prashant Gupta <[email protected]>
@maxdebayser
Collaborator

@prashantgupta24, I think the other breaking change you mentioned is the sampling metadata one, right? I've opened a hacky PR to temporarily fix it: #278

@prashantgupta24
Collaborator Author

> @prashantgupta24, I think the other breaking change you mentioned is the sampling metadata one, right? I've opened a hacky PR to temporarily fix it: #278

Yep, thanks!

Signed-off-by: Prashant Gupta <[email protected]>
@prashantgupta24 mentioned this pull request Jul 4, 2025
@prashantgupta24
Collaborator Author

Closing in favor of #283.

joerunde pushed a commit that referenced this pull request Jul 8, 2025
# Description

This branch has fixes for:
- Caching the token_ids: the new tokens are now cached in `execute_model` instead of `update_states`, because of vllm-project/vllm#20291 (see the sketch after this list).
- Changes from the shared `CachedRequestData` (#273).
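
Below is an illustrative-only sketch of the token-caching move described in the first bullet: after vllm-project/vllm#20291 the newly sampled token ids are no longer fed back to `update_states` through the scheduler's cached-request data, so the model runner appends them to its own per-request state right after sampling inside `execute_model`. All names here (`RequestState`, `cache_sampled_tokens`) are hypothetical and do not mirror the actual vllm-spyre implementation.

```python
from dataclasses import dataclass, field


@dataclass
class RequestState:  # hypothetical per-request state kept by the model runner
    output_token_ids: list[int] = field(default_factory=list)


def cache_sampled_tokens(
    requests: dict[str, RequestState],
    batch_req_ids: list[str],
    sampled_token_ids: list[list[int]],  # newly sampled ids, one entry per request
) -> None:
    # Append the new tokens at sampling time (inside execute_model), instead of
    # waiting for update_states to receive them from the scheduler next step.
    for req_id, new_ids in zip(batch_req_ids, sampled_token_ids):
        requests[req_id].output_token_ids.extend(new_ids)


if __name__ == "__main__":
    requests = {"req-0": RequestState(), "req-1": RequestState()}
    cache_sampled_tokens(requests, ["req-0", "req-1"], [[42], [7]])
    print({rid: st.output_token_ids for rid, st in requests.items()})
```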

## Related Issues

Fix for #271

---------

Signed-off-by: Prashant Gupta <[email protected]>
Signed-off-by: Max de Bayser <[email protected]>
Co-authored-by: Max de Bayser <[email protected]>