🍱 Swap tests to tiny granite #264
Conversation
👋 Hi! Thank you for contributing to vLLM support on Spyre.
@joerunde do we know how to fix the failing tests? looks like something with the tokenizer is off?
@yannicks1 yeah I think I have this almost working, just not sure if the cache is correct based on how slow it's going right now
hmmm, maybe the abort test is just too slow now for static batching? 🤔
I wonder if we can unset the HF_HUB_OFFLINE variable everywhere with this model
Trying it out
Works! Not sure if it's a good idea though; will HF try to download a new version on each test run?
probably!
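
For context, a minimal pytest sketch of the idea discussed above: unsetting HF_HUB_OFFLINE for a single test via `monkeypatch`. The fixture and test names here are hypothetical, not from this PR; `HF_HUB_OFFLINE` is the standard Hugging Face Hub offline-mode switch.

```python
import pytest
from transformers import AutoTokenizer

@pytest.fixture
def allow_hub_downloads(monkeypatch):
    # Unset HF_HUB_OFFLINE for this test only; while the variable is set,
    # huggingface_hub refuses all network access.
    monkeypatch.delenv("HF_HUB_OFFLINE", raising=False)

def test_tokenizer_downloads(allow_hub_downloads):
    # With the flag unset, the tokenizer can be fetched from (and cached by) the Hub.
    tok = AutoTokenizer.from_pretrained("ibm-ai-platform/micro-g3.3-8b-instruct-1b")
    assert tok is not None
```

Cached files are reused across runs, but the Hub may still be checked for a newer revision on each call, which is the concern raised above; pinning a `revision=` in `from_pretrained` would avoid silently picking up a new version.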
bot:test
Description
This PR swaps our default test decoder model from llama-160m to the micro granite 3.3 model: https://huggingface.co/ibm-ai-platform/micro-g3.3-8b-instruct-1b
Static batching tests run too slowly on CPU with the granite model though, so we've overridden them to continue using the llama model for CPU tests on GitHub. 🤞 the static batching code and tests will be removed shortly in an upcoming release.
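
A minimal sketch of what this override could look like in a shared test helper. The helper function and the CPU-CI flag are hypothetical, and the llama-160m repo id is an assumption rather than something stated in this PR; only the granite model id comes from the description above.

```python
import os

GRANITE_MICRO = "ibm-ai-platform/micro-g3.3-8b-instruct-1b"  # new default (from this PR)
LLAMA_160M = "JackFram/llama-160m"  # assumed repo id for the previous default

def pick_decoder_model(static_batching_test: bool) -> str:
    """Pick the test decoder model, keeping llama-160m for static batching on CPU CI."""
    on_cpu_ci = os.getenv("CPU_CI") == "1"  # hypothetical flag set by the GitHub CPU runners
    if static_batching_test and on_cpu_ci:
        # Granite is too slow for the static batching tests on CPU runners.
        return LLAMA_160M
    return GRANITE_MICRO
```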
Related Issues