Add tests for HFTransformers recipe with static cache #2179

KaelanDt · 2025-06-03T11:15:49Z

Before submitting

Was this discussed/approved via a Github issue? (no need for typos and docs improvements)
Did you read the contributor guideline, Pull Request section?
Did you make sure to update the docs?
Did you write any new necessary tests?

What does this PR do?

Partially fixes #2067 (backward to come in a subsequent PR) by adding a generate test with static cache on Qwen2.5 and Llama3.
the two tests take 1:40 total to run on an L4.

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in Github issues there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

thunder/tests/test_recipes.py

t-vi · 2025-06-03T17:24:39Z

So on our CI it's more like 4 minutes

178.00s call     thunder/tests/test_recipes.py::test_recipe_model_with_cache[Qwen/Qwen2.5-3B]
64.62s call     thunder/tests/test_recipes.py::test_recipe_model_with_cache[unsloth/Llama-3.2-1B]

is there any way to further reduce this?

thunder/tests/test_recipes.py

KaelanDt · 2025-06-03T17:58:27Z

So on our CI it's more like 4 minutes

178.00s call     thunder/tests/test_recipes.py::test_recipe_model_with_cache[Qwen/Qwen2.5-3B]
64.62s call     thunder/tests/test_recipes.py::test_recipe_model_with_cache[unsloth/Llama-3.2-1B]

is there any way to further reduce this?

in fact there is a smaller version of Qwen2.5, so I added that.

Better now @t-vi

45.52s call     thunder/tests/test_recipes.py::test_recipe_model_with_cache[unsloth/Llama-3.2-1B]
42.06s call     thunder/tests/test_recipes.py::test_recipe_model_with_cache[Qwen/Qwen2.5-1.5B]

KaelanDt · 2025-06-05T10:58:56Z

18.20s call     thunder/tests/test_recipes.py::test_recipe_model_with_cache[Qwen2ForCausalLM-Qwen2Config]
17.94s call     thunder/tests/test_recipes.py::test_recipe_model_with_cache[LlamaForCausalLM-LlamaConfig]

t-vi

Supergood, thank you @KaelanDt @Borda

KaelanDt requested review from mruberry, lantiga and t-vi as code owners June 3, 2025 11:15

KaelanDt force-pushed the kaelan/inplace-tests branch from 858d468 to d255e79 Compare June 3, 2025 11:17

Borda reviewed Jun 3, 2025

View reviewed changes

thunder/tests/test_recipes.py Outdated Show resolved Hide resolved

KaelanDt requested a review from Borda June 3, 2025 14:05

Borda approved these changes Jun 3, 2025

View reviewed changes

t-vi reviewed Jun 3, 2025

View reviewed changes

thunder/tests/test_recipes.py Outdated Show resolved Hide resolved

KaelanDt commented Jun 3, 2025

View reviewed changes

thunder/tests/test_recipes.py Outdated Show resolved Hide resolved

KaelanDt commented Jun 3, 2025

View reviewed changes

thunder/tests/test_recipes.py Outdated Show resolved Hide resolved

t-vi added the lightning-l1 label Jun 4, 2025

github-actions bot added documentation Improvements or additions to documentation ci labels Jun 5, 2025

Add static KV cache generate test for Qwen2.5 and LLaMA3

d6f8751

KaelanDt force-pushed the kaelan/inplace-tests branch from 161b347 to d6f8751 Compare June 5, 2025 10:37

github-actions bot removed documentation Improvements or additions to documentation ci labels Jun 5, 2025

t-vi approved these changes Jun 5, 2025

View reviewed changes

t-vi enabled auto-merge (squash) June 5, 2025 11:22

t-vi merged commit 45feb17 into main Jun 5, 2025
49 checks passed

t-vi deleted the kaelan/inplace-tests branch June 5, 2025 11:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add tests for HFTransformers recipe with static cache #2179

Add tests for HFTransformers recipe with static cache #2179

Uh oh!

KaelanDt commented Jun 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

t-vi commented Jun 3, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

KaelanDt commented Jun 3, 2025 •

edited

Loading

Uh oh!

KaelanDt commented Jun 5, 2025

Uh oh!

t-vi left a comment

Uh oh!

Uh oh!

Uh oh!

Add tests for HFTransformers recipe with static cache #2179

Add tests for HFTransformers recipe with static cache #2179

Uh oh!

Conversation

KaelanDt commented Jun 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

PR review

Did you have fun?

Uh oh!

Uh oh!

t-vi commented Jun 3, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

KaelanDt commented Jun 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

KaelanDt commented Jun 5, 2025

Uh oh!

t-vi left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

KaelanDt commented Jun 3, 2025 •

edited

Loading

KaelanDt commented Jun 3, 2025 •

edited

Loading