Conversation

@rdspring1
Collaborator

This PR enables an LRU Fusion Cache decorator for direct bindings to address issue #2700.

Legacy Fusion Cache
iter 0: loss 0.1641, iter time: 102852.81ms, t: 4096
iter 1: loss 0.1196, iter time: 20309.14ms, t: 4096
iter 2: loss 0.0771, iter time: 19994.30ms, t: 4096

ToT Direct w/o LRU Cache
iter 0: loss 0.1641, iter time: 204575.74ms, t: 4096
iter 1: loss 0.1196, iter time: 22914.21ms, t: 4096
iter 2: loss 0.0771, iter time: 20006.35ms, t: 4096

Direct with LRU Cache
iter 0: loss 0.1641, iter time: 115281.74ms, t: 4096
iter 1: loss 0.1196, iter time: 20228.57ms, t: 4096
iter 2: loss 0.0771, iter time: 19998.50ms, t: 4096
  • The LRU-cached direct path is ~12% slower on iter 0 than the legacy cache
    because the equality check was extended to cover block-scale operations,
    which create non-trivial allocation domains.
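To make the decorator pattern concrete, here is a minimal sketch of how an LRU fusion-cache decorator can work. This is a hypothetical stand-in written for illustration, not the actual `nvfuser_direct.LruFusionCache` implementation; the real cache keys on the fusion definition rather than raw call arguments.

```python
from collections import OrderedDict


class LruFusionCache:
    """Sketch of an LRU cache decorator (hypothetical stand-in for the
    real nvfuser_direct.LruFusionCache)."""

    def __init__(self, max_fusions=16384):
        self.max_fusions = max_fusions
        self.cache = OrderedDict()  # insertion order tracks recency

    def __call__(self, func):
        def wrapper(*args):
            # Real fusion caches key on the fusion definition; plain
            # positional args are used here to keep the sketch simple.
            if args in self.cache:
                self.cache.move_to_end(args)  # mark as most recently used
                return self.cache[args]
            result = func(*args)
            self.cache[args] = result
            if len(self.cache) > self.max_fusions:
                self.cache.popitem(last=False)  # evict least recently used
            return result

        return wrapper
```

Bounding the cache at `max_fusions` entries trades recompilation of evicted fusions for a cap on host memory, which is why a large-but-finite default is chosen.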

@rdspring1 rdspring1 marked this pull request as ready for review December 17, 2025 16:58
@kshitij12345 (Collaborator) left a comment:

LGTM, thanks @rdspring1

 # by nvfuserex.py when nvFuser is available.

-DIRECT_BINDINGS_SUPPORTED_VERSION = LooseVersion("0.2.34")
+DIRECT_BINDINGS_SUPPORTED_VERSION = LooseVersion("0.2.35")
Collaborator:

Is this the minimum version which ships with LruFusionCache?

Collaborator (Author):

Yes

-    return func
+    from nvfuser_direct import LruFusionCache
+
+    return LruFusionCache(max_fusions=16384)(func)
Collaborator:

The default value for max_fusions is already 16384, so we can drop the explicit argument here.

Collaborator:

Quick question: will there be an option to set the cache size, or would that not be useful?

Collaborator (Author):

Usually we pick a reasonable number to avoid out-of-memory issues; we have not seen a need to change it at runtime.
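Putting the two diffs above together, the version-gated wrapping can be sketched as follows. This is a simplified illustration: `parse_version` is a crude stand-in for `LooseVersion`, `maybe_wrap_with_lru_cache` and the `installed_version` parameter are hypothetical names, and the real check lives in nvfuserex.py.

```python
def parse_version(v: str) -> tuple:
    # Crude stand-in for LooseVersion, sufficient for "0.2.35"-style strings.
    return tuple(int(p) for p in v.split("."))


# Minimum nvfuser_direct version that ships LruFusionCache,
# per the review thread above.
DIRECT_BINDINGS_SUPPORTED_VERSION = parse_version("0.2.35")


def maybe_wrap_with_lru_cache(func, installed_version: str):
    """Hypothetical gate: wrap func in the LRU fusion cache only when the
    installed nvfuser_direct is new enough to provide it."""
    if parse_version(installed_version) < DIRECT_BINDINGS_SUPPORTED_VERSION:
        return func  # older bindings: leave the function unwrapped
    from nvfuser_direct import LruFusionCache  # available from 0.2.35

    # Per the review comment, the default max_fusions is already 16384,
    # so no explicit argument is needed.
    return LruFusionCache()(func)
```

Returning the original function unchanged on older versions keeps the executor working, just without the cross-call fusion cache.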

