[nvfuser executor] Allow sm89 for fp8 types by crcrpar · Pull Request #1576 · Lightning-AI/lightning-thunder

crcrpar · 2024-12-20T13:20:17Z

What does this PR do?

With NVIDIA/Fuser#3624, devices >= sm89 get allowed to use nvfuser executor for fp8.

crcrpar · 2024-12-20T13:23:02Z

for me to be verbose, this waits on the mentioned nvfuser pr

crcrpar · 2024-12-23T11:41:17Z

@jjsjann123 what would be the nvfuser version that ships the SM89 support?

jjsjann123 · 2024-12-23T17:59:07Z

@jjsjann123 what would be the nvfuser version that ships the SM89 support?

You caught me! I forgot to bump nvfuser version in the PR. I'll go back and do that. So if you are making a version guard, make it 0.2.24 (nvfuser is currently at 0.2.23, and the PR is already in).

jjsjann123 · 2024-12-23T21:40:03Z

thunder/executors/nvfuserex_impl.py

-    cuda_major, _ = torch.cuda.get_device_capability()
-    return cuda_major > 8
+    cuda_major, cuda_minor = torch.cuda.get_device_capability()
+    return (cuda_major, cuda_minor) >= (8, 9)


I realize that this is an ugly bit...

I think the full logic here should copy this: https://github.com/NVIDIA/Fuser/blob/6fa084312d7eec5c69d59f3eb3cbdd9fa72a1600/csrc/device_lower/analysis/device_version.cpp#L24-L39

But that's a lot... We should have a generic API on nvfuser side that does is_dtype_support_on_device(dtype, device_index)

A Python function exposed by nvFuser for this logic would be great!

(Even if we don't do that in this PR, an issue for it would be great)

Signed-off-by: Masaki Kozuki <[email protected]>

for more information, see https://pre-commit.ci

crcrpar mentioned this pull request Dec 20, 2024

Enable fp8 on sm89 NVIDIA/Fuser#3624

Merged

crcrpar force-pushed the crpa/allow_sm89_for_fp8 branch from 2ab964e to 080d391 Compare December 23, 2024 11:40

jjsjann123 reviewed Dec 23, 2024

View reviewed changes

crcrpar force-pushed the crpa/allow_sm89_for_fp8 branch from 080d391 to 28af668 Compare February 5, 2025 11:25

crcrpar and others added 6 commits March 28, 2025 19:48

allow sm89 for fp8 types

04cd73a

Signed-off-by: Masaki Kozuki <[email protected]>

Update nvfuserex_impl.py

d577998

Update nvfuserex_impl.py

a528a09

[pre-commit.ci] auto fixes from pre-commit.com hooks

bc171a4

for more information, see https://pre-commit.ci

Update nvfuserex_impl.py

732b24c

[pre-commit.ci] auto fixes from pre-commit.com hooks

c99e048

for more information, see https://pre-commit.ci

crcrpar force-pushed the crpa/allow_sm89_for_fp8 branch from 26b1582 to c99e048 Compare March 28, 2025 10:48

[pre-commit.ci] auto fixes from pre-commit.com hooks

b543813

for more information, see https://pre-commit.ci

crcrpar closed this Aug 20, 2025

crcrpar deleted the crpa/allow_sm89_for_fp8 branch August 20, 2025 13:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[nvfuser executor] Allow sm89 for fp8 types#1576

[nvfuser executor] Allow sm89 for fp8 types#1576
crcrpar wants to merge 7 commits intomainfrom
crpa/allow_sm89_for_fp8

crcrpar commented Dec 20, 2024

Uh oh!

crcrpar commented Dec 20, 2024

Uh oh!

crcrpar commented Dec 23, 2024

Uh oh!

jjsjann123 commented Dec 23, 2024

Uh oh!

jjsjann123 Dec 23, 2024

Uh oh!

mruberry Dec 23, 2024

Uh oh!

mruberry Dec 23, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

crcrpar commented Dec 20, 2024

What does this PR do?

Uh oh!

crcrpar commented Dec 20, 2024

Uh oh!

crcrpar commented Dec 23, 2024

Uh oh!

jjsjann123 commented Dec 23, 2024

Uh oh!

jjsjann123 Dec 23, 2024

Choose a reason for hiding this comment

Uh oh!

mruberry Dec 23, 2024

Choose a reason for hiding this comment

Uh oh!

mruberry Dec 23, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants