feat(examples/test_run): use runtime sm arch by tpoisonooo · Pull Request #2916 · NVIDIA/cutlass

tpoisonooo · 2025-12-31T09:05:37Z

Issue Summary

Example 13_two_tensor_op_fusion uses hardcoded SM architecture, causing confusion and extra work for new comers.

Background

I'm using H800 GPU (SM90) and initially misunderstood the SM compatibility (perhaps due to TensorRT 1.0 and cudnn7).

To run the 13_two_tensor_op_fusion example, I have spent hours reading template source code, write new version and fix compile error for SM90 (like these code) .

Proposal

Modify testRun to use actual SM architecture of the current GPU, rather than using a hardcoded value.

Benefits

Improves onboarding experience by eliminating manual code modification for new users
Avoids confusion about SM compatibility requirements

examples/45_dual_gemm/test_run.h

tpoisonooo · 2026-01-06T08:45:13Z

Now the fix only applies to examples/13_two_tensor_op_fusion.
I tested it with

cd build/examples/13_two_tensor_op_fusion

for f in 13_fused_*; do [ -f "$f" ] && ./"$f" >> run.out; done

# manually check the output file

@hwu36

hwu36 · 2026-01-07T04:59:53Z

@jwang323

github-actions · 2026-02-06T05:27:21Z

This PR has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this PR if it is no longer required. Otherwise, please respond with a comment indicating any updates. This PR will be labeled inactive-90d if there is no activity in the next 60 days.

tpoisonooo · 2026-02-06T11:42:52Z

@jwu323 please review.

feat(examples/test_run): use runtime sm arch

661be79

hwu36 reviewed Jan 6, 2026

View reviewed changes

examples/45_dual_gemm/test_run.h Show resolved Hide resolved

tpoisonooo added 2 commits January 6, 2026 16:26

fix(exampls/13_two_tensor_op_fusion): cover more case

30d32f8

revert(examples/45_dual_gemm): revert empty online on tail

411d837

github-actions bot added the inactive-30d label Feb 6, 2026

github-actions bot removed the inactive-30d label Feb 6, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(examples/test_run): use runtime sm arch#2916

feat(examples/test_run): use runtime sm arch#2916
tpoisonooo wants to merge 3 commits intoNVIDIA:mainfrom
tpoisonooo:use-runtime-sm-arch

tpoisonooo commented Dec 31, 2025 •

edited

Loading

Uh oh!

Uh oh!

tpoisonooo commented Jan 6, 2026

Uh oh!

hwu36 commented Jan 7, 2026

Uh oh!

github-actions bot commented Feb 6, 2026

Uh oh!

tpoisonooo commented Feb 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

tpoisonooo commented Dec 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Issue Summary

Background

Proposal

Benefits

Uh oh!

Uh oh!

tpoisonooo commented Jan 6, 2026

Uh oh!

hwu36 commented Jan 7, 2026

Uh oh!

github-actions bot commented Feb 6, 2026

Uh oh!

tpoisonooo commented Feb 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

tpoisonooo commented Dec 31, 2025 •

edited

Loading