NVIDIA / cutlass Public

Notifications You must be signed in to change notification settings
Fork 1k
Star 5.9k

Code
Issues 198
Pull requests 34
Discussions
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Issues: NVIDIA/cutlass

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

198 Open 996 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

[BUG] Where is 3.6.0 release? ? - Needs Triage bug

Something isn't working

#2012 opened Dec 25, 2024 by ankutalev

[BUG] [QST] Regression - why Sm90RowBroadcast in 3.5.1 stops support smem usage? ? - Needs Triage bug

Something isn't working

#2010 opened Dec 23, 2024 by ankutalev

[BUG] Removal of OpMultiplyAdd template substitutions from mma_sm80.h in 3.5.1 ? - Needs Triage bug

Something isn't working

#2009 opened Dec 23, 2024 by ankutalev

[QST]How Does TMA Work in CUTLASS for Writing from Shared Memory to Global Memory? ? - Needs Triage question

Question

#2008 opened Dec 23, 2024 by ziyuhuang123

[QST] How to Let __launch_bounds__ and setmaxnreg Work with Each Other? ? - Needs Triage question

Question

#2007 opened Dec 23, 2024 by Maximilianxu

[BUG] wmma should be enabled w/ clang. ? - Needs Triage bug

Something isn't working

#2006 opened Dec 20, 2024 by Artem-B

[BUG] Unaligned access in test/unit/gemm/threadblock/batched_gemv.cu ? - Needs Triage bug

Something isn't working

#2003 opened Dec 19, 2024 by Artem-B

[QST]Behavior of TMA Store and Wait Mechanism in CUTLASS ? - Needs Triage question

Question

#2002 opened Dec 19, 2024 by ziyuhuang123

[QST] When to use MainloopSm90TmaGmmaWarpSpecializedFP8? ? - Needs Triage question

Question

#2001 opened Dec 19, 2024 by ginowu

[Proposal] layout deduction ambiguity of Nested Layout Access Problem ? - Needs Triage bug

Something isn't working

#2000 opened Dec 18, 2024 by yiakwy-xpu-ml-framework-team

[QST]Is the Key Difference Between mbarrier and barrier Their Handling of Producer-Consumer Count? ? - Needs Triage question

Question

#1999 opened Dec 18, 2024 by ziyuhuang123

[QST]How to Handle Synchronization with Different Thread Counts for Producer and Consumer in CUTLASS? ? - Needs Triage question

Question

#1998 opened Dec 18, 2024 by ziyuhuang123

[BUG] calling cast_smem_ptr_to_uint(device fn) from make_gmma_desc(host device fn) is not allowed ? - Needs Triage bug

Something isn't working

#1997 opened Dec 18, 2024 by lygztq

[QST] Gemm got 'incomplete type is not allowed' when use Sm90 ? - Needs Triage question

Question

#1996 opened Dec 18, 2024 by TopIdiot

[QST] custom kernel integrated in Pytorch ? - Needs Triage question

Question

#1991 opened Dec 16, 2024 by IzanCatalan

[BUG] conv2d int8 doesn't work with python ? - Needs Triage bug

Something isn't working

#1990 opened Dec 16, 2024 by IzanCatalan

[QST] Global variable inside conv2d kernel ? - Needs Triage question

Question

#1987 opened Dec 15, 2024 by IzanCatalan

[QST] fp8 gemm ? - Needs Triage question

Question

#1986 opened Dec 15, 2024 by yangjianfengo1

Unconstrained definition of swap breaks types that pull in other namespaces with swap

#1984 opened Dec 12, 2024 by miscco

[BUG] Funcionality TensorOp 80+ s8 * s8 + s32 => {s32, s8} not working ? - Needs Triage bug

Something isn't working

#1981 opened Dec 11, 2024 by IzanCatalan

[QST] Cutlass kernel causes no grad in torch backward pass ? - Needs Triage question

Question

#1980 opened Dec 10, 2024 by MinghaoYan

[QST] Integer Data Types are available for Conv2d fprop? ? - Needs Triage question

Question

#1979 opened Dec 10, 2024 by IzanCatalan

[QST] What Conv2d definition is used with Python and Pytorch ? - Needs Triage question

Question

#1978 opened Dec 10, 2024 by IzanCatalan

[QST] Where is exactly the definiton code of fprop convolution? ? - Needs Triage question

Question

#1976 opened Dec 9, 2024 by IzanCatalan

[FEA]Is it support BlackWell Architecture feature request

New feature or request

#1975 opened Dec 9, 2024 by Emiyagzm

Previous 1 2 3 4 5 6 7 8 Next

Previous Next

ProTip! Find all open issues with in progress development work with linked:pr.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly