Looking for hints/suggestions for debugging TMA-related crash #2054

Answered by Artem-B
Artem-B asked this question in Q&A
I've found the culprit of my TMA troubles. It's the && !CUTLASS_CLANG_CUDA here:

#if ((__CUDACC_VER_MAJOR__ >= 12) || ((__CUDACC_VER_MAJOR__ == 11) && (__CUDACC_VER_MINOR__ >= 7))) && !CUTLASS_CLANG_CUDA
# define CUTLASS_GRID_CONSTANT_SUPPORTED
#endif
// __grid_constant__ can be enabled only on SM70+
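To make the effect of that last term concrete, the guard's condition can be restated as an ordinary predicate and checked for a few configurations. This is only a sketch: the function and parameter names are made up for illustration, and only the boolean expression itself is taken from the guard above.

```cpp
// Hypothetical helper, not part of CUTLASS: restates the guard's condition as a
// constexpr predicate so it can be checked for a few compiler configurations.
constexpr bool grid_constant_supported(int cudacc_major, int cudacc_minor,
                                       bool clang_cuda) {
  return ((cudacc_major >= 12) ||
          ((cudacc_major == 11) && (cudacc_minor >= 7))) &&
         !clang_cuda;
}

// A 12.x toolchain that is not Clang CUDA: the macro would be defined.
static_assert(grid_constant_supported(12, 0, false), "");
// CUDA 11.6: too old, so the macro stays undefined.
static_assert(!grid_constant_supported(11, 6, false), "");
// Same 12.x version numbers, but compiling as Clang CUDA: the macro stays undefined.
static_assert(!grid_constant_supported(12, 0, true), "");
```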

Apparently CUTLASS implicitly assumes that it can pass a pointer to the kernel's params to TMA. Without __grid_constant__, params gets copied into local memory, and a pointer to that copy makes TMA unhappy.
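A minimal sketch of the pattern described above, not CUTLASS's actual code: `Params`, `read_through_pointer`, `kernel`, and the `SKETCH_*` macros are hypothetical stand-ins, and the descriptor is just an opaque byte array rather than a real TMA descriptor. The CUDA keywords are hidden behind macros so that the file also builds as plain C++, where they expand to nothing.

```cpp
// Hypothetical macros (not CUTLASS names). Under a CUDA compiler they map to the
// real keywords; in a plain C++ build they vanish so the sketch still compiles.
#if defined(__CUDACC__)
#  define SKETCH_KERNEL __global__
#  define SKETCH_DEVICE __device__
#  define SKETCH_GRID_CONSTANT __grid_constant__  // needs sm_70+ and compiler support
#else
#  define SKETCH_KERNEL
#  define SKETCH_DEVICE
#  define SKETCH_GRID_CONSTANT
#endif

// Stand-in for a kernel parameter struct that carries a descriptor.
struct Params {
  unsigned char descriptor[128];  // opaque placeholder bytes
  int n;
};

// Stand-in for code that consumes the parameters through a pointer.
SKETCH_DEVICE inline int read_through_pointer(const Params* p) {
  return p->n;
}

// With __grid_constant__, taking &params yields the address of the kernel
// parameter itself. Without the qualifier, the compiler may first copy params
// into thread-local memory so that it has an address to hand out.
SKETCH_KERNEL void kernel(const SKETCH_GRID_CONSTANT Params params, int* out) {
  const Params* p = &params;
  *out = read_through_pointer(p);
}
```

When built with a CUDA compiler, this would be compiled for sm_70 or newer and launched with the usual `kernel<<<grid, block>>>(params, device_out)` syntax. Removing the qualifier still compiles, which is what makes the failure mode easy to miss.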

thakkarV Feb 7, 2025
Collaborator

Answer selected by Artem-B