CUTLASS 4.3.2
CuTe DSL
New features
- New env var CUTE_DSL_CACHE_DIR to specify the path for dumping caches
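A minimal sketch of how the new variable might be set from Python. Only the variable name CUTE_DSL_CACHE_DIR comes from these notes. The helper `configure_cute_dsl_cache`, the example directory, and the assumption that the variable should be set before the DSL is imported are illustrative and not confirmed here.

```python
import os
import tempfile
from pathlib import Path


def configure_cute_dsl_cache(cache_dir):
    """Hypothetical helper: create cache_dir and export it as CUTE_DSL_CACHE_DIR."""
    path = Path(cache_dir).expanduser().resolve()
    # Create the directory up front so the path exists before anything writes to it.
    path.mkdir(parents=True, exist_ok=True)
    os.environ["CUTE_DSL_CACHE_DIR"] = str(path)
    return path


# Example location only; any writable directory would do.
cache_path = configure_cute_dsl_cache(Path(tempfile.gettempdir()) / "cute_dsl_cache")

# Assumption: the DSL reads the variable when it is imported or when it first
# compiles a kernel, so the import is placed after the variable is set.
# import cutlass.cute as cute
```

The same effect can be had by exporting the variable in the shell before launching the Python process, which avoids any question about import order.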
Bug fixes and improvements
- Fixed an issue in the CUDA JitExecutor when unloading kernels
- Fixed an issue with allocating the maximum amount of shared memory (smem) when statically allocated smem is present