CUTLASS 4.3.2

hwu36 released this 05 Dec 18:51
· 7 commits to release/4.3 since this release
5c149f5

CuTe DSL

  • New features

    • New env var CUTE_DSL_CACHE_DIR to specify the directory where caches are dumped
  • Bug fixes and improvements

    • Fixed an issue in the CUDA JitExecutor when unloading kernels
    • Fixed an issue with allocating the maximum smem when statically allocated smem is also present
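
As a minimal sketch of how the new CUTE_DSL_CACHE_DIR variable might be used from Python: the helper name `configure_cute_dsl_cache` and the example directory are hypothetical, and the sketch assumes the variable should be set before the DSL is imported and that creating the directory up front is harmless. Neither assumption is confirmed by these release notes.

```python
import os
import tempfile
from pathlib import Path


def configure_cute_dsl_cache(cache_dir):
    """Hypothetical helper: point CUTE_DSL_CACHE_DIR at cache_dir.

    Creates the directory if it does not exist (an assumption; the DSL
    may well create it on its own) and returns the resolved path.
    """
    path = Path(cache_dir).expanduser().resolve()
    path.mkdir(parents=True, exist_ok=True)
    os.environ["CUTE_DSL_CACHE_DIR"] = str(path)
    return path


# Example location only; any writable directory should do.
cache_path = configure_cute_dsl_cache(Path(tempfile.gettempdir()) / "cute_dsl_cache")

# Assumption: import the DSL only after the variable is set, e.g.
#   import cutlass.cute as cute
print(f"CUTE_DSL_CACHE_DIR={os.environ['CUTE_DSL_CACHE_DIR']}")
```

The same effect can be had by exporting the variable in the shell before launching Python, which avoids any dependence on import order.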