Skip to content

CUTLASS 3.9.2

Compare
Choose a tag to compare
@hwu36 hwu36 released this 04 May 04:25
· 21 commits to main since this release
ad7b2f5
  • Fixed Blockwise and Groupwise GEMM hang issue when problem size K is 128.
  • Optimal code generation with CUDA toolkit versions 12.9.