Skip to content

Commit f535c33

Browse files
authored
3.9.1 doc/version change (#2273)
1 parent e3cb8a7 commit f535c33

File tree

6 files changed

+10
-6
lines changed

6 files changed

+10
-6
lines changed

CHANGELOG.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,9 @@
11
# NVIDIA CUTLASS Changelog
22

3+
## [3.9.1](https://github.com/NVIDIA/cutlass/releases/tag/v3.9.1) (2025-04-30)
4+
5+
* Fixed Group Gemm hang issue in CUTLASS 3.x
6+
* Improved Hopper [Blockwise](./examples/67_hopper_fp8_warp_specialized_gemm_with_blockwise_scaling/67_hopper_fp8_warp_specialized_gemm_with_blockwise_scaling.cu) and [Groupwise](./examples/67_hopper_fp8_warp_specialized_gemm_with_blockwise_scaling/67_hopper_fp8_warp_specialized_gemm_with_groupwise_scaling.cu) GEMM performance.
37

48
## [3.9.0](https://github.com/NVIDIA/cutlass/releases/tag/v3.9.0) (2025-04-24)
59

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,8 @@
11
![ALT](./media/images/gemm-hierarchy-with-epilogue-no-labels.png "Complete CUDA GEMM decomposition")
22

3-
# CUTLASS 3.9.0
3+
# CUTLASS 3.9.1
44

5-
_CUTLASS 3.9.0 - April 2025_
5+
_CUTLASS 3.9.1 - April 2025_
66

77
CUTLASS is a collection of CUDA C++ template abstractions for implementing
88
high-performance matrix-matrix multiplication (GEMM) and related computations at all levels

include/cutlass/version.h

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -36,7 +36,7 @@
3636

3737
#define CUTLASS_MAJOR 3
3838
#define CUTLASS_MINOR 9
39-
#define CUTLASS_PATCH 0
39+
#define CUTLASS_PATCH 1
4040

4141
#ifdef CUTLASS_VERSIONS_GENERATED
4242
#include "cutlass/version_extended.h"

python/cutlass/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -133,7 +133,7 @@ def get_option_registry():
133133
this._option_registry = OptionRegistry(device_cc())
134134
return this._option_registry
135135

136-
this.__version__ = '3.9.0'
136+
this.__version__ = '3.9.1'
137137

138138
from cutlass.backend import create_memory_pool
139139
from cutlass.emit.pytorch import pytorch

python/setup_library.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -36,7 +36,7 @@
3636
def perform_setup():
3737
setup(
3838
name='cutlass_library',
39-
version='3.9.0',
39+
version='3.9.1',
4040
description='CUTLASS library generation scripts',
4141
packages=['cutlass_library']
4242
)

python/setup_pycute.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -36,7 +36,7 @@
3636
def perform_setup():
3737
setup(
3838
name='pycute',
39-
version='3.9.0',
39+
version='3.9.1',
4040
description='Python implementation of CuTe',
4141
packages=['pycute'],
4242
)

0 commit comments

Comments
 (0)