add missing Mma-specialization for m16n8k32.s32.s4.s4.s32 #2936

dxqb · 2026-01-08T21:41:05Z

There appear to be missing Mma specializations for certain operator shapes. Requesting such a shape leads to an incomplete type, for example:

using ElementA = cutlass::int4b_t;
using ElementB = cutlass::int4b_t;
using ElementC = int32_t;
using ElementAccumulator = int32_t;
using LayoutA = cutlass::layout::RowMajor;
using LayoutB = cutlass::layout::ColumnMajor;
using LayoutC = cutlass::layout::RowMajor;
using MMAOp  = cutlass::arch::OpClassTensorOp;
using SmArch = cutlass::arch::Sm80;
using OperatorShape    = cutlass::gemm::GemmShape<16, 8, 32>;
...
using Gemm = cutlass::gemm::device::Gemm<...

But there is an m16n8k32 instruction for s4.

The attached code seems to work, but take this with a grain of salt because I'm very new to CUTLASS. It's also incomplete: if this is correct, the same specialization also needs to be added for u4 and for the mixed s4/u4 combinations.
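As a sanity check on the operand sizes such a specialization would need, here is a minimal, host-only C++ sketch. It assumes only that a warp-level MMA divides each operand tile evenly across the 32 threads of a warp. The helper names (elements_per_thread, registers_per_thread) are hypothetical and are not part of CUTLASS or of the attached code.

```cpp
// Hypothetical helpers for illustration only; not CUTLASS API.
// Assumption: each operand tile of a warp-level MMA is split evenly
// across the 32 threads of the warp.
constexpr int kWarpSize = 32;
constexpr int kBitsPerRegister = 32;

// Number of elements each thread holds for a rows x cols operand tile.
constexpr int elements_per_thread(int rows, int cols) {
  return rows * cols / kWarpSize;
}

// Number of 32-bit registers each thread needs to hold its share of the tile.
constexpr int registers_per_thread(int rows, int cols, int bits_per_element) {
  return elements_per_thread(rows, cols) * bits_per_element / kBitsPerRegister;
}

// Shape under discussion: M=16, N=8, K=32, 4-bit A/B, 32-bit accumulators.
constexpr int kM = 16;
constexpr int kN = 8;
constexpr int kK = 32;

// A is M x K of 4-bit values: 16 elements per thread, packed in 2 registers.
static_assert(elements_per_thread(kM, kK) == 16, "A elements per thread");
static_assert(registers_per_thread(kM, kK, 4) == 2, "A registers per thread");

// B is K x N of 4-bit values: 8 elements per thread, packed in 1 register.
static_assert(elements_per_thread(kK, kN) == 8, "B elements per thread");
static_assert(registers_per_thread(kK, kN, 4) == 1, "B registers per thread");

// C/D is M x N of 32-bit values: 4 elements per thread, 4 registers.
static_assert(elements_per_thread(kM, kN) == 4, "C elements per thread");
static_assert(registers_per_thread(kM, kN, 32) == 4, "C registers per thread");
```

Under that assumption, a specialization for this shape would be expected to use per-thread fragments of 16, 8 and 4 elements for A, B and C respectively. Whether the attached code does this should be checked against the PTX documentation for the instruction.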

int4k32

8aa5a45
