@dxqb dxqb commented Jan 8, 2026

There appear to be missing specializations for certain operator shapes. For example, the following configuration leads to an incomplete type:

using ElementA = cutlass::int4b_t;
using ElementB = cutlass::int4b_t;
using ElementC = int32_t;
using ElementAccumulator = int32_t;
using LayoutA = cutlass::layout::RowMajor;
using LayoutB = cutlass::layout::ColumnMajor;
using LayoutC = cutlass::layout::RowMajor;
using MMAOp  = cutlass::arch::OpClassTensorOp;
using SmArch = cutlass::arch::Sm80;
using OperatorShape    = cutlass::gemm::GemmShape<16, 8, 32>;
...
using Gemm = cutlass::gemm::device::Gemm<...

But there is an m16n8k32 instruction for s4, so this operator shape should be usable.
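For readers less familiar with this failure mode, here is a minimal standalone sketch of how a missing specialization turns into an incomplete type. None of these names are CUTLASS types: `Shape`, `S4`, `S8`, `Mma` and `is_complete` are toy stand-ins, and the set of specializations is chosen only to mirror the situation described above, not copied from the library. It needs C++17 for `std::void_t`.

```cpp
#include <type_traits>

// Toy stand-ins (hypothetical, NOT the CUTLASS definitions).
template <int M, int N, int K> struct Shape {};
struct S4 {};  // stands in for a signed 4-bit element type
struct S8 {};  // stands in for a signed 8-bit element type

// The primary template is declared but never defined, so any
// (shape, element) pair without a specialization is an incomplete type.
template <typename ShapeT, typename Element> struct Mma;

// Only these two combinations are specialized in this toy.
template <> struct Mma<Shape<16, 8, 64>, S4> { static constexpr int kK = 64; };
template <> struct Mma<Shape<16, 8, 32>, S8> { static constexpr int kK = 32; };

// Completeness check: sizeof(T) is ill-formed for an incomplete type,
// which removes the partial specialization below via SFINAE.
template <typename T, typename = void>
struct is_complete : std::false_type {};
template <typename T>
struct is_complete<T, std::void_t<decltype(sizeof(T))>> : std::true_type {};

// A specialized combination is complete.
constexpr bool has_s4_k64 = is_complete<Mma<Shape<16, 8, 64>, S4>>::value;
// The same shape exists for S8 but not for S4, so this one is incomplete.
constexpr bool has_s4_k32 = is_complete<Mma<Shape<16, 8, 32>, S4>>::value;

static_assert(has_s4_k64, "specialized combination should be complete");
static_assert(!has_s4_k32, "unspecialized combination should be incomplete");
```

Anything that tries to instantiate an object of the unspecialized combination, or read a member from it, fails to compile with an incomplete-type error, which is the symptom reported here.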

The attached code seems to work, but take this with a grain of salt because I'm very new to CUTLASS. It's also incomplete: if this is correct, the same specialization also needs to be added for u4 and for the mixed s4/u4 combinations.
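Since u4 has the same width as s4, the per-thread fragment sizes should be the same for all of those combinations. As a rough sanity check (not taken from the attached code), the register counts can be derived from the tile dimensions alone, assuming a 32-thread warp and 32-bit registers. The helper name `regs_per_thread` is made up for this sketch.

```cpp
// Assumptions for this sketch: one warp of 32 threads cooperates on the
// MMA, and operands are packed into 32-bit registers.
constexpr int kWarpSize = 32;
constexpr int kRegisterBits = 32;

// Hypothetical helper: number of 32-bit registers each thread holds for a
// rows x cols tile whose elements are bits_per_element wide.
constexpr int regs_per_thread(int rows, int cols, int bits_per_element) {
  return rows * cols * bits_per_element / (kWarpSize * kRegisterBits);
}

// m16n8k32 with 4-bit A/B operands and 32-bit accumulators.
constexpr int kRegsA = regs_per_thread(16, 32, 4);  // A tile is M x K
constexpr int kRegsB = regs_per_thread(32, 8, 4);   // B tile is K x N
constexpr int kRegsC = regs_per_thread(16, 8, 32);  // C tile is M x N

static_assert(kRegsA == 2, "A: 16*32*4 bits over 32 threads");
static_assert(kRegsB == 1, "B: 32*8*4 bits over 32 threads");
static_assert(kRegsC == 4, "C: 16*8*32 bits over 32 threads");
```

That works out to 16 four-bit elements of A, 8 of B, and 4 accumulators per thread, which is what the fragment types of any added specialization would need to match.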
