Problem Description
I think the vector widths in /aiter/csrc/kernels/quant_kernels.cu need to be updated so that the vector size is:
16 elements for fp8/int8,
8 elements for fp16/bf16.
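Both requested widths come out to the same 16 bytes (128 bits) per vector: 16 x 1 byte for fp8/int8 and 8 x 2 bytes for fp16/bf16. A minimal sketch of deriving the width from the element size is shown here. It assumes a 16-byte access per thread is the intent; `kVectorBytes` and `vec_size_for` are hypothetical names, not identifiers from quant_kernels.cu, and the standard integer types stand in for the fp8/int8 and fp16/bf16 device types.

```cpp
#include <cstddef>
#include <cstdint>

// Assumption: each thread should move one 16-byte (128-bit) vector.
constexpr std::size_t kVectorBytes = 16;

// Hypothetical helper: number of elements of type T in one 16-byte vector.
template <typename T>
constexpr std::size_t vec_size_for() {
    static_assert(kVectorBytes % sizeof(T) == 0,
                  "element size must divide the vector width");
    return kVectorBytes / sizeof(T);
}

// 1-byte elements (stand-in for fp8/int8) give 16 elements per vector.
static_assert(vec_size_for<std::int8_t>() == 16, "expected 16 for 1-byte types");
// 2-byte elements (stand-in for fp16/bf16) give 8 elements per vector.
static_assert(vec_size_for<std::uint16_t>() == 8, "expected 8 for 2-byte types");
```

Deriving the width from `sizeof(T)` rather than hard-coding it per type would keep the two cases consistent if more element types are added, though whether that fits the existing kernel structure has not been checked here.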
Operating System
CentOS Stream 9
CPU
AMD EPYC 9655
GPU
MI350X
ROCm Version
rocm7
ROCm Component
No response
Steps to Reproduce
No response
(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support
No response
Additional Information
No response