### Problem Description

I think the vector widths in `/aiter/csrc/kernels/quant_kernels.cu` need to be updated: the vector size should be 16 for fp8/int8 and 8 for fp16/bf16.

### Operating System

CentOS Stream 9

### CPU

AMD EPYC 9655

### GPU

MI350X

### ROCm Version

rocm7

### ROCm Component

_No response_

### Steps to Reproduce

_No response_

### (Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support

_No response_

### Additional Information

_No response_
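
To illustrate the widths proposed in the problem description: both values (16 elements of 1 byte, 8 elements of 2 bytes) come to 16 bytes per vector access. The host-only sketch below derives the width from the element size under that assumption. The names `kVecBytes`, `vec_width`, `fp8_like`, and `fp16_like` are hypothetical and are not taken from `quant_kernels.cu`. The integer aliases only stand in for the real device types.

```cpp
#include <cstddef>
#include <cstdint>

// Assumption (not from the aiter source): each vectorized load/store
// should move 16 bytes (128 bits).
constexpr std::size_t kVecBytes = 16;

// Number of elements of type T that fit in one 16-byte vector access.
template <typename T>
constexpr std::size_t vec_width() {
    static_assert(kVecBytes % sizeof(T) == 0,
                  "element size must divide the vector size in bytes");
    return kVecBytes / sizeof(T);
}

// Hypothetical stand-ins for the device types:
// fp8/int8 occupy 1 byte, fp16/bf16 occupy 2 bytes.
using fp8_like  = std::uint8_t;
using fp16_like = std::uint16_t;

// Compile-time checks matching the widths proposed in this issue.
static_assert(vec_width<fp8_like>() == 16, "fp8/int8 -> 16 elements");
static_assert(vec_width<fp16_like>() == 8, "fp16/bf16 -> 8 elements");
```

Deriving the width from `sizeof(T)` rather than hard-coding it per type would keep the two cases consistent, though whether that fits the existing kernel structure is something the maintainers would need to confirm.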