[Issue]: possible wrong vector width for fp8/int8 and fp16/bf16 #940

@michael604work

Problem Description

I think the vector widths in /aiter/csrc/kernels/quant_kernels.cu are wrong and need to be updated. The vector size should be:
16 for fp8/int8,
8 for fp16/bf16.
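
For illustration only, the two proposed widths are what you get if each vectorized access is assumed to cover 16 bytes (128 bits). That access size is an assumption; the issue does not state it. A minimal sketch of the arithmetic, where `kAccessBytes`, `vec_width`, and the stand-in element types are hypothetical names and not code from quant_kernels.cu:

```cpp
#include <cstddef>
#include <cstdint>

// Assumption: one vectorized load/store covers 16 bytes (128 bits).
constexpr std::size_t kAccessBytes = 16;

// Hypothetical helper: how many elements of type T fit in one access.
template <typename T>
constexpr std::size_t vec_width() {
    static_assert(kAccessBytes % sizeof(T) == 0,
                  "element size must divide the access size");
    return kAccessBytes / sizeof(T);
}

// Stand-ins with the same storage size as the real element types:
// fp8/int8 occupy 1 byte, fp16/bf16 occupy 2 bytes.
using elem_8bit  = std::uint8_t;
using elem_16bit = std::uint16_t;

// Compile-time checks matching the widths proposed in this issue.
static_assert(vec_width<elem_8bit>()  == 16, "fp8/int8: 16 elements per access");
static_assert(vec_width<elem_16bit>() == 8,  "fp16/bf16: 8 elements per access");
```

Deriving the width from the element size rather than hard-coding it per type would keep the two cases consistent, if the kernel's access size is in fact 16 bytes.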

Operating System

CentOS Stream 9

CPU

AMD EPYC 9655

GPU

MI350X

ROCm Version

ROCm 7

ROCm Component

No response

Steps to Reproduce

No response

(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support

No response

Additional Information

No response
