You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi @wanzhenchn , thanks for taking an interest in our AWQ feature! We have merged most of the AWQ logic but we have a few TODOs related to the issues you are hitting (here and here). We wanted to add these in a separate PR so that the initial PR is largely a port of the code in AutoAWQ, and so we have an example for how additional mappings can be added for other architectures.
We will wrap this up in the next couple weeks and make a release and more public announcement that AWQ is ready for consumption.
Hi @wanzhenchn , thanks for taking an interest in our AWQ feature! We have merged most of the AWQ logic but we have a few TODOs related to the issues you are hitting (here and here). We wanted to add these in a separate PR so that the initial PR is largely a port of the code in AutoAWQ, and so we have an example for how additional mappings can be added for other architectures.
We will wrap this up in the next couple weeks and make a release and more public announcement that AWQ is ready for consumption.
Thanks for your feedback, looking forward to AWQ supporting more models.
How to run AWQ-W4Afp8 quantization on MoE models?
I have run awq-w4afp8 quantization on Qwen1.5-MoE-A2.7B, however, the ValueError occurred below
The text was updated successfully, but these errors were encountered: