Support FP8 Quantization and Inference Run on Intel Gaudi (HPU) using INC (Intel Neural Compressor)#12010
Merged
mgoin merged 35 commits intovllm-project:mainfrom Jul 16, 2025
Merged
Commits
Commits on Jun 24, 2025
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- authored andcommitted
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed