-
Notifications
You must be signed in to change notification settings - Fork 422
Pull requests: vllm-project/llm-compressor
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[AWQ] Update MoE mappings to include router in balance layers
#2451
opened Mar 6, 2026 by
brian-dellabetta
Loading…
2 of 3 tasks
[Agents] Add claude skills for When a PR is ready for review
style and test
ready
#2445
opened Mar 4, 2026 by
kylesayrs
Loading…
AWQ smooth layer quantization (v2) [not for land]
documentation
Improvements or additions to documentation
quality-failed
[docs] mkdocs material is EOL, move to a zensical docs build
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2415
opened Feb 26, 2026 by
aireilly
Loading…
refactor(awq): restructure AWQModifier to be similar to SmoothQuantCl…
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2402
opened Feb 24, 2026 by
vishnuprasanth-j
Loading…
[DDP][GPTQ] Fixes and Testing
documentation
Improvements or additions to documentation
needs-rebase
quality-failed
Feature/calibrate weights dfs fused modules
needs-rebase
#2394
opened Feb 23, 2026 by
GOavi101
Loading…
[Distributed] Extend QuantizationModifier to support distributed activation calibration
documentation
Improvements or additions to documentation
#2391
opened Feb 22, 2026 by
Etelis
Loading…
3 tasks done
perf: make MSE observer compatible with torch.compile (dual-path implementation)
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2384
opened Feb 18, 2026 by
Bias92
Loading…
feat: add Qwen3.5 MoE calibration module
documentation
Improvements or additions to documentation
nvfp4
For any PR / issue related to NVFP4 support
quality-failed
qwen
For any PR / issue related to Qwen support
ready
When a PR is ready for review
#2383
opened Feb 18, 2026 by
Sehyo
Loading…
Add model_free_ptq example for glm 4.6 block fp8
documentation
Improvements or additions to documentation
#2343
opened Feb 10, 2026 by
mgoin
Loading…
[MoE] MiniMax-M2/M2.1 calibration follow-up
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2335
opened Feb 6, 2026 by
LudovicoYIN
Loading…
Add GSM8K evaluation script and AWQ+FP8 results
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2330
opened Feb 4, 2026 by
rtj1
Loading…
[AWQ] Add option to consider smooth layer quantization in scale search
awq
For any issue / PR related to AWQ support
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2323
opened Jan 31, 2026 by
Ramshankar07
Loading…
Benchmark torch.compile optimization for GPTQ
needs-rebase
ready
When a PR is ready for review
#2320
opened Jan 31, 2026 by
colldata79
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.