Skip to content

Pull requests: vllm-project/llm-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Compressors] Remove sparse compression
#2452 opened Mar 7, 2026 by kylesayrs Loading…
[AWQ] Update MoE mappings to include router in balance layers
#2451 opened Mar 6, 2026 by brian-dellabetta Loading…
2 of 3 tasks
[Agents] Add claude skills for style and test ready When a PR is ready for review
#2445 opened Mar 4, 2026 by kylesayrs Loading…
[Docs] Update author from NeuralMagic to vLLM
#2444 opened Mar 4, 2026 by kylesayrs Loading…
AWQ smooth layer quantization (v2) [not for land] documentation Improvements or additions to documentation quality-failed
#2431 opened Mar 3, 2026 by HDCharles Draft
[docs] mkdocs material is EOL, move to a zensical docs build documentation Improvements or additions to documentation ready When a PR is ready for review
#2415 opened Feb 26, 2026 by aireilly Loading…
refactor(awq): restructure AWQModifier to be similar to SmoothQuantCl… documentation Improvements or additions to documentation ready When a PR is ready for review
#2402 opened Feb 24, 2026 by vishnuprasanth-j Loading…
[DDP][GPTQ] Fixes and Testing documentation Improvements or additions to documentation needs-rebase quality-failed
#2400 opened Feb 24, 2026 by HDCharles Draft
Feature/intermediates cache prefetch
#2392 opened Feb 22, 2026 by GOavi101 Loading…
[Distributed] Extend QuantizationModifier to support distributed activation calibration documentation Improvements or additions to documentation
#2391 opened Feb 22, 2026 by Etelis Loading…
3 tasks done
perf: make MSE observer compatible with torch.compile (dual-path implementation) documentation Improvements or additions to documentation ready When a PR is ready for review
#2384 opened Feb 18, 2026 by Bias92 Loading…
feat: add Qwen3.5 MoE calibration module documentation Improvements or additions to documentation nvfp4 For any PR / issue related to NVFP4 support quality-failed qwen For any PR / issue related to Qwen support ready When a PR is ready for review
#2383 opened Feb 18, 2026 by Sehyo Loading…
[Qwen3.5 MoE Support] documentation Improvements or additions to documentation quality-failed
#2377 opened Feb 17, 2026 by dsikka Draft
Add model_free_ptq example for glm 4.6 block fp8 documentation Improvements or additions to documentation
#2343 opened Feb 10, 2026 by mgoin Loading…
[MoE] MiniMax-M2/M2.1 calibration follow-up documentation Improvements or additions to documentation ready When a PR is ready for review
#2335 opened Feb 6, 2026 by LudovicoYIN Loading…
Add GSM8K evaluation script and AWQ+FP8 results documentation Improvements or additions to documentation ready When a PR is ready for review
#2330 opened Feb 4, 2026 by rtj1 Loading…
[AWQ] Add option to consider smooth layer quantization in scale search awq For any issue / PR related to AWQ support documentation Improvements or additions to documentation ready When a PR is ready for review
#2323 opened Jan 31, 2026 by Ramshankar07 Loading…
Benchmark torch.compile optimization for GPTQ needs-rebase ready When a PR is ready for review
#2320 opened Jan 31, 2026 by colldata79 Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.