-
-
Notifications
You must be signed in to change notification settings - Fork 8.8k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[bugfix] fix syntax warning caused by backslash
documentation
Improvements or additions to documentation
speculative-decoding
v1
#21251
opened Jul 20, 2025 by
1195343015
Loading…
[Misc] fixed nvfp4_moe test failures due to invalid kwargs
#21246
opened Jul 20, 2025 by
chenyang78
Loading…
4 tasks
[Core] Optimize update checks in LogitsProcessor
v1
#21245
opened Jul 20, 2025 by
Jialin
Loading…
3 of 4 tasks
[CI] Cleanup modelscope version constraint in Dockerfile
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#21243
opened Jul 20, 2025 by
yankay
Loading…
4 tasks
[FEAT] [ROCm] [AITER]: Add AITER HIP block quant kernel
rocm
Related to AMD ROCm
#21242
opened Jul 20, 2025 by
tjtanaa
Loading…
3 of 4 tasks
[bugfix] Remove the attribute 'version' from docker compose
documentation
Improvements or additions to documentation
#21241
opened Jul 20, 2025 by
1195343015
Loading…
[DP] Internal Load Balancing Per Node [
one-pod-per-node
]
frontend
v1
#21238
opened Jul 20, 2025 by
robertgshaw2-redhat
•
Draft
3 of 4 tasks
[CI/Build] Add bc-linter to vLLM CI
ci/build
#21234
opened Jul 19, 2025 by
zhewenl
Loading…
3 of 4 tasks
Integrate TensorSchema with shape validation for Phi3VImagePixelInputs
#21232
opened Jul 19, 2025 by
bbeckca
Loading…
[WIP][Kernel]FusedMoE LoRA
ci/build
deepseek
Related to DeepSeek models
needs-rebase
#21229
opened Jul 19, 2025 by
CNTRYROA
Loading…
4 tasks
[Model][1/N] Support multiple poolers at model level
documentation
Improvements or additions to documentation
frontend
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
tpu
Related to Google TPUs
v1
#21227
opened Jul 19, 2025 by
DarkLight1337
Loading…
3 of 4 tasks
[Core] Introduce popleft_n and append_n in FreeKVCacheBlockQueue to further optimize block_pool
v1
#21222
opened Jul 19, 2025 by
Jialin
Loading…
3 of 4 tasks
[Bugfix] Fixed the missing metrics in output
frontend
v1
#21216
opened Jul 19, 2025 by
hsliuustc
Loading…
3 of 4 tasks
[wip] [Feature] [V1] intermediate logging
documentation
Improvements or additions to documentation
needs-rebase
v1
Add chat doc in quick start
documentation
Improvements or additions to documentation
#21213
opened Jul 19, 2025 by
TankNee
Loading…
1 of 4 tasks
[core] Set CUDA_VISIBLE_DEVICES before spawning the subprocesses
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#21211
opened Jul 19, 2025 by
yinghai
Loading…
4 tasks
[benchmark] Port benchmark request sent optimization to benchmark_serving
performance
Performance-related issues
#21209
opened Jul 18, 2025 by
Jialin
Loading…
3 of 4 tasks
[Refactor] Fix Compile Warning #1444-D: type "cub::CUB_200802_SM_1000::Sum" was declared deprecated ("use cuda::std::plus instead")
#21208
opened Jul 18, 2025 by
yewentao256
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.