Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Frontend] Avoid startup error log for models without chat template ready ONLY add when PR is ready to merge/full CI is needed
#37040 opened Mar 14, 2026 by DarkLight1337 Loading…
5 tasks
fix: sync delta_token_ids with delta_text during stop-sequence buffering ci/build cpu Related to CPU backends deepseek Related to DeepSeek models documentation Improvements or additions to documentation frontend gpt-oss Related to GPT-OSS models kv-connector llama Related to Llama models multi-modality Related to multi-modality (#4194) needs-rebase nvidia performance Performance-related issues qwen Related to Qwen models rocm Related to AMD ROCm speculative-decoding structured-output tool-calling tpu Related to Google TPUs v1
#37039 opened Mar 14, 2026 by gambletan Loading…
3 tasks
[Bugfix] Fix FlatLogprobs empty slice crash and delta-mode logprobs stale data bug Something isn't working v1
#37038 opened Mar 14, 2026 by mango766 Loading…
3 tasks done
[Hardware] Replace memory related torch.cuda APIs nvidia performance Performance-related issues v1
#37031 opened Mar 14, 2026 by jikunshang Loading…
5 tasks
[Hardware][XPU][ROCm] Align memory usage with cuda on xpu/rocm nvidia rocm Related to AMD ROCm
#37029 opened Mar 14, 2026 by jikunshang Loading…
5 tasks
[CI] Add reasoning parser tests to CI ci/build
#37025 opened Mar 14, 2026 by sfeng33 Loading…
[bug] fix hang dpep pause bug Something isn't working v1
#37024 opened Mar 14, 2026 by hao-aaron Draft
5 tasks
Enable in-process engine core for AsyncLLM. v1
#37021 opened Mar 13, 2026 by wang2yn84 Loading…
5 tasks
[CI][Bugfix] Fix incorrect status handling with set -e in CI shell scripts bug Something isn't working ci/build
#37020 opened Mar 13, 2026 by gkapetanakis Loading…
3 tasks done
[BUG] Collective causing deadlock for DPEP MoE bug Something isn't working frontend v1
#37018 opened Mar 13, 2026 by hao-aaron Loading…
5 tasks
[CI] Split V1 Others into 3 separate jobs ci/build ready ONLY add when PR is ready to merge/full CI is needed
#37016 opened Mar 13, 2026 by khluu Loading…
3 tasks
[CI] Shard Multi-Modal Models (Standard) into 4 parallel jobs ci/build ready ONLY add when PR is ready to merge/full CI is needed
#37014 opened Mar 13, 2026 by khluu Loading…
2 tasks
[Spec Decode] Update extract_hidden_states to use deferred kv_connector clear kv-connector ready ONLY add when PR is ready to merge/full CI is needed speculative-decoding v1
#37013 opened Mar 13, 2026 by fynnsu Loading…
3 of 5 tasks
[Bugfix] Fix FusedMoE weight loading with padded hidden dimensions bug Something isn't working
#37010 opened Mar 13, 2026 by SandishKumarHN Loading…
3 of 4 tasks
[ROCm] issue management - request information for bug issues on ROCm bug Something isn't working ci/build rocm Related to AMD ROCm
#37009 opened Mar 13, 2026 by hongxiayang Loading…
5 tasks
Enable loading of fused expert weights in the Transformers modelling backend ready ONLY add when PR is ready to merge/full CI is needed
#36997 opened Mar 13, 2026 by hmellor Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.