Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[LoRA] Support FusedMoE LoRA Triton kernel for mxfp4 gpt-oss Related to GPT-OSS models ready ONLY add when PR is ready to merge/full CI is needed
#29708 opened Nov 29, 2025 by xyang16 Loading…
5 tasks
Fix RoPE failures in Transformers nightly
#29700 opened Nov 28, 2025 by hmellor Loading…
FlashInfer-Bench Integration for vLLM documentation Improvements or additions to documentation nvidia
#29695 opened Nov 28, 2025 by sfc-gh-goliaro Draft
4 of 11 tasks
[Misc] Refactor tokenizer interface ci/build deepseek Related to DeepSeek models documentation Improvements or additions to documentation frontend llama Related to Llama models multi-modality Related to multi-modality (#4194) performance Performance-related issues qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed ready-run-all-tests Trigger CI with all tests for wide-ranging PRs structured-output tool-calling v1
#29693 opened Nov 28, 2025 by DarkLight1337 Loading…
5 tasks
[Bugfix] Schedule failure due to wrong get_image_size_with_most_features qwen Related to Qwen models
#29692 opened Nov 28, 2025 by tomtomjhj Loading…
3 of 5 tasks
[WIP][Kernel]Support W4A8 Grouped GEMM on Hopper ci/build new-model Requests to new models nvidia
#29691 opened Nov 28, 2025 by czhu-cohere Loading…
5 tasks
[Chore]: Remove Olmo3 and FlexOlmo config copy ready ONLY add when PR is ready to merge/full CI is needed
#29677 opened Nov 28, 2025 by Isotr0py Loading…
1 of 5 tasks
[NIXL] Add remote_request_id to kv_transfer_params kv-connector ready ONLY add when PR is ready to merge/full CI is needed v1
#29665 opened Nov 28, 2025 by markmc Loading…
[Bugfix] Fix prefix_repetition routing in bench throughput performance Performance-related issues
#29663 opened Nov 28, 2025 by jr-shen Loading…
3 of 5 tasks
simplify requires_files list creation
#29656 opened Nov 28, 2025 by nwaughachukwuma Loading…
fix potential object has no attribute 'bias' error
#29653 opened Nov 28, 2025 by allerou4 Loading…
5 tasks
[Model] Add step-deepresearch tool parser frontend tool-calling
#29652 opened Nov 28, 2025 by randzero Loading…
3 of 5 tasks
[P/D] Add P/D disaggregation deployment on Ray documentation Improvements or additions to documentation frontend kv-connector
#29649 opened Nov 28, 2025 by JackyMa1997 Loading…
5 tasks
[Attention] Make split_decodes_and_prefills(..., require_uniform=True) support padding ready ONLY add when PR is ready to merge/full CI is needed v1
#29644 opened Nov 28, 2025 by LucasWilkinson Loading…
[Kernel][MoE] optimize moe_align_block_size performance Performance-related issues
#29642 opened Nov 28, 2025 by jinzhen-lin Loading…
ProTip! Add no:assignee to see everything that’s not assigned.