Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[BUGFIX][DEEPSEEK][MODEL_LOAD] fix w13, w2 weight not initialized assert ready ONLY add when PR is ready to merge/full CI is needed
#20202 opened Jun 27, 2025 by xuechendi Loading…
1 of 4 tasks
[CI] reducing image size ci/build
#20201 opened Jun 27, 2025 by aarnphm Loading…
[Do not merge] Add out of place layernorm performance Performance-related issues
#20197 opened Jun 27, 2025 by charlifu Loading…
[CI][Intel Gaudi][vllm-Plugin]Add CI for hpu-plugin-v1-test ci/build documentation Improvements or additions to documentation
#20196 opened Jun 27, 2025 by xuechendi Loading…
3 tasks
FlashInfer generated decode kernels.
#20194 opened Jun 27, 2025 by wenscarl Draft
4 tasks
Eepp frontend needs-rebase v1
#20191 opened Jun 27, 2025 by ruisearch42 Draft
4 tasks
[WIP] Run eagle with full cudagraph documentation Improvements or additions to documentation v1
#20190 opened Jun 27, 2025 by zixi-qi Draft
[Nixl] Heterogeneous TP support FlashInfer
#20189 opened Jun 27, 2025 by NickLucche Loading…
Enabled BnB NF4 inference on Gaudi
#20172 opened Jun 27, 2025 by rsshaik1 Loading…
[Feature]: Implement check_health for V1 v1
#20164 opened Jun 27, 2025 by limbaniharsh Loading…
1 of 3 tasks
Add pynccl all-gatherv and reducescatterv
#20154 opened Jun 26, 2025 by trevor-m Loading…
3 of 4 tasks
ProTip! no:milestone will show everything without a milestone.