forked from vllm-project/vllm
-
Notifications
You must be signed in to change notification settings - Fork 50
Pull requests: ROCm/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[ROCm][BugFix] Fix accuracy issue for AiterMLABackend for newest aiter main branch
#820
opened Nov 22, 2025 by
zhuyuhua-v
Loading…
[ROCm][MLA] Enable MLA persistent kernel with fp8 and bf16 support
#817
opened Nov 20, 2025 by
zejunchen-zejun
Loading…
[rocm]use aiter triton kernel as triton mha fallback path
#809
opened Nov 14, 2025 by
zhuyuhua-v
•
Draft
[DO NOT MERGE]Enable FP4 bmm for k_up_proj and v_up_proj in MLA
#797
opened Nov 7, 2025 by
ZhiweiYan-96
Loading…
5 tasks
[Triton] add a16w8 gemm for DS-R1 for o_proj for decode, add rocm_aiter_triton…
#788
opened Nov 4, 2025 by
k50112113
Loading…
[Triton] 355 wip Llama FP4 triton fusion + TP8 triton decode shape tunning
#783
opened Oct 31, 2025 by
k50112113
Loading…
add aiter fusion pattern for sequence parallel
#781
opened Oct 31, 2025 by
zhuyuhua-v
•
Draft
5 tasks
[ROCM] Llama4 VLLM_ROCM_USE_AITER_TRITON_FUSED_ROPE_ZEROS_KV_CACHE support
#763
opened Oct 24, 2025 by
tpopp
Loading…
[WIP] Support persistent MLA for ROCm MLA backend
#739
opened Oct 16, 2025 by
ganyi1996ppo
Loading…
5 tasks
[Perf] refactor attention backend for perf boost
#713
opened Sep 26, 2025 by
ganyi1996ppo
Loading…
5 tasks
[355_wip] Let dynamo capture rms/silu_mul+f4gemm pattern
#705
opened Sep 24, 2025 by
xytpai
Loading…
[ROCm] Add allreduce dispatcher for ROCm device
#704
opened Sep 24, 2025 by
zejunchen-zejun
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.