forked from vllm-project/vllm
-
Notifications
You must be signed in to change notification settings - Fork 49
Pull requests: ROCm/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Feat][aiter][ROCm] Add aiter rmsnorm and quant fusion
#735
opened Oct 13, 2025 by
kliuae-amd
Loading…
5 tasks
[moe](feat): fuse shared expert to moe ops
#734
opened Oct 13, 2025 by
PerryZhang01
Loading…
5 tasks
[355_wip] [triton] fuse bf16_gemm_reduce_kernel + rope_kv_cache
#730
opened Oct 9, 2025 by
k50112113
Loading…
[FEAT] Add support for AITER bpreshuffle block scale gemm
#717
opened Sep 27, 2025 by
tjtanaavllm
Loading…
5 tasks
[Perf] refactor attention backend for perf boost
#713
opened Sep 26, 2025 by
ganyi1996ppo
Loading…
5 tasks
[355_wip] Let dynamo capture rms/silu_mul+f4gemm pattern
#705
opened Sep 24, 2025 by
xytpai
Loading…
[ROCm] Add allreduce dispatcher for ROCm device
#704
opened Sep 24, 2025 by
zejunchen-zejun
Loading…
[ROCm] Add allreduce dispatcher for ROCm device
#695
opened Sep 18, 2025 by
zejunchen-zejun
Loading…
[ROCm] warpSize is being made non constexpr in ROCm 7.0 (#20330)
#694
opened Sep 18, 2025 by
xudonlyu
Loading…
[355_wip] Let inductor capture silu+mul+quant pattern and replace them with aiter operator
#669
opened Sep 11, 2025 by
xytpai
Loading…
support ck-tile fused bias gemm for rocm unquantized gemm
#668
opened Sep 11, 2025 by
eliotwang
Loading…
add fp8 gemm path choice for rocm_aiter_gemm_w8a8_blockscale
#659
opened Sep 8, 2025 by
zhuyuhua-v
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-09-13.