Pull requests: flashinfer-ai/flashinfer

Pull requests list

Tiny fix bench tgv gemm
#2277 opened Dec 31, 2025 by vincentzed
5 tasks
feat: add GDN Attention
#2276 opened Dec 31, 2025 by guangyunh-nv
3 of 5 tasks
cicd: add a github workflow for xfails report script
#2273 opened Dec 30, 2025 by kahyunnam
5 tasks done
bugfix: skip CUTLASS kernel generation when AOT cache exists
#2248 opened Dec 19, 2025 by yongwww
3 of 5 tasks
fix: Handle zeros in Mistral Large 3 MoE inference
#2238 opened Dec 18, 2025 by dbari Draft
8 of 9 tasks
misc: support checks unit test tracking
#2224 opened Dec 16, 2025 by jimmyzho
5 tasks
refactor: update fa3 codebase [part 2]
#2192 opened Dec 9, 2025 by yzh119
4 of 5 tasks
Add CUDA graph buffers for persistent attention
#2185 opened Dec 7, 2025 by Edenzzzz
5 tasks
Fix/moe_sm110 (to be tested)
#2183 opened Dec 6, 2025 by aleozlx
4 of 5 tasks
Enable Hopper FA3 FP8 attention in decode.py
#2148 opened Nov 28, 2025 by nvpohanh
5 tasks done
feat: add sink to flashinfer decode
#2087 opened Nov 13, 2025 by djmmoss
feat: BF16 GEMM using CUTLASS backend for SM100
#2070 opened Nov 10, 2025 by raayandhar
5 tasks done
Blockwise GEMM with all reduce overlapping
#2007 opened Oct 30, 2025 by Amir-19 Draft
5 tasks
chore: agentic workflow for automatic version bump
#1947 opened Oct 19, 2025 by yzh119
5 tasks
add blockwise gemm cute dsl
#1922 opened Oct 13, 2025 by Amir-19
5 tasks
Sampling non contiguous
#1916 opened Oct 12, 2025 by zcin
5 tasks done