-
Notifications
You must be signed in to change notification settings - Fork 621
Pull requests: flashinfer-ai/flashinfer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP] Refactor: simplify torch -> cute-dsl boilerplate and enable tvm-ffi for cute-dsl kernels
#2279
opened Jan 1, 2026 by
yzh119
Loading…
5 tasks
cicd: add a github workflow for xfails report script
#2273
opened Dec 30, 2025 by
kahyunnam
Loading…
5 tasks done
[TRTLLM-Gen Fmha] add optimized trtllm-gen decode kernels for high throughput + speculative decoding
#2265
opened Dec 24, 2025 by
PerkzZheng
Loading…
5 tasks done
feat: Add FP8/NVFP4 quant fusion for MNNVL Allreduce
#2263
opened Dec 24, 2025 by
timlee0212
•
Draft
5 tasks
chore: add __all__ exports to Python modules and document missing APIs
#2251
opened Dec 20, 2025 by
yzh119
Loading…
5 tasks
bugfix: skip CUTLASS kernel generation when AOT cache exists
#2248
opened Dec 19, 2025 by
yongwww
Loading…
3 of 5 tasks
refactor: pull trtllm-gen batch-gemm/gemm headers from artifactory; update tma descriptor shape init
#2235
opened Dec 17, 2025 by
jimmyzho
Loading…
5 tasks
Fix: Add mask_indptr conversion in BatchPrefillWithPagedKVCacheWrapper.plan()
#2201
opened Dec 11, 2025 by
Dutch-voyage
Loading…
5 tasks
Add CUDA graph buffers for persistent attention
#2185
opened Dec 7, 2025 by
Edenzzzz
Loading…
5 tasks
[Flashinfer-Bench integration] HF end-to-end inference
#2151
opened Nov 30, 2025 by
sfc-gh-goliaro
•
Draft
5 tasks
Enable Hopper FA3 FP8 attention in decode.py
#2148
opened Nov 28, 2025 by
nvpohanh
Loading…
5 tasks done
feat: BF16 GEMM using CUTLASS backend for SM100
#2070
opened Nov 10, 2025 by
raayandhar
Loading…
5 tasks done
Refactor flashinfer/__init__.py so that applications could selectively pack submodules without modifying __init__.py
#2027
opened Nov 3, 2025 by
bangshengtang
Loading…
5 tasks done
chore: agentic workflow for automatic version bump
#1947
opened Oct 19, 2025 by
yzh119
Loading…
5 tasks
Previous Next
ProTip!
Updated in the last three days: updated:>2025-12-29.