Skip to content

Pull requests: flashinfer-ai/flashinfer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Removes MPI dependency from MNNVL AllReduce
#1379 opened Aug 4, 2025 by pranavm-nvidia Loading…
5 tasks
Unify and modularize decode and prefill test.
#1375 opened Aug 4, 2025 by weireweire Loading…
5 tasks
feat: Support sliding window for persistent kernel
#1368 opened Aug 3, 2025 by Edenzzzz Loading…
5 tasks
[Draft] Update autotune results
#1361 opened Jul 31, 2025 by kaixih Loading…
feat: Fused rope fp8 quantize kernel for MLA
#1339 opened Jul 28, 2025 by yzh119 Loading…
5 tasks
[WIP]: Masked layout fp4 gemm using cute-dsl
#1331 opened Jul 25, 2025 by yzh119 Draft
5 tasks
refactor: Improved metainfo for trtllm-gen kernels
#1328 opened Jul 25, 2025 by cyx-6 Loading…
5 tasks
Add moe benchmark routine
#1327 opened Jul 25, 2025 by aleozlx Draft
3 of 5 tasks
Add k_scale and v_scale to persistent attention
#1322 opened Jul 24, 2025 by Edenzzzz Loading…
5 tasks
Wrap cudnn backend to unified interface
#1312 opened Jul 23, 2025 by cyx-6 Loading…
5 tasks
Api regression test for trtllmgen fp8 moe
#1308 opened Jul 23, 2025 by aleozlx Loading…
5 tasks done
fix: a workaround to make fp8 kv-cache work for prefill
#1304 opened Jul 22, 2025 by chenyang78 Loading…
2 tasks
3rparty: upgrade cutlass dependency to v4.1.0
#1299 opened Jul 22, 2025 by yzh119 Loading…
5 tasks
ci: add github actions to upload sdist to pypi
#1270 opened Jul 16, 2025 by yzh119 Loading…
5 tasks
feat(aot): add nvshmem module for aot compilation
#1261 opened Jul 15, 2025 by EmilienM Loading…
3 of 5 tasks
Mnnvl memory with custom communicator
#1245 opened Jul 14, 2025 by wenscarl Draft
5 tasks
feat: expose python APIs for cutlass blackwell fmha fp8 kernels
#1238 opened Jul 10, 2025 by yzh119 Loading…
5 tasks
feat: Restore convenience FLASHINFER_ENABLE_AOT option
#1235 opened Jul 8, 2025 by mgorny Loading…
3 of 5 tasks
[Feature] Support batch prefill for POD Attention
#1231 opened Jul 8, 2025 by Edenzzzz Loading…
7 tasks
Add ruff to pre-commit
#1201 opened Jul 1, 2025 by cyx-6 Draft
5 tasks
ProTip! no:milestone will show everything without a milestone.