- Notifications
You must be signed in to change notification settings - Fork 588
Pull requests: flashinfer-ai/flashinfer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
refactor: Move mla code from decode.py to mla.py and add to documentation
#2163 opened Dec 3, 2025 by bkryu Loading…
5 tasks done
feat: MxInt4 x Bf16 TRT-LLM Gen MoE support
#2159 opened Dec 2, 2025 by nekorobov Loading…
5 tasks done
[Flashinfer-Bench integration] HF end-to-end inference
#2151 opened Nov 30, 2025 by sfc-gh-goliaro • Draft
5 tasks
A unified API for the MNNVL and single-node AllReduce kernels.
#2130 opened Nov 21, 2025 by nvmbreughe • Draft
5 tasks
[wip] feat: support variable sequence length in decode kernel of trtllm-gen attention
#2125 opened Nov 20, 2025 by yaoyaoding • Draft
5 tasks
perf: using multi-cta optimization for top-k/top-p
#2119 opened Nov 20, 2025 by yzh119 Loading…
4 of 5 tasks
refactor: update fa3 codebase and fix hopper unittest [part 1]
#2111 opened Nov 19, 2025 by yzh119 Loading…
5 tasks done
feat: support more head dim in RoPE kernel
#2109 opened Nov 19, 2025 by raayandhar Loading…
5 tasks done
Port TRT-LLM communication kernels to flashinfer
#2102 opened Nov 18, 2025 by djns99 Loading…
5 tasks
make DeepGEMM swapAB available for linear gemm SM90
#2101 opened Nov 17, 2025 by xuanzic Loading…
5 tasks
feat: BF16 GEMM using CUTLASS backend for SM100
#2070 opened Nov 10, 2025 by raayandhar Loading…
5 tasks done
Rebase FP8 SM100 Cutlass FMHA Attention to main (original PR#1238)
#2047 opened Nov 5, 2025 by pavanimajety • Draft
5 tasks
Refactor flashinfer/__init__.py so that applications could selectively pack submodules without modifying __init__.py
#2027 opened Nov 3, 2025 by bangshengtang Loading…
5 tasks done
refactor: backend_requirement + supported_compute_capability decorator for gemm
#2000 opened Oct 29, 2025 by jimmyzho Loading…
5 tasks
chore: agentic workflow for automatic version bump
#1947 opened Oct 19, 2025 by yzh119 Loading…
5 tasks
Previous Next
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.