Pull requests: flashinfer-ai/flashinfer

Pull requests list

Add data type check for deepseek fp4 moe
#2165 opened Dec 3, 2025 by samuellees
5 tasks
feat: MxInt4 x Bf16 TRT-LLM Gen MoE support
#2159 opened Dec 2, 2025 by nekorobov
5 tasks done
fix xqa mha_sm90.cu
#2157 opened Dec 2, 2025 by qsang-nv
5 tasks
enable sm103 moe dsl backend
#2149 opened Nov 28, 2025 by aleozlx
5 tasks done
Enable Hopper FA3 FP8 attention
#2148 opened Nov 28, 2025 by nvpohanh Draft
5 tasks
perf: using multi-cta optimization for top-k/top-p
#2119 opened Nov 20, 2025 by yzh119
4 of 5 tasks
Refactor trtllm_mnnvl_allreduce
#2118 opened Nov 20, 2025 by timlee0212
5 tasks done
refactor: update fa3 codebase and fix hopper unittest [part 1]
#2111 opened Nov 19, 2025 by yzh119
5 tasks done
feat: support more head dim in RoPE kernel
#2109 opened Nov 19, 2025 by raayandhar
5 tasks done
Port TRT-LLM communication kernels to flashinfer
#2102 opened Nov 18, 2025 by djns99
5 tasks
make DeepGEMM swapAB available for linear gemm SM90
#2101 opened Nov 17, 2025 by xuanzic
5 tasks
feat: add sink to flashinfer decode
#2087 opened Nov 13, 2025 by djmmoss
feat: BF16 GEMM using CUTLASS backend for SM100
#2070 opened Nov 10, 2025 by raayandhar
5 tasks done
Blockwise GEMM with all reduce overlapping
#2007 opened Oct 30, 2025 by Amir-19 Draft
5 tasks
chore: agentic workflow for automatic version bump
#1947 opened Oct 19, 2025 by yzh119
5 tasks
add blockwise gemm cute dsl
#1922 opened Oct 13, 2025 by Amir-19
5 tasks