- Notifications
You must be signed in to change notification settings - Fork 672
Pull requests: NVIDIA/TransformerEngine
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[JAX] Add warning if using BSHD and max_segments_per_seq > 1
#2796 opened Mar 24, 2026 by jberchtold-nvidia Loading…
8 of 13 tasks
If model parameters are DTensors, optimizer states should also be DTensors.
#2795 opened Mar 24, 2026 by cspades Loading…
1 of 13 tasks
Avoid CPU offload wait_event for validation
#2793 opened Mar 23, 2026 by vasunvidia Loading…
13 tasks
[PyTorch] [torch.compile] Remove module reference from autograd function args
#2791 opened Mar 23, 2026 by pggPL Loading…
8 of 13 tasks
Optimize fp8 block scaling Allgather for FSDP2
#2789 opened Mar 23, 2026 by vthumbe1503 Loading…
1 of 13 tasks
[Common][JAX] Add CUB TopK MaxPairs interface
#2784 opened Mar 20, 2026 by huanghua1994 Loading…
8 of 13 tasks
Optimize naive top-k masking in fused router
#2783 opened Mar 19, 2026 by yosh20004 Loading…
3 of 13 tasks
Fused Adam Support for MXFP8 + FSDP2 integration
#2780 opened Mar 18, 2026 by vthumbe1503 • Draft
13 tasks
[fused_router][pytorch] Optimize naive topk path and add perf benchmark
#2776 opened Mar 18, 2026 by XiaomingFun233 Loading…
add mark_not_offload() interface for cpu_offload_v1
#2770 opened Mar 17, 2026 by lhb8125 Loading…
13 tasks
GEMM + Swiglu fused Grouped MLP for MXFP8 2.14.0 MoE
#2769 opened Mar 17, 2026 by ksivaman Loading…
13 tasks
[Draft]Support for score_mod and score_mod_bprop in cuDNN's sdpa
#2767 opened Mar 16, 2026 by vcherepanov-nv Loading…
2 of 13 tasks
[PyTorch] transformer_engine.pytorch.autocast suport inside torch.compile
#2759 opened Mar 13, 2026 by pggPL Loading…
4 of 26 tasks
[JAX] Grouped GEMM Refactor to use first_dims and last_dims
#2749 opened Mar 10, 2026 by jberchtold-nvidia Loading…
1 of 13 tasks
[Common] Persistent Grouped NVFP4 quantization kernel
#2743 opened Mar 6, 2026 by Oleg-Goncharov • Draft
8 of 13 tasks
[Common] Persistent Grouped MXFP8 quantization kernel enhancement New feature or request MoE
#2738 opened Mar 5, 2026 by Oleg-Goncharov Loading…
9 of 13 tasks
Previous Next
ProTip! Mix and match filters to narrow down what you’re looking for.