- Notifications
You must be signed in to change notification settings - Fork 565
Pull requests: NVIDIA/TransformerEngine
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Pytorch][Bug]MXFP8 Split tensor Bug fix
#2427 opened Nov 26, 2025 by vthumbe1503 Loading…
2 of 13 tasks
[PyTorch] Convert sample tuple to list in cudagraph input reuse
#2426 opened Nov 26, 2025 by buptzyb Loading…
13 tasks
[JAX] Add tutorial for integrating TE/JAX quantization into an existing framework
#2423 opened Nov 26, 2025 by jberchtold-nvidia Loading…
8 of 13 tasks
[Common] Persistent NVFP4 cast + transpose kernel 2.11.0
#2412 opened Nov 21, 2025 by Oleg-Goncharov Loading…
6 of 13 tasks
[PyTorch][NVFP4][MOE] NVFP4 Grouped Quantize with Hadamard Transform MoE
#2411 opened Nov 21, 2025 by zhongbozhu Loading…
2 of 16 tasks
[Common] NVTEGroupedTensor class and helpers MoE
#2388 opened Nov 14, 2025 by phu0ngng Loading…
7 of 13 tasks
[JAX] Re-use RHT matrix constant
#2386 opened Nov 14, 2025 by jberchtold-nvidia • Draft
8 of 13 tasks
Set RPATH for cuda libraries from python package
#2381 opened Nov 14, 2025 by take-cheeze • Draft
4 of 13 tasks
[JAX] Add CP + THD + AG + Striped>1 + SWA support
#2379 opened Nov 13, 2025 by KshitijLakhani Loading…
8 of 13 tasks
[JAX] cuBlasMp integration for CollectiveGemm custom op
#2361 opened Nov 7, 2025 by denera Loading…
5 of 13 tasks
Add device-Initiated Grouped GEMM supporting m_splits on device MoE
#2360 opened Nov 7, 2025 by QiZhangNV Loading…
1 of 13 tasks
[Core] Fix inconsistent logic in C++ tensor class
#2330 opened Nov 1, 2025 by timmoon10 Loading…
7 of 13 tasks
[Common] Added an optimized gated rowwise MXFP8 SwiGLU kernel
#2328 opened Oct 31, 2025 by Oleg-Goncharov Loading…
5 of 13 tasks
[Pytorch] change fused cross entropy backward grad to fp32 and reduce one read/…
#2325 opened Oct 31, 2025 by RandMist Loading…
8 of 13 tasks
Previous Next
ProTip! Updated in the last three days: updated:>2025-11-27.