Skip to content

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Remove integration test for Lightning-Thunder testing Improvements to tests or testing infrastructure
#2822 opened Apr 1, 2026 by timmoon10 Loading…
8 of 14 tasks
Fix fused router for large top-K and expert counts
#2821 opened Apr 1, 2026 by harryzhou2000 Loading…
7 of 13 tasks
Refactor Amax Kernel ldmatrix loads, TMA/compute barriers, swizzle_idx
#2820 opened Apr 1, 2026 by cael-ling Loading…
6 of 13 tasks
[PyTorch] Fix bug with PR 2677
#2819 opened Mar 31, 2026 by sudhakarsingh27 Loading…
1 of 13 tasks
Comm gemm fixes
#2818 opened Mar 31, 2026 by almogsegal Loading…
13 tasks
[Pytorch][Common] Hybrid quantization
#2817 opened Mar 31, 2026 by negvet Loading…
1 of 13 tasks
Fix nvshmem build
#2815 opened Mar 31, 2026 by GaetanLepage Loading…
2 of 13 tasks
Pass input_output_alias to TritonAutotunedKernelCall
#2814 opened Mar 31, 2026 by tdophung Loading…
5 of 13 tasks
Streamline group Hadamard ComputeKernel loads
#2810 opened Mar 29, 2026 by cael-ling Loading…
5 of 13 tasks
Single __syncthreads per stage in GroupHadamardAmaxTmaKernel
#2809 opened Mar 29, 2026 by cael-ling Loading…
8 of 13 tasks
Precomputed swizzle_idx into group Hadamard ComputeKernel
#2808 opened Mar 29, 2026 by cael-ling Loading…
8 of 13 tasks
[PyTorch][Flash Attn] Add fallback import for FA3
#2806 opened Mar 26, 2026 by eattia-nvidia Loading…
7 of 13 tasks
[PyT] Fix FSDP2 memory leaks for FP8 weight workspaces and transpose caches
#2805 opened Mar 26, 2026 by pstjohn Loading…
3 tasks done
2
3
[PyT][Test] Add xfailing FSDP2 memory leak detection tests
#2803 opened Mar 25, 2026 by pstjohn Loading…
4 tasks done
[JAX] Warmup FFIs with "initialize" stage
#2800 opened Mar 25, 2026 by jberchtold-nvidia Loading…
1 of 13 tasks
adds NVFP4 Fused Adam support
#2797 opened Mar 24, 2026 by jomitchellnv Loading…
2 of 13 tasks
[JAX] TE GMM v2 enforcement Env Var
#2794 opened Mar 23, 2026 by jberchtold-nvidia Draft
13 tasks
Avoid CPU offload wait_event for validation
#2793 opened Mar 23, 2026 by vasunvidia Loading…
13 tasks
ProTip! Add no:assignee to see everything that’s not assigned.