Skip to content

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add optimised top-k kernel AIR.
#2890 opened Apr 16, 2026 by dcampora Loading…
8 of 13 tasks
[JAX] Fix grouped quant checkpointing
#2889 opened Apr 16, 2026 by jberchtold-nvidia Collaborator Loading…
8 of 13 tasks
[PyTorch] Minor optimizations in fused grouped MLP
#2888 opened Apr 15, 2026 by ksivaman Member Loading…
6 of 14 tasks
Add AI written qwen3_moe example
#2887 opened Apr 15, 2026 by skyw Loading…
4 of 13 tasks
[PyTorch] Add method for mcore to register wgrad accumulation hook
#2886 opened Apr 15, 2026 by ksivaman Member Loading…
7 of 13 tasks
Scaled Bias Add support after CUBLAS GGEMM
#2885 opened Apr 15, 2026 by vthumbe1503 Collaborator Loading…
13 tasks
[Debug] Add AutoswitchGEmm for Debug Precision Tool
#2883 opened Apr 15, 2026 by shangxiaokang Draft
3 of 13 tasks
SMEM offset caching RHT
#2882 opened Apr 15, 2026 by sraman-rgb Loading…
13 tasks
Check numerics in MXFP8 C++ tests by dequantizing to FP32 2.15.0 testing Improvements to tests or testing infrastructure
#2881 opened Apr 15, 2026 by timmoon10 Collaborator Loading…
6 of 13 tasks
fix(readme): update broken links and modernize project description
#2879 opened Apr 14, 2026 by sbhavani Collaborator Loading…
3 of 13 tasks
[PyTorch] Split TE ops op_forward into op_forward and setup_context
#2877 opened Apr 14, 2026 by pggPL Collaborator Draft
13 tasks
[DONOT MERGE] Wgrad cute dsl v2
#2872 opened Apr 13, 2026 by vthumbe1503 Collaborator Draft
13 tasks
Optimizations for MXFP8/NVFP4 dequantize kernels
#2865 opened Apr 10, 2026 by YigongQin Loading…
8 of 13 tasks
Adds GEMM Profiling Guide to TE
#2863 opened Apr 9, 2026 by jomitchellnv Contributor Loading…
7 tasks
[DO NOT MERGE] Test CI
#2862 opened Apr 9, 2026 by cyanguwa Collaborator Draft
13 tasks
Add cpplint and ruff linter to pre-commit and fix lint violations
#2853 opened Apr 8, 2026 by pstjohn Contributor Loading…
Bump transformers from 4.55.0 to 5.0.0rc3 in /docs/examples/te_gemma dependencies Pull requests that update a dependency file python Pull requests that update python code
#2851 opened Apr 8, 2026 by dependabot bot Loading…
Bump transformers from 4.57.0 to 5.0.0rc3 in /docs/examples/te_llama dependencies Pull requests that update a dependency file python Pull requests that update python code
#2850 opened Apr 8, 2026 by dependabot bot Loading…
Skip activation kernels when tensor size is zero bug Something isn't working
#2848 opened Apr 8, 2026 by timmoon10 Collaborator Loading…
8 of 13 tasks
[Common] Multicast Fixes
#2847 opened Apr 8, 2026 by phu0ngng Collaborator Draft
13 tasks
[Core] Report CUDA versions when NVRTC compilation fails enhancement New feature or request
#2842 opened Apr 7, 2026 by timmoon10 Collaborator Loading…
8 of 13 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.