Skip to content

Pull requests: ROCm/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Added support for AITER JIT native splitkv kernel ci-level 3 CI test level 3
#631 opened Jun 16, 2026 by Micky774 Contributor Draft
13 tasks
gfx1250 mxfp8 gemm: add NN/NT transpose workaround ci-level 1 CI test level 1
#630 opened Jun 16, 2026 by matthiasdiener Contributor Draft
1 of 13 tasks
Hotfix for Maxtext regression with JAX 0.9 changes ci-level 2 CI test level 2
#629 opened Jun 16, 2026 by ipanfilo Collaborator Loading…
1 of 13 tasks
[WIP] Enable MultiCastTranspose for expert weights
#628 opened Jun 16, 2026 by sudhu2k Contributor Draft
1 of 13 tasks
gfx1250 mxfp8 gemm: loosen restrictions on K ci-level 1 CI test level 1
#627 opened Jun 16, 2026 by matthiasdiener Contributor Loading…
1 of 13 tasks
Add gfx1250 support to CK tile group GEMM ci-level 1 CI test level 1
#626 opened Jun 16, 2026 by aris134 Contributor Loading…
1 of 13 tasks
Add ROCm HIP small-seq fused attention via crossattn_hip_kernel
#625 opened Jun 15, 2026 by VeeraRajasekhar Contributor Loading…
13 tasks
[CI] Add resilience to artifacts fetch
#622 opened Jun 9, 2026 by leo-automation Collaborator Loading…
[FEAT] Microbenchmark add visualization
#620 opened Jun 8, 2026 by Micky774 Contributor Loading…
13 tasks
Refactored reduction kernels ci-level 3 CI test level 3
#618 opened Jun 8, 2026 by Micky774 Contributor Loading…
13 tasks
Ifu dev 260419 v2.15
#616 opened Jun 8, 2026 by VeeraRajasekhar Contributor Loading…
13 tasks
Incorporate statistical significance testing to benchmarks
#614 opened Jun 8, 2026 by Micky774 Contributor Loading…
13 tasks
Ipanfilo/ci test fixes ci-level 3 CI test level 3
#612 opened Jun 5, 2026 by ipanfilo Collaborator Draft
13 tasks
microbenchmarks: add kernel profiling option
#610 opened Jun 3, 2026 by matthiasdiener Contributor Loading…
1 of 13 tasks
enable blockwise FP8 quantization on rocm ci-level 1 CI test level 1
#609 opened Jun 3, 2026 by asdfvg123 Loading…
1 of 13 tasks
WIP Lightning Indexer + DSA/HCA API
#606 opened Jun 1, 2026 by Micky774 Contributor Draft
1 of 13 tasks
TE AITER gfx1250 integration WIP
#603 opened May 29, 2026 by Micky774 Contributor Draft
13 tasks
Update QoLA/AITER ci-level 3 CI test level 3
#599 opened May 28, 2026 by Micky774 Contributor Loading…
13 tasks
Mxfp8 grouped and multi quantize ci-level 3 CI test level 3
#598 opened May 27, 2026 by alextmagro Contributor Loading…
Bump CI retention days ci-level 1 CI test level 1
#591 opened May 20, 2026 by matthiasdiener Contributor Draft
1 of 13 tasks
add production GEMM tests ci-level 1 CI test level 1
#590 opened May 19, 2026 by matthiasdiener Contributor Loading…
1 of 13 tasks
Add custom multi_tensor_apply kernels (L2norm, Adam) ci-level 1 CI test level 1
#585 opened May 13, 2026 by matthiasdiener Contributor Loading…
1 of 13 tasks
Add Tealite: pure-Python TransformerEngine for ROCm/AMD GPUs
#581 opened May 7, 2026 by jayfurmanek Contributor Loading…
7 of 8 tasks
ProTip! What’s not been updated in a month: updated:<2026-05-16.