Skip to content

Pull requests: ROCm/aiter

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add HipKittens based nhead=32 MLA kernel on MI35x / gfx950
#3003 opened May 1, 2026 by hubertlu-tw Contributor Loading…
8 of 9 tasks
Remove sorting for fmoe
#3001 opened May 1, 2026 by JohnNikolay84 Contributor Draft
1 task
Replace QH16 bf16 kernel with a new one that does not use ptr_RP
#2999 opened May 1, 2026 by JohnNikolay84 Contributor Loading…
1 task
Dsv4 sparse indexer
#2998 opened May 1, 2026 by Oseltamivir Loading…
1 task done
add topk_softplus kernel
#2995 opened May 1, 2026 by yzhou103 Contributor Loading…
1 task
[Gluon]: Gluon kernel for mxfp4 quant
#2994 opened May 1, 2026 by NimitPtl Draft
1 task
[FLYDSL] Extend gfx1201 FA backend coverage to Wan2.2 TI2V-5B shapes (H=24, D=128)
#2990 opened May 1, 2026 by sunway513 Collaborator Loading…
4 of 6 tasks
ci: make Standard Tests resilient to PRs missing install_triton.sh
#2985 opened Apr 30, 2026 by sunway513 Collaborator Loading…
[TRITON] Split test_mha.py into smaller test files ci:triton-300x ci:triton-355 enhancement New feature or request triton
#2984 opened Apr 30, 2026 by brunomazzottiamd Contributor Loading…
1 task done
[MLA] Fix nhead=32 non-persistent decode crash on gfx950
#2983 opened Apr 30, 2026 by frida-andersson Contributor Loading…
Add MiniMax M25 FMoE tunings
#2982 opened Apr 30, 2026 by akii96 Contributor Draft
Add MiniMax M25 A8W8 blockscale GEMM tunings
#2979 opened Apr 30, 2026 by akii96 Contributor Draft
CI: retry docker pulls in workflow image downloads ci:all
#2977 opened Apr 30, 2026 by gyohuangxin Member Loading…
3 tasks done
mxfp4 quantize kernel
#2976 opened Apr 30, 2026 by amd-yilizhao Contributor Loading…
[Moe_sorting_opus] refactor ci:all
#2974 opened Apr 30, 2026 by amd-ruitang3 Contributor Loading…
1 task
add swiglu a4w4 moe path for gpt-oss model
#2972 opened Apr 30, 2026 by XiaobingSuper Contributor Draft
1 task
[Triton] [gfx1250] GEMM A16W16 Kernel
#2971 opened Apr 29, 2026 by azaidy Contributor Draft
[FLYDSL] Add gfx1201 (RDNA4) flash_attn_func backend
#2969 opened Apr 29, 2026 by sunway513 Collaborator Loading…
[GFX1250] Add Triton TDM to MoE Metadata kernels
#2968 opened Apr 29, 2026 by nsusanto Contributor Loading…
ProTip! Filter pull requests by the default branch with base:main.