-
Notifications
You must be signed in to change notification settings - Fork 295
Pull requests: ROCm/aiter
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
mla/gather_kv_b_proj: handle unquantized kv_b_proj weight (kv_proj_scale=None)
ci:all
#3006
opened May 2, 2026 by
kzjeef
Loading…
[Silo] Bulk merge: kernel fixes and features (SplitK, MoE fixes, Qwen3-Next, pa_mqa OOB)
#3005
opened May 1, 2026 by
sunway513
Collaborator
Loading…
3 tasks
Add HipKittens based nhead=32 MLA kernel on MI35x /
gfx950
#3003
opened May 1, 2026 by
hubertlu-tw
Contributor
Loading…
8 of 9 tasks
ci(nightly): fix wheel/image ABI mismatch + 0-test false-pass (run 25202894144)
#3002
opened May 1, 2026 by
sunway513
Collaborator
Loading…
Replace QH16 bf16 kernel with a new one that does not use ptr_RP
#2999
opened May 1, 2026 by
JohnNikolay84
Contributor
Loading…
1 task
mla: refuse page_size > 1 on bf16 decode-stage1 kernel (no _ps variant shipped)
#2997
opened May 1, 2026 by
kzjeef
Loading…
4 tasks done
[FLYDSL] Extend gfx1201 FA backend coverage to Wan2.2 TI2V-5B shapes (H=24, D=128)
#2990
opened May 1, 2026 by
sunway513
Collaborator
Loading…
4 of 6 tasks
ci: make Standard Tests resilient to PRs missing install_triton.sh
#2985
opened Apr 30, 2026 by
sunway513
Collaborator
Loading…
[TRITON] Split New feature or request
triton
test_mha.py into smaller test files
ci:triton-300x
ci:triton-355
enhancement
#2984
opened Apr 30, 2026 by
brunomazzottiamd
Contributor
Loading…
1 task done
[MLA] Fix nhead=32 non-persistent decode crash on gfx950
#2983
opened Apr 30, 2026 by
frida-andersson
Contributor
Loading…
[configs] Add MI355X tuned GEMM and FMoE configs for DeepSeek-V3.2
#2981
opened Apr 30, 2026 by
frida-andersson
Contributor
Loading…
CI: retry docker pulls in workflow image downloads
ci:all
#2977
opened Apr 30, 2026 by
gyohuangxin
Member
Loading…
3 tasks done
[Moe_sorting_opus] refactor
ci:all
#2974
opened Apr 30, 2026 by
amd-ruitang3
Contributor
Loading…
1 task
add swiglu a4w4 moe path for gpt-oss model
#2972
opened Apr 30, 2026 by
XiaobingSuper
Contributor
•
Draft
1 task
[FLYDSL] Add gfx1201 (RDNA4) flash_attn_func backend
#2969
opened Apr 29, 2026 by
sunway513
Collaborator
Loading…
[GFX1250] Add Triton TDM to MoE Metadata kernels
#2968
opened Apr 29, 2026 by
nsusanto
Contributor
Loading…
[TRITON] mHC: Apply post-stream and res-stream mixing
#2967
opened Apr 29, 2026 by
waqahmed-amd-fi
•
Draft
1 task
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.