-
Notifications
You must be signed in to change notification settings - Fork 281
Pull requests: ROCm/aiter
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add glm5 70k 300 triton a8w8 blockscale configs
#2743
opened Apr 14, 2026 by
amd-pedghazi
Loading…
1 task done
Hoist introspection out of per-call ctype dispatch
#2742
opened Apr 14, 2026 by
gronsti-amd
Loading…
1 task done
gather support qk_nope_head_dim != v_head_dim
#2739
opened Apr 14, 2026 by
jiayyu
Contributor
Loading…
1 task
Revert max_size from 1GB to 128MB to fix KV cache regression
#2737
opened Apr 14, 2026 by
AMD-yanfeiwang
Contributor
•
Draft
1 task
feat: add/retune BF16 GEMM configs with FlyDSL backend for 6 models
#2733
opened Apr 14, 2026 by
sunway513
Collaborator
Loading…
2 tasks
Make FlyDSL LDS checks architecture-aware and reduce tuner failure noise
#2732
opened Apr 14, 2026 by
yzhou103
Contributor
Loading…
1 task
introduce g1u0 smoothquant int8 fused moe : fused_moe_gelu_sqi8
#2730
opened Apr 14, 2026 by
tingqli
Loading…
1 task
Add bf16 MLA decode kernel for gqa_ratio=64, qseqlen=1 (non-persistent)
#2729
opened Apr 14, 2026 by
fangche123
Contributor
Loading…
MI350 mla ps mode suppport nhead128,1 128,2 128,3 128,4 64,4 64,2 32,4 through kernel hsa/gfx950/mla/mla_a16w16_qh32_qseqlen4_gqaratio32_ps.co
#2727
opened Apr 14, 2026 by
minmengdie
Contributor
Loading…
1 task
[TRITON] Add unified attention support to bench_models
enhancement
New feature or request
triton
#2724
opened Apr 13, 2026 by
lucas-santos-amd
Contributor
Loading…
1 task
Fix Triton MoE GEMM shared memory exhaustion by reducing stage count
bug
Something isn't working
ci:triton-355
triton
#2723
opened Apr 13, 2026 by
nidal567
Contributor
Loading…
1 task done
docs: comprehensive documentation overhaul
#2706
opened Apr 12, 2026 by
sunway513
Collaborator
Loading…
4 tasks
feat: add Gemma4 31B support (ProportionalRotaryEmbedding, rmsnorm dtype)
#2705
opened Apr 12, 2026 by
ClementLinCF
Collaborator
Loading…
1 task done
Update quant.pyfix: add pack_dim to per_1x32_f4_quant for tl.dot_scaled RHS compatibility
#2704
opened Apr 12, 2026 by
GeisYaO
Loading…
fea(car): support custom group device
#2703
opened Apr 12, 2026 by
TennyWang1223
Contributor
Loading…
1 task
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.