
[Triton] [Gluon] [GFX12] add UA3D gluon kernel for gfx12 #2472

Open
k50112113 wants to merge 30 commits into main from shaoclee/ua3d-gfx12


@k50112113
Contributor

Add Unified Attention 3D (UA3D) Gluon kernels for gfx12, including a unit test (UT) and a benchmark.
Add a simple copy kernel for benchmarking at aiter/op_tests/op_benchmarks/triton/bench_cache_copy.py. This kernel simulates copying the KV cache in order to measure the maximum memory bandwidth of a machine.
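
As a rough illustration of what a copy benchmark of this kind reports, the sketch below converts a buffer size and a measured copy time into an effective bandwidth figure. The function name and the convention of counting one read plus one write per byte are assumptions made for illustration; the script in this PR may compute its numbers differently.

```python
def copy_bandwidth_gbps(num_bytes: int, elapsed_s: float) -> float:
    """Effective bandwidth of a copy in GB/s (hypothetical helper).

    Assumes every byte is read once from the source and written once
    to the destination, so the traffic is twice the buffer size.
    """
    if elapsed_s <= 0:
        raise ValueError("elapsed_s must be positive")
    bytes_moved = 2 * num_bytes  # one read + one write per byte
    return bytes_moved / elapsed_s / 1e9
```

Comparing this figure against the device's advertised peak memory bandwidth gives a sense of how close a memory-bound kernel, such as a KV cache copy, can get to the hardware limit.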

@k50112113 k50112113 requested a review from a team March 25, 2026 22:38
@github-actions
Contributor

🏷️ CI Guide

Runs automatically on every PR:

  • ✅ Pre-checks (submodule verification, code formatting)
  • ✅ Aiter op tests (gfx942 + gfx950)
  • ✅ Triton tests (only when aiter/ops/triton/** or related paths are changed)

Extended tests (opt-in via labels):

| Label | Tests |
| --- | --- |
| ci:triton-355 | Run Triton tests on MI355 in addition to MI325 |
| ci:sglang | SGLang integration tests |
| ci:atom | ATOM benchmark (DeepSeek-R1 + GPT-OSS) |
| ci:vllm | vLLM benchmark |
| ci:all | All of the above |

Add labels via the sidebar or `gh pr edit 2472 --add-label <label>`

@k50112113 k50112113 marked this pull request as draft March 26, 2026 21:51
@k50112113 k50112113 force-pushed the shaoclee/ua3d-gfx12 branch from c34eaee to eb4a310 Compare March 26, 2026 22:03
@k50112113 k50112113 marked this pull request as ready for review April 8, 2026 22:39
@brunomazzottiamd
Contributor

Adding ci:triton-355 label to prevent new failures on gfx950.

    from aiter.ops.triton.gluon.unified_attention_3d import (
        _unified_attention_gluon_kernel_3d,
    )
except:
⚠️ [ruff] <E722> reported by reviewdog 🐶
Do not use bare except
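
One way to satisfy E722 here is to catch only the exception that a failed import raises. The sketch below is a minimal illustration, not the change made in this PR: the helper name `optional_import` and the choice to fall back to `None` are assumptions, since the body of the original `except` block is not shown in the diff.

```python
import importlib


def optional_import(module_name, attr_name):
    """Return module_name.attr_name, or None if the module cannot be imported.

    Hypothetical helper for illustration only.
    """
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # Only import failures are swallowed here. KeyboardInterrupt,
        # SystemExit, and unrelated errors still propagate, which a
        # bare `except:` would have hidden.
        return None
    return getattr(module, attr_name)


# Mirrors the import in the diff above; evaluates to None when the
# module is unavailable (for example, on an unsupported setup).
_unified_attention_gluon_kernel_3d = optional_import(
    "aiter.ops.triton.gluon.unified_attention_3d",
    "_unified_attention_gluon_kernel_3d",
)
```

The same effect can be had inline by changing `except:` to `except ImportError:` at the original call site; the helper form is only worthwhile if several optional kernels are imported this way.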

2 participants