Draft
Conversation
Contributor
🏷️ CI GuideRuns automatically on every PR:
Extended tests (opt-in via labels):
|
| @@ -0,0 +1,857 @@ | |||
| # SPDX-License-Identifier: MIT | |||
| # Copyright (C) 2024-2026, Advanced Micro Devices, Inc. All rights reserved. | |||
| s | |||
Contributor
| # SPDX-License-Identifier: MIT | ||
| # Copyright (C) 2024-2026, Advanced Micro Devices, Inc. All rights reserved. | ||
| s | ||
| import torch |
Contributor
| # Copyright (C) 2024-2026, Advanced Micro Devices, Inc. All rights reserved. | ||
| s | ||
| import torch | ||
| import flydsl.compiler as flyc |
Contributor
| s | ||
| import torch | ||
| import flydsl.compiler as flyc | ||
| import flydsl.expr as fx |
Contributor
| import torch | ||
| import flydsl.compiler as flyc | ||
| import flydsl.expr as fx | ||
| from flydsl._mlir import ir |
Contributor
| from flydsl.compiler.kernel_function import CompilationContext | ||
| from flydsl.expr import arith, buffer_ops, gpu, range_constexpr, rocdl, tdm_ops, vector | ||
| from flydsl.expr.arith import _to_raw as _raw | ||
| from flydsl.expr.typing import T |
Contributor
| from flydsl.expr import arith, buffer_ops, gpu, range_constexpr, rocdl, tdm_ops, vector | ||
| from flydsl.expr.arith import _to_raw as _raw | ||
| from flydsl.expr.typing import T | ||
| from flydsl.runtime.device import get_rocm_arch as get_hip_arch |
Contributor
| from flydsl.expr.arith import _to_raw as _raw | ||
| from flydsl.expr.typing import T | ||
| from flydsl.runtime.device import get_rocm_arch as get_hip_arch | ||
| from flydsl.utils.smem_allocator import SmemAllocator, SmemPtr, get_op_result_or_value |
Contributor
| from flydsl.expr.typing import T | ||
| from flydsl.runtime.device import get_rocm_arch as get_hip_arch | ||
| from flydsl.utils.smem_allocator import SmemAllocator, SmemPtr, get_op_result_or_value | ||
| from flydsl.expr import idx2crd |
Contributor
| from flydsl.runtime.device import get_rocm_arch as get_hip_arch | ||
| from flydsl.utils.smem_allocator import SmemAllocator, SmemPtr, get_op_result_or_value | ||
| from flydsl.expr import idx2crd | ||
| from typing import Optional |
Contributor
Collaborator
Contributor
Author
|
@coderfeli Its only been tested on AM workflows as of now, I will further optimize the code and test it on hardware soon. Added FlyDSL PR to the description |
1 task
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Motivation
To write the gemm for gfx1250 in flydsl (WIP)
FlyDsl Pr: ROCm/FlyDSL#401
Test Plan
Unit tests
Test Result
All tests pass
Submission Checklist