[MLIR][XeGPU] fix load/store/prefetch op offset verifier #166137

tkarna · 2025-11-03T09:03:04Z

The verifier of the xegpu.{load/store/prefetch}_nd ops fails if the offsets are a mix of static and dynamic values, e.g. offset = [0, %c0]. In this case the length of the dynamic offsets is 1, so the check offsetSize != tDescRank (=2) triggers and the op is rejected. Instead, we should check the length of getMixedOffsets().
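To make the failure mode concrete, here is a minimal standalone sketch of the two checks. It is a simplified model, not MLIR code: `OffsetsSketch`, `kDynamicSketch`, `mixedOffsetCount`, `oldCheckAccepts` and `newCheckAccepts` are hypothetical names introduced for illustration, and dynamic SSA values are represented by plain integer IDs. It assumes the static offset array carries one entry per dimension, with a sentinel in each position that is filled by a dynamic operand.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>
#include <vector>

// Hypothetical sentinel meaning "this position holds a dynamic value".
// It stands in for the marker used in the op's static offset array.
constexpr int64_t kDynamicSketch = std::numeric_limits<int64_t>::min();

// Simplified model of an op's offsets (assumption, not the real op class):
// a static array with one entry per dimension, plus the dynamic operands.
struct OffsetsSketch {
  std::vector<int64_t> constOffsets; // e.g. {kDynamicSketch, 0} for [%x, 0]
  std::vector<int> dynamicOffsets;   // placeholder IDs for SSA values
};

// Length of the merged ("mixed") offset list: one entry per dimension,
// regardless of whether each entry is static or dynamic.
int64_t mixedOffsetCount(const OffsetsSketch &o) {
  int64_t numDynamicSlots = 0;
  for (int64_t v : o.constOffsets)
    if (v == kDynamicSketch)
      ++numDynamicSlots;
  // Every sentinel slot must be backed by exactly one dynamic operand.
  assert(numDynamicSlots == static_cast<int64_t>(o.dynamicOffsets.size()));
  return static_cast<int64_t>(o.constOffsets.size());
}

// Model of the check before this PR: dynamic and static counts are each
// compared against the rank separately. Returns true if the op is accepted.
bool oldCheckAccepts(const OffsetsSketch &o, int64_t tDescRank) {
  int64_t offsetSize = static_cast<int64_t>(o.dynamicOffsets.size());
  int64_t constOffsetSize = static_cast<int64_t>(o.constOffsets.size());
  return !((offsetSize != 0 && offsetSize != tDescRank) ||
           (constOffsetSize != 0 && constOffsetSize != tDescRank));
}

// Model of the check after this PR: only the merged length is compared.
bool newCheckAccepts(const OffsetsSketch &o, int64_t tDescRank) {
  int64_t offsetSize = mixedOffsetCount(o);
  return !(offsetSize != 0 && offsetSize != tDescRank);
}
```

In this model, a rank-2 descriptor with offsets [%x, 0] has one dynamic operand and two static slots, so the old check rejects it (1 != 2) while the new check accepts it (2 == 2). All-static, all-dynamic and empty offset lists are treated the same by both checks.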

llvmbot · 2025-11-03T09:03:36Z

@llvm/pr-subscribers-mlir

@llvm/pr-subscribers-mlir-gpu

Author: Tuomas Kärnä (tkarna)

Changes

The verifier of the xegpu.{load/store/prefetch}_nd ops fails if the offsets are a mix of static and dynamic values, e.g. offset = [0, %c0]. In this case the length of the dynamic offsets is 1, so the check offsetSize != tDescRank (=2) triggers and the op is rejected. Instead, we should check the length of getMixedOffsets().

Full diff: https://github.com/llvm/llvm-project/pull/166137.diff

2 Files Affected:

(modified) mlir/lib/Dialect/XeGPU/IR/XeGPUOps.cpp (+6-15)
(modified) mlir/test/Dialect/XeGPU/ops.mlir (+9)

diff --git a/mlir/lib/Dialect/XeGPU/IR/XeGPUOps.cpp b/mlir/lib/Dialect/XeGPU/IR/XeGPUOps.cpp
index abd12e2e69ac0..0bfc8a12ea552 100644
--- a/mlir/lib/Dialect/XeGPU/IR/XeGPUOps.cpp
+++ b/mlir/lib/Dialect/XeGPU/IR/XeGPUOps.cpp
@@ -472,11 +472,8 @@ LogicalResult PrefetchNdOp::verify() {
     return emitOpError("invalid l3_hint: ") << getL3HintAttr();
 
   int64_t tDescRank = tdescTy.getRank();
-  int64_t offsetSize = static_cast<int64_t>(getOffsets().size());
-  int64_t constOffsetSize =
-      getConstOffsetsAttr() ? getConstOffsetsAttr().size() : 0;
-  if (((offsetSize != 0) && (offsetSize != tDescRank)) ||
-      ((constOffsetSize != 0) && (constOffsetSize != tDescRank)))
+  int64_t offsetSize = getMixedOffsets().size();
+  if (offsetSize != 0 && offsetSize != tDescRank)
     return emitOpError(
         "Mismatched ranks between offsets and tensor descriptor");
 
@@ -597,11 +594,8 @@ LogicalResult LoadNdOp::verify() {
            << tdescTy;
 
   int64_t tDescRank = tdescTy.getRank();
-  int64_t offsetSize = static_cast<int64_t>(getOffsets().size());
-  int64_t constOffsetSize =
-      getConstOffsetsAttr() ? getConstOffsetsAttr().size() : 0;
-  if (((offsetSize != 0) && (offsetSize != tDescRank)) ||
-      ((constOffsetSize != 0) && (constOffsetSize != tDescRank)))
+  int64_t offsetSize = getMixedOffsets().size();
+  if (offsetSize != 0 && offsetSize != tDescRank)
     return emitOpError(
         "Mismatched ranks between offsets and tensor descriptor");
 
@@ -691,11 +685,8 @@ LogicalResult StoreNdOp::verify() {
            << dstTy;
 
   int64_t tDescRank = dstTy.getRank();
-  int64_t offsetSize = static_cast<int64_t>(getOffsets().size());
-  int64_t constOffsetSize =
-      getConstOffsetsAttr() ? getConstOffsetsAttr().size() : 0;
-  if (((offsetSize != 0) && (offsetSize != tDescRank)) ||
-      ((constOffsetSize != 0) && (constOffsetSize != tDescRank)))
+  int64_t offsetSize = getMixedOffsets().size();
+  if (offsetSize != 0 && offsetSize != tDescRank)
     return emitOpError(
         "Mismatched ranks between offsets and tensor descriptor");
 
diff --git a/mlir/test/Dialect/XeGPU/ops.mlir b/mlir/test/Dialect/XeGPU/ops.mlir
index 0a10f6814ae96..9b3829664108d 100644
--- a/mlir/test/Dialect/XeGPU/ops.mlir
+++ b/mlir/test/Dialect/XeGPU/ops.mlir
@@ -278,6 +278,15 @@ gpu.func @subgroup_load_nd_offset_1(%src: memref<24x32xf32>, %x : index, %y : in
   gpu.return
 }
 
+// CHECK: func @subgroup_load_nd_offset_2(%[[arg0:.*]]: memref<24x32xf32>, %arg1: index) {
+gpu.func @subgroup_load_nd_offset_2(%src: memref<24x32xf32>, %x : index) {
+  // CHECK: %[[R0:.*]] = xegpu.create_nd_tdesc %arg0 : memref<24x32xf32> -> !xegpu.tensor_desc<16x8xf32>
+  %1 = xegpu.create_nd_tdesc %src : memref<24x32xf32> -> !xegpu.tensor_desc<16x8xf32>
+  // CHECK: %[[R1:.*]] = xegpu.load_nd %[[R0]][%arg1, 0] <{l1_hint = #xegpu.cache_hint<cached>, l2_hint = #xegpu.cache_hint<uncached>, transpose = array<i64: 1, 0>}> : !xegpu.tensor_desc<16x8xf32> -> vector<8x16xf32>
+  %2 = xegpu.load_nd %1[%x, 0] <{l1_hint = #xegpu.cache_hint<cached>, l2_hint = #xegpu.cache_hint<uncached>, transpose = array<i64: 1, 0>}> : !xegpu.tensor_desc<16x8xf32> -> vector<8x16xf32>
+  gpu.return
+}
+
 // CHECK: func @simt_load_nd_8(%[[arg0:.*]]: memref<24x32xf32>) {
 gpu.func @simt_load_nd_8(%src: memref<24x32xf32>) {
   // CHECK: %[[R0:.*]] = xegpu.create_nd_tdesc %arg0[0, 0] : memref<24x32xf32> -> !xegpu.tensor_desc<16x8xf32>

tkarna · 2025-11-03T09:03:40Z

FYI @Garra1980 @Jianhui-Li @adam-smnk

charithaintc

LGTM. Thanks for the fix.

Jianhui-Li

LGTM. Thanks!

llvmbot added mlir:gpu mlir labels Nov 3, 2025

[mlir][xegpu] fix load/store/prefetch op offset verifier

3b6e09a

tkarna force-pushed the tkarna/xegpu-fix-load-verify branch from d217d00 to 3b6e09a Compare November 3, 2025 09:18

adam-smnk requested a review from charithaintc November 3, 2025 09:32

adam-smnk approved these changes Nov 3, 2025

adam-smnk requested a review from Jianhui-Li November 3, 2025 09:36

charithaintc approved these changes Nov 3, 2025

Jianhui-Li approved these changes Nov 4, 2025

adam-smnk merged commit ed45c05 into llvm:main Nov 4, 2025
10 checks passed

tkarna deleted the tkarna/xegpu-fix-load-verify branch November 6, 2025 16:49

kerbowa mentioned this pull request Nov 10, 2025

[AMDGPU] Verify dominance when rewriting spills to registers #167347

Open
