[mlir][TD] Fix the order of return handles #76929

banach-space · 2024-01-04T09:34:16Z

Replace (in tests and docs):

%forall, %tiled = transform.structured.tile_using_forall

with (updated order of return handles):

%tiled, %forall = transform.structured.tile_using_forall

Similar change is applied to (in the TD tutorial):

transform.structured.fuse_into_containing_op

This update makes sure that the tests/documentation are consistent with
the Op specifications. Follow-up for #67320 which updated the order of
the return handles for tile_using_forall.

llvmbot · 2024-01-04T09:34:43Z

@llvm/pr-subscribers-mlir

@llvm/pr-subscribers-mlir-gpu

Author: Andrzej Warzyński (banach-space)

Changes

Replace (in tests and docs):

%forall, %tiled = transform.structured.tile_using_forall

with:

%tiled, %forall = transform.structured.tile_using_forall

Applies similar change to (in the TD tutorial):

transform.structured.fuse_into_containing_op

This update makes sure that the tests/documentation are consistent with
the Op specifications. Follow-up for #67320 which updated the order of
the return handles for tile_using_forall.

Full diff: https://github.com/llvm/llvm-project/pull/76929.diff

5 Files Affected:

(modified) mlir/docs/Tutorials/transform/Ch1.md (+4-4)
(modified) mlir/include/mlir/Dialect/Linalg/TransformOps/LinalgTransformOps.td (+2-2)
(modified) mlir/test/Dialect/GPU/transform-gpu-failing.mlir (+1-1)
(modified) mlir/test/Dialect/Linalg/tile-to-forall.mlir (+2-2)
(modified) mlir/test/Examples/transform/Ch1/invalidation-1.mlir (+2-2)

diff --git a/mlir/docs/Tutorials/transform/Ch1.md b/mlir/docs/Tutorials/transform/Ch1.md index 95b37eb6ca4130..90b30300a98ced 100644 --- a/mlir/docs/Tutorials/transform/Ch1.md +++ b/mlir/docs/Tutorials/transform/Ch1.md @@ -261,7 +261,7 @@ transform.sequence failures(propagate) { // The actual tiling transformation takes tile sizes as attributes. It // produces a handle to the loop generated during tiling. - %loop, %tiled_max = + %tiled_max, %loop = transform.structured.tile_using_forall %max tile_sizes [8, 32] : (!transform.any_op) -> (!transform.any_op, !transform.any_op) @@ -271,12 +271,12 @@ transform.sequence failures(propagate) { // important. We could also use "transform.merge_handles" to obtain a single // handle to all operations and give it to `fuse_into_containing_op` that // would take care of the ordering in this case. - %loop_0, %tiled_add = + %add_fused, %loop_0 = transform.structured.fuse_into_containing_op %add into %loop : (!transform.any_op, !transform.any_op) -> (!transform.any_op, !transform.any_op) - %loop_1, %tiled_matmul = - transform.structured.fuse_into_containing_op %arg1 into %loop + %matmul_fused, %loop_1 = + transform.structured.fuse_into_containing_op %arg1 into %loop_0 : (!transform.op<"linalg.matmul">, !transform.any_op) -> (!transform.any_op, !transform.any_op) diff --git a/mlir/include/mlir/Dialect/Linalg/TransformOps/LinalgTransformOps.td b/mlir/include/mlir/Dialect/Linalg/TransformOps/LinalgTransformOps.td index 77ed9db5e71bd1..bc257d17483e3b 100644 --- a/mlir/include/mlir/Dialect/Linalg/TransformOps/LinalgTransformOps.td +++ b/mlir/include/mlir/Dialect/Linalg/TransformOps/LinalgTransformOps.td @@ -1911,8 +1911,8 @@ def TileUsingForallOp : tiled operations, which can all be empty. These two returned handles point to: - - the new scf.forall op, - - the tiled op that implements TilingInterface. + - the tiled op that implements TilingInterface, + - the new scf.forall op. #### Example using `num_threads` diff --git a/mlir/test/Dialect/GPU/transform-gpu-failing.mlir b/mlir/test/Dialect/GPU/transform-gpu-failing.mlir index f81f8b64afdfc6..8d7a1aa2a55fca 100644 --- a/mlir/test/Dialect/GPU/transform-gpu-failing.mlir +++ b/mlir/test/Dialect/GPU/transform-gpu-failing.mlir @@ -144,7 +144,7 @@ func.func @map_nested_forall_to_threads_not_buffer(%x: tensor<32x32xf32>, %y: te module attributes {transform.with_named_sequence} { transform.named_sequence @__transform_main(%arg0: !transform.any_op {transform.readonly}) { %matmul = transform.structured.match ops{["linalg.matmul"]} in %arg0 : (!transform.any_op) -> !transform.any_op - %forall, %tiled = transform.structured.tile_using_forall %matmul num_threads [2, 3, 1] (mapping = [ #gpu.thread<y>, #gpu.thread<x>, #gpu.thread<z> ] ) + %tiled, %forall = transform.structured.tile_using_forall %matmul num_threads [2, 3, 1] (mapping = [ #gpu.thread<y>, #gpu.thread<x>, #gpu.thread<z> ] ) : (!transform.any_op) -> (!transform.any_op, !transform.any_op) %funcop = transform.structured.match ops{["gpu.launch"]} in %arg0 : (!transform.any_op) -> !transform.any_op // expected-error @below {{only bufferized scf.forall can be mapped}} diff --git a/mlir/test/Dialect/Linalg/tile-to-forall.mlir b/mlir/test/Dialect/Linalg/tile-to-forall.mlir index 38742028e48101..2192d160b1150f 100644 --- a/mlir/test/Dialect/Linalg/tile-to-forall.mlir +++ b/mlir/test/Dialect/Linalg/tile-to-forall.mlir @@ -389,7 +389,7 @@ module attributes {transform.with_named_sequence} { module attributes {transform.with_named_sequence} { transform.named_sequence @__transform_main(%arg1: !transform.any_op {transform.readonly}) { %0 = transform.structured.match ops{["linalg.generic"]} in %arg1 : (!transform.any_op) -> !transform.any_op - %forall, %tiled_generic = transform.structured.tile_using_forall %0 num_threads [7] + %tiled_generic, %forall = transform.structured.tile_using_forall %0 num_threads [7] : (!transform.any_op) -> (!transform.any_op, !transform.any_op) transform.yield } @@ -445,7 +445,7 @@ module attributes {transform.with_named_sequence} { module attributes {transform.with_named_sequence} { transform.named_sequence @__transform_main(%IN_MAT2: !transform.any_op {transform.readonly}) { %0 = transform.structured.match ops{["linalg.generic"]} in %IN_MAT2 : (!transform.any_op) -> !transform.any_op - %forall, %tiled_generic = transform.structured.tile_using_forall %0 num_threads [4] + %tiled_generic, %forall = transform.structured.tile_using_forall %0 num_threads [4] : (!transform.any_op) -> (!transform.any_op, !transform.any_op) transform.yield } diff --git a/mlir/test/Examples/transform/Ch1/invalidation-1.mlir b/mlir/test/Examples/transform/Ch1/invalidation-1.mlir index 1b71cfbe9b3607..825e2abe48229f 100644 --- a/mlir/test/Examples/transform/Ch1/invalidation-1.mlir +++ b/mlir/test/Examples/transform/Ch1/invalidation-1.mlir @@ -19,7 +19,7 @@ transform.sequence failures(propagate) { %arg2: !transform.op<"linalg.elemwise_binary">): // The actual tiling transformation takes tile sizes as attributes. // expected-note @below {{invalidated by this transform op that consumes its operand #0 and invalidates all handles to payload IR entities associated with this operand and entities nested in them}} - %loop, %tiled = transform.structured.tile_using_forall %arg1 tile_sizes [4, 32] + %tiled, %loop = transform.structured.tile_using_forall %arg1 tile_sizes [4, 32] : (!transform.op<"linalg.matmul">) -> (!transform.any_op, !transform.any_op) // This is trying to use an invalidated handle leading to undefined behavior. @@ -64,7 +64,7 @@ transform.sequence failures(propagate) { // The actual tiling transformation takes tile sizes as attributes. // expected-note @below {{invalidated by this transform op that consumes its operand #0 and invalidates all handles to payload IR entities associated with this operand and entities nested in them}} - %loop, %tiled = transform.structured.tile_using_forall %arg1 tile_sizes [4, 32] + %tiled, %loop = transform.structured.tile_using_forall %arg1 tile_sizes [4, 32] : (!transform.op<"linalg.matmul">) -> (!transform.any_op, !transform.any_op) // Consuming an operand invalidates the consumed handle and any other handle that is

Replace (in tests and docs): %forall, %tiled = transform.structured.tile_using_forall with: %tiled, %forall = transform.structured.tile_using_forall Applies similar change to (in the TD tutorial): transform.structured.fuse_into_containing_op This update makes sure that the tests/documentation are consistent with the Op specifications. Follow-up for llvm#67320 which updated the order of the return handles for `tile_using_forall`.

ftynse

Thanks!

banach-space requested review from dcaballe and nicolasvasilache as code owners January 4, 2024 09:34

llvmbot added mlir:linalg mlir:gpu mlir labels Jan 4, 2024

banach-space requested a review from ftynse January 4, 2024 09:35

banach-space force-pushed the andrzej/update_forall_examples_and_docs branch from 3266b8d to ace1ed9 Compare January 4, 2024 09:50

ftynse approved these changes Jan 4, 2024

View reviewed changes

banach-space merged commit ca5d34e into llvm:main Jan 4, 2024

banach-space deleted the andrzej/update_forall_examples_and_docs branch March 16, 2024 19:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[mlir][TD] Fix the order of return handles #76929

[mlir][TD] Fix the order of return handles #76929

Uh oh!

banach-space commented Jan 4, 2024 •

edited

Loading

llvmbot commented Jan 4, 2024 •

edited

Loading

ftynse left a comment

Labels

3 participants

[mlir][TD] Fix the order of return handles #76929

[mlir][TD] Fix the order of return handles #76929

Uh oh!

Conversation

banach-space commented Jan 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

llvmbot commented Jan 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

ftynse left a comment

Choose a reason for hiding this comment

Labels

3 participants

banach-space commented Jan 4, 2024 •

edited

Loading

llvmbot commented Jan 4, 2024 •

edited

Loading