[mlir][SME] Update E2E test to show potential optimisation (NFC) #107585

nujaa · 2024-09-06T13:34:23Z

Introduces loop hoisting to ARM SME E2E tests to allow the hoisting of the tile load offering very important speedup.

Discussed here : https://discourse.llvm.org/t/mlir-for-arm-sme-reducing-tile-data-transfers/80065/2

llvmbot · 2024-09-06T14:13:56Z

@llvm/pr-subscribers-mlir

@llvm/pr-subscribers-mlir-linalg

Author: Hugo Trachino (nujaa)

Changes

Introduces loop hoisting to ARM SME E2E tests to allow the hoisting of the tile load offering very important speedup.

Discussed here : https://discourse.llvm.org/t/mlir-for-arm-sme-reducing-tile-data-transfers/80065/2

Full diff: https://github.com/llvm/llvm-project/pull/107585.diff

2 Files Affected:

(modified) mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul-transpose-a.mlir (+8)
(modified) mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul.mlir (+8)

diff --git a/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul-transpose-a.mlir b/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul-transpose-a.mlir
index a57348a543c3cf..886211b65efa2d 100644
--- a/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul-transpose-a.mlir
+++ b/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul-transpose-a.mlir
@@ -82,8 +82,16 @@ module attributes {transform.with_named_sequence} {
     transform.apply_patterns to %func {
       transform.apply_patterns.vector.lower_contraction lowering_strategy = "outerproduct"
       transform.apply_patterns.vector.lower_masks
+      transform.apply_patterns.canonicalization
     } : !transform.any_op
 
+    // Step 5: Hoist load of accumulator.
+    %func_h = transform.structured.hoist_redundant_vector_transfers %func
+        : (!transform.any_op) -> !transform.any_op
+    %all_loops = transform.structured.match interface{LoopLikeInterface} in %module
+      : (!transform.any_op) -> !transform.any_op
+    transform.apply_licm to %all_loops : !transform.any_op
+    transform.loop.hoist_loop_invariant_subsets %all_loops : !transform.any_op
     transform.yield
   }
 }
diff --git a/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul.mlir b/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul.mlir
index 79c9fcac70604b..4b6b9a9c746499 100644
--- a/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul.mlir
+++ b/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul.mlir
@@ -88,8 +88,16 @@ module attributes {transform.with_named_sequence} {
       transform.apply_patterns.vector.lower_contraction lowering_strategy = "outerproduct"
       transform.apply_patterns.vector.lower_masks
       transform.apply_patterns.vector.rank_reducing_subview_patterns
+      transform.apply_patterns.canonicalization
     } : !transform.any_op
 
+    // Step 6: Hoist load of accumulator.
+    %func_h = transform.structured.hoist_redundant_vector_transfers %func
+        : (!transform.any_op) -> !transform.any_op
+    %all_loops = transform.structured.match interface{LoopLikeInterface} in %bufferize
+      : (!transform.any_op) -> !transform.any_op
+    transform.apply_licm to %all_loops : !transform.any_op
+    transform.loop.hoist_loop_invariant_subsets %all_loops : !transform.any_op
     transform.yield
   }
 }

llvmbot · 2024-09-06T14:13:56Z

@llvm/pr-subscribers-mlir-sme

Author: Hugo Trachino (nujaa)

Changes

Introduces loop hoisting to ARM SME E2E tests to allow the hoisting of the tile load offering very important speedup.

Discussed here : https://discourse.llvm.org/t/mlir-for-arm-sme-reducing-tile-data-transfers/80065/2

Full diff: https://github.com/llvm/llvm-project/pull/107585.diff

2 Files Affected:

(modified) mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul-transpose-a.mlir (+8)
(modified) mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul.mlir (+8)

diff --git a/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul-transpose-a.mlir b/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul-transpose-a.mlir
index a57348a543c3cf..886211b65efa2d 100644
--- a/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul-transpose-a.mlir
+++ b/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul-transpose-a.mlir
@@ -82,8 +82,16 @@ module attributes {transform.with_named_sequence} {
     transform.apply_patterns to %func {
       transform.apply_patterns.vector.lower_contraction lowering_strategy = "outerproduct"
       transform.apply_patterns.vector.lower_masks
+      transform.apply_patterns.canonicalization
     } : !transform.any_op
 
+    // Step 5: Hoist load of accumulator.
+    %func_h = transform.structured.hoist_redundant_vector_transfers %func
+        : (!transform.any_op) -> !transform.any_op
+    %all_loops = transform.structured.match interface{LoopLikeInterface} in %module
+      : (!transform.any_op) -> !transform.any_op
+    transform.apply_licm to %all_loops : !transform.any_op
+    transform.loop.hoist_loop_invariant_subsets %all_loops : !transform.any_op
     transform.yield
   }
 }
diff --git a/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul.mlir b/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul.mlir
index 79c9fcac70604b..4b6b9a9c746499 100644
--- a/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul.mlir
+++ b/mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul.mlir
@@ -88,8 +88,16 @@ module attributes {transform.with_named_sequence} {
       transform.apply_patterns.vector.lower_contraction lowering_strategy = "outerproduct"
       transform.apply_patterns.vector.lower_masks
       transform.apply_patterns.vector.rank_reducing_subview_patterns
+      transform.apply_patterns.canonicalization
     } : !transform.any_op
 
+    // Step 6: Hoist load of accumulator.
+    %func_h = transform.structured.hoist_redundant_vector_transfers %func
+        : (!transform.any_op) -> !transform.any_op
+    %all_loops = transform.structured.match interface{LoopLikeInterface} in %bufferize
+      : (!transform.any_op) -> !transform.any_op
+    transform.apply_licm to %all_loops : !transform.any_op
+    transform.loop.hoist_loop_invariant_subsets %all_loops : !transform.any_op
     transform.yield
   }
 }

banach-space

Thanks Hugo, LGTM!

Could you add a note that the additional step is not required for functional correctness and that instead it's an optimisation? This is obvious today, but our future selves might forget ;-) Thanks!

MacDue · 2024-09-07T09:01:02Z

mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/matmul-transpose-a.mlir

    } : !transform.any_op

+    // Step 5: Hoist load of accumulator.


Nit: It's both the load and store of the accumulator that's hoisted.

MacDue · 2024-09-09T11:22:12Z

Typo optionnal -> optional (also maybe say optimization rather than optional)

…ion (NFC)

[mlir][SME] Update E2E test to show potential optimisation (NFC)

b7d6b76

nujaa requested a review from MacDue September 6, 2024 13:49

nujaa marked this pull request as ready for review September 6, 2024 14:13

nujaa requested review from banach-space, dcaballe and nicolasvasilache as code owners September 6, 2024 14:13

llvmbot added mlir:linalg mlir mlir:sme labels Sep 6, 2024

banach-space approved these changes Sep 6, 2024

View reviewed changes

MacDue approved these changes Sep 7, 2024

View reviewed changes

fixup! [mlir][SME] Update E2E test to show potential optimisation (NFC)

b6cc591

fixup! fixup! [mlir][SME] Update E2E test to show potential optimisat…

784ee2d

…ion (NFC)

nujaa merged commit 8aeb104 into llvm:main Sep 10, 2024
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[mlir][SME] Update E2E test to show potential optimisation (NFC) #107585

[mlir][SME] Update E2E test to show potential optimisation (NFC) #107585

Uh oh!

nujaa commented Sep 6, 2024

Uh oh!

llvmbot commented Sep 6, 2024 •

edited

Loading

Uh oh!

llvmbot commented Sep 6, 2024

Uh oh!

banach-space left a comment

Uh oh!

MacDue Sep 7, 2024

Uh oh!

MacDue commented Sep 9, 2024

Uh oh!

Uh oh!

Uh oh!

[mlir][SME] Update E2E test to show potential optimisation (NFC) #107585

[mlir][SME] Update E2E test to show potential optimisation (NFC) #107585

Uh oh!

Conversation

nujaa commented Sep 6, 2024

Uh oh!

llvmbot commented Sep 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Sep 6, 2024

Uh oh!

banach-space left a comment

Choose a reason for hiding this comment

Uh oh!

MacDue Sep 7, 2024

Choose a reason for hiding this comment

Uh oh!

MacDue commented Sep 9, 2024

Uh oh!

Uh oh!

Uh oh!

llvmbot commented Sep 6, 2024 •

edited

Loading