Reland: [MLIR][Transforms] Fix Mem2Reg removal order to respect dominance #68767

Dinistro · 2023-10-11T07:07:46Z

Reverts the revert commit and fixes the comparator so that it satisfies the strict weak ordering requirement of llvm::sort.

Original commit message:

This commit fixes a bug in the Mem2Reg operation erasure order. Replacing the topological order with a dominance-based order ensures that no operation is removed before all its uses have been replaced. Additionally, the reliance on the DenseMap key order was eliminated by switching to a MapVector, which gives a deterministic iteration order.

Example:

%ptr = alloca ...
...
%val0 = load %ptr ... // LOAD0
store %val0 %ptr ...
%val1 = load %ptr ... // LOAD1

When promoting the slot backing %ptr, LOAD0 may be cleaned up before LOAD1. In that case, all uses of LOAD0 are replaced by its reaching definition before LOAD1's result is replaced by LOAD0's result, which gives LOAD0 a new use. The subsequent erasure of LOAD0 then fails because it still has uses.
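The ordering problem can be replayed on a toy model. The sketch below is not MLIR code: values are plain strings, `Uses`, `replaceAllUses`, and `remainingUsesOfLoad0` are hypothetical names, and it assumes a later user that reads `%val1`. It only counts how many operands still read `%val0` after both loads have been processed in a given order.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// Toy IR: each entry records which value one operand slot currently reads.
using Uses = std::vector<std::string>;

// Rewrites every operand that reads `from` so that it reads `to`.
void replaceAllUses(Uses &uses, const std::string &from,
                    const std::string &to) {
  for (std::string &operand : uses)
    if (operand == from)
      operand = to;
}

// Replays the promotion of the two loads in `order` and returns how many
// operands still read %val0 (the result of LOAD0) afterwards.
int remainingUsesOfLoad0(const std::vector<std::string> &order) {
  // The store's value operand reads %val0; an assumed later user reads %val1.
  Uses uses = {"%val0", "%val1"};
  // LOAD0 sees the slot's default value; LOAD1 sees the value that was
  // stored right before it, which is %val0.
  const std::map<std::string, std::string> reachingDef = {
      {"%val0", "%default"}, {"%val1", "%val0"}};
  for (const std::string &load : order)
    replaceAllUses(uses, load, reachingDef.at(load));
  int count = 0;
  for (const std::string &operand : uses)
    if (operand == "%val0")
      ++count;
  return count;
}
```

Processing LOAD0 first leaves one operand reading `%val0`, so LOAD0 cannot be erased. Processing LOAD1 first (reverse dominance order) leaves none.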

Dinistro · 2023-10-11T07:08:41Z

The original PR (#68687) cannot be reopened, which is why this new one was created.

llvmbot · 2023-10-11T07:08:54Z

@llvm/pr-subscribers-mlir-llvm
@llvm/pr-subscribers-mlir-core

@llvm/pr-subscribers-mlir

Author: Christian Ulmann (Dinistro)

Changes

Full diff: https://github.com/llvm/llvm-project/pull/68767.diff

2 Files Affected:

(modified) mlir/lib/Transforms/Mem2Reg.cpp (+21-12)
(modified) mlir/test/Dialect/LLVMIR/mem2reg.mlir (+13)

diff --git a/mlir/lib/Transforms/Mem2Reg.cpp b/mlir/lib/Transforms/Mem2Reg.cpp
index 65de25dd2f32663..3132d5b2f82a6a0 100644
--- a/mlir/lib/Transforms/Mem2Reg.cpp
+++ b/mlir/lib/Transforms/Mem2Reg.cpp
@@ -96,6 +96,9 @@ using namespace mlir;
 
 namespace {
 
+using BlockingUsesMap =
+    llvm::MapVector<Operation *, SmallPtrSet<OpOperand *, 4>>;
+
 /// Information computed during promotion analysis used to perform actual
 /// promotion.
 struct MemorySlotPromotionInfo {
@@ -106,7 +109,7 @@ struct MemorySlotPromotionInfo {
   /// its uses, it is because the defining ops of the blocking uses requested
   /// it. The defining ops therefore must also have blocking uses or be the
   /// starting point of the bloccking uses.
-  DenseMap<Operation *, SmallPtrSet<OpOperand *, 4>> userToBlockingUses;
+  BlockingUsesMap userToBlockingUses;
 };
 
 /// Computes information for basic slot promotion. This will check that direct
@@ -129,8 +132,7 @@ class MemorySlotPromotionAnalyzer {
   /// uses (typically, removing its users because it will delete itself to
   /// resolve its own blocking uses). This will fail if one of the transitive
   /// users cannot remove a requested use, and should prevent promotion.
-  LogicalResult computeBlockingUses(
-      DenseMap<Operation *, SmallPtrSet<OpOperand *, 4>> &userToBlockingUses);
+  LogicalResult computeBlockingUses(BlockingUsesMap &userToBlockingUses);
 
   /// Computes in which blocks the value stored in the slot is actually used,
   /// meaning blocks leading to a load. This method uses `definingBlocks`, the
@@ -233,7 +235,7 @@ Value MemorySlotPromoter::getLazyDefaultValue() {
 }
 
 LogicalResult MemorySlotPromotionAnalyzer::computeBlockingUses(
-    DenseMap<Operation *, SmallPtrSet<OpOperand *, 4>> &userToBlockingUses) {
+    BlockingUsesMap &userToBlockingUses) {
   // The promotion of an operation may require the promotion of further
   // operations (typically, removing operations that use an operation that must
   // delete itself). We thus need to start from the use of the slot pointer and
@@ -243,7 +245,7 @@ LogicalResult MemorySlotPromotionAnalyzer::computeBlockingUses(
   // use it.
   for (OpOperand &use : slot.ptr.getUses()) {
     SmallPtrSet<OpOperand *, 4> &blockingUses =
-        userToBlockingUses.getOrInsertDefault(use.getOwner());
+        userToBlockingUses[use.getOwner()];
     blockingUses.insert(&use);
   }
 
@@ -281,7 +283,7 @@ LogicalResult MemorySlotPromotionAnalyzer::computeBlockingUses(
       assert(llvm::is_contained(user->getResults(), blockingUse->get()));
 
       SmallPtrSetImpl<OpOperand *> &newUserBlockingUseSet =
-          userToBlockingUses.getOrInsertDefault(blockingUse->getOwner());
+          userToBlockingUses[blockingUse->getOwner()];
       newUserBlockingUseSet.insert(blockingUse);
     }
   }
@@ -516,14 +518,21 @@ void MemorySlotPromoter::computeReachingDefInRegion(Region *region,
 }
 
 void MemorySlotPromoter::removeBlockingUses() {
-  llvm::SetVector<Operation *> usersToRemoveUses;
-  for (auto &user : llvm::make_first_range(info.userToBlockingUses))
-    usersToRemoveUses.insert(user);
-  SetVector<Operation *> sortedUsersToRemoveUses =
-      mlir::topologicalSort(usersToRemoveUses);
+  llvm::SmallVector<Operation *> usersToRemoveUses(
+      llvm::make_first_range(info.userToBlockingUses));
+
+  // The uses need to be traversed in *reverse dominance* order to ensure that
+  // transitive replacements are performed correctly.
+  // NOTE: The order can be non-deterministic, due to a pointer comparision, but
+  // this has no effect on the result of the pattern. This is necessary to get a
+  // strict weak order relation.
+  llvm::sort(usersToRemoveUses, [&](Operation *lhs, Operation *rhs) {
+    return dominance.properlyDominates(rhs, lhs) ||
+           (!dominance.properlyDominates(lhs, rhs) && rhs < lhs);
+  });
 
   llvm::SmallVector<Operation *> toErase;
-  for (Operation *toPromote : llvm::reverse(sortedUsersToRemoveUses)) {
+  for (Operation *toPromote : usersToRemoveUses) {
     if (auto toPromoteMemOp = dyn_cast<PromotableMemOpInterface>(toPromote)) {
       Value reachingDef = reachingDefs.lookup(toPromoteMemOp);
       // If no reaching definition is known, this use is outside the reach of
diff --git a/mlir/test/Dialect/LLVMIR/mem2reg.mlir b/mlir/test/Dialect/LLVMIR/mem2reg.mlir
index 30ba459d07a49f3..32e3fed7e5485df 100644
--- a/mlir/test/Dialect/LLVMIR/mem2reg.mlir
+++ b/mlir/test/Dialect/LLVMIR/mem2reg.mlir
@@ -683,3 +683,16 @@ llvm.func @no_inner_alloca_promotion(%arg: i64) -> i64 {
   // CHECK: llvm.return %[[RES]] : i64
   llvm.return %2 : i64
 }
+
+// -----
+
+// CHECK-LABEL: @transitive_reaching_def
+llvm.func @transitive_reaching_def() -> !llvm.ptr {
+  %0 = llvm.mlir.constant(1 : i32) : i32
+  // CHECK-NOT: alloca
+  %1 = llvm.alloca %0 x !llvm.ptr {alignment = 8 : i64} : (i32) -> !llvm.ptr
+  %2 = llvm.load %1 {alignment = 8 : i64} : !llvm.ptr -> !llvm.ptr
+  llvm.store %2, %1 {alignment = 8 : i64} : !llvm.ptr, !llvm.ptr
+  %3 = llvm.load %1 {alignment = 8 : i64} : !llvm.ptr -> !llvm.ptr
+  llvm.return %3 : !llvm.ptr
+}

gysit

LGTM

metaflow · 2023-10-11T14:09:46Z

Hi! I see a test failure in mem2reg.mlir.test. The assertion `__comp(*(__first + __a), *(__first + __b))` failed with: "Your comparator is not a valid strict-weak ordering."
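One way a comparator of this shape can violate strict weak ordering is by breaking transitivity when a third operation is incomparable to two ordered ones. The thread does not say which inputs triggered the assertion, so the following is only an illustrative sketch: `Op`, `ToyDominance`, `before`, and `violatesTransitivity` are hypothetical names, and integers stand in for pointer values.

```cpp
#include <cassert>
#include <set>
#include <utility>

// Hypothetical stand-in for Operation *: the integer models the pointer value.
using Op = int;

// Toy dominance relation: (a, b) is present when a properly dominates b.
struct ToyDominance {
  std::set<std::pair<Op, Op>> edges;
  bool properlyDominates(Op a, Op b) const {
    return edges.count({a, b}) != 0;
  }
};

// Same shape as the comparator in the patch, with pointers replaced by ints.
bool before(const ToyDominance &dom, Op lhs, Op rhs) {
  return dom.properlyDominates(rhs, lhs) ||
         (!dom.properlyDominates(lhs, rhs) && rhs < lhs);
}

// Returns true when x < y and y < z hold under `before`, but x < z does not.
bool violatesTransitivity(const ToyDominance &dom, Op x, Op y, Op z) {
  return before(dom, x, y) && before(dom, y, z) && !before(dom, x, z);
}
```

With a single edge where operation 3 dominates operation 1, and operation 2 unrelated to both, `before(1, 3)` and `before(3, 2)` hold but `before(1, 2)` does not. The pointer tie-break for incomparable pairs is what closes the cycle.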

Dinistro · 2023-10-11T14:31:33Z

Did you see this on the relanded commit or on the original version?

metaflow · 2023-10-11T14:43:18Z

> Did you see this on the relanded commit or on the original version?

On the reland.

joker-eph · 2023-10-11T15:03:26Z

Please be careful about the title wrapping (both authors and reviewer).
GitHub does that but you need to manually undo it.

Dinistro · 2023-10-11T15:14:48Z

I reverted the commit again, thanks for the heads up.
It seems that we cannot use a normal sort algorithm but have to implement a manual topological sort to deal with the partial order problem 😞
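A manual sort over a partial order could look roughly like the sketch below. It is not taken from the follow-up change, whose implementation is not shown in this thread. `sortByPartialOrder` and `mustPrecede` are hypothetical names, and the predicate is assumed to be a strict partial order (irreflexive and transitive). For the reverse dominance order discussed here, `mustPrecede(a, b)` would mean that `b` properly dominates `a`.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Orders `items` so that whenever mustPrecede(a, b) holds, a appears before b.
// Assumes mustPrecede is a strict partial order; incomparable elements keep
// their relative input order. Quadratic work per emitted element: a sketch,
// not a tuned implementation.
template <typename T, typename Pred>
std::vector<T> sortByPartialOrder(std::vector<T> items, Pred mustPrecede) {
  std::vector<T> result;
  result.reserve(items.size());
  while (!items.empty()) {
    // Find the first pending element that no other pending element must
    // precede.
    std::size_t pick = items.size();
    for (std::size_t i = 0; i < items.size() && pick == items.size(); ++i) {
      bool blocked = false;
      for (std::size_t j = 0; j < items.size(); ++j) {
        if (i != j && mustPrecede(items[j], items[i])) {
          blocked = true;
          break;
        }
      }
      if (!blocked)
        pick = i;
    }
    assert(pick != items.size() && "mustPrecede contains a cycle");
    result.push_back(items[pick]);
    items.erase(items.begin() + static_cast<std::ptrdiff_t>(pick));
  }
  return result;
}
```

Unlike `llvm::sort`, this never compares two incomparable elements against each other through a tie-break, so no strict weak ordering is required of the predicate.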

Dinistro requested review from gysit and Moxinilian October 11, 2023 07:07

llvmbot added mlir:core MLIR Core Infrastructure mlir:llvm mlir labels Oct 11, 2023

gysit approved these changes Oct 11, 2023

Dinistro merged commit 59fec73 into llvm:main Oct 11, 2023

Dinistro deleted the fix-mem2reg-removal-order branch October 11, 2023 07:23

Dinistro restored the fix-mem2reg-removal-order branch October 11, 2023 15:11

Dinistro added a commit that referenced this pull request Oct 11, 2023

Revert "Reland: [MLIR][Transforms] Fix Mem2Reg removal order to respe…

660a78f

…ct dominance (#68767)" This reverts commit 59fec73. This reland did not properly fix the partial order.

Dinistro changed the title ~~Reland: [MLIR][Transforms] Fix Mem2Reg removal order to respect domin…~~ Reland: [MLIR][Transforms] Fix Mem2Reg removal order to respect dominance Oct 11, 2023

Dinistro mentioned this pull request Oct 12, 2023

Reland: [MLIR][Transforms] Fix Mem2Reg removal order to respect dominance #68877

Merged
