[LV] Fix '-1U' bits for smallest type in getSmallestAndWidestTypes #135783

sdesmalen-arm · 2025-04-15T12:19:21Z

For loops without loads/stores, where the smallest/widest types are calculated from the reduction, the smallest type returned is always -1U and it actually returns the smallest type as the widest type. This PR fixes the calculation.

This follows from #132190 (comment)

For loops without loads/stores, where the smallest/widest types are calculated from the reduction, the smallest type returned is always -1U and it actually returns the smallest type as the widest type. This PR fixes the calculation.

llvmbot · 2025-04-15T12:20:08Z

@llvm/pr-subscribers-vectorizers

@llvm/pr-subscribers-llvm-transforms

Author: Sander de Smalen (sdesmalen-arm)

Changes

For loops without loads/stores, where the smallest/widest types are calculated from the reduction, the smallest type returned is always -1U and it actually returns the smallest type as the widest type. This PR fixes the calculation.

This follows from #132190 (comment)

Full diff: https://github.com/llvm/llvm-project/pull/135783.diff

2 Files Affected:

(modified) llvm/lib/Transforms/Vectorize/LoopVectorize.cpp (+4-5)
(modified) llvm/test/Transforms/LoopVectorize/AArch64/smallest-and-widest-types.ll (+3-3)

diff --git a/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp b/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
index af94dc01c8c5c..c21b6c4c88929 100644
--- a/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
+++ b/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
@@ -4798,17 +4798,16 @@ LoopVectorizationCostModel::getSmallestAndWidestTypes() {
   // if there are no loads/stores in the loop. In this case, check through the
   // reduction variables to determine the maximum width.
   if (ElementTypesInLoop.empty() && !Legal->getReductionVars().empty()) {
-    // Reset MaxWidth so that we can find the smallest type used by recurrences
-    // in the loop.
-    MaxWidth = -1U;
     for (const auto &PhiDescriptorPair : Legal->getReductionVars()) {
       const RecurrenceDescriptor &RdxDesc = PhiDescriptorPair.second;
       // When finding the min width used by the recurrence we need to account
       // for casts on the input operands of the recurrence.
-      MaxWidth = std::min<unsigned>(
-          MaxWidth, std::min<unsigned>(
+      MinWidth = std::min<unsigned>(
+          MinWidth, std::min<unsigned>(
                         RdxDesc.getMinWidthCastToRecurrenceTypeInBits(),
                         RdxDesc.getRecurrenceType()->getScalarSizeInBits()));
+      MaxWidth = std::max<unsigned>(
+          MaxWidth, RdxDesc.getRecurrenceType()->getScalarSizeInBits());
     }
   } else {
     for (Type *T : ElementTypesInLoop) {
diff --git a/llvm/test/Transforms/LoopVectorize/AArch64/smallest-and-widest-types.ll b/llvm/test/Transforms/LoopVectorize/AArch64/smallest-and-widest-types.ll
index 269562fa70549..b34ba5e38811a 100644
--- a/llvm/test/Transforms/LoopVectorize/AArch64/smallest-and-widest-types.ll
+++ b/llvm/test/Transforms/LoopVectorize/AArch64/smallest-and-widest-types.ll
@@ -37,7 +37,7 @@ for.end:
 ; chosen. The following 3 cases check different combinations of widths.
 
 ; CHECK-LABEL: Checking a loop in 'no_loads_stores_32'
-; CHECK: The Smallest and Widest types: 4294967295 / 32 bits
+; CHECK: The Smallest and Widest types: 32 / 64 bits
 ; CHECK: Selecting VF: 4
 
 define double @no_loads_stores_32(i32 %n) {
@@ -60,7 +60,7 @@ for.end:
 }
 
 ; CHECK-LABEL: Checking a loop in 'no_loads_stores_16'
-; CHECK: The Smallest and Widest types: 4294967295 / 16 bits
+; CHECK: The Smallest and Widest types: 16 / 64 bits
 ; CHECK: Selecting VF: 8
 
 define double @no_loads_stores_16() {
@@ -82,7 +82,7 @@ for.end:
 }
 
 ; CHECK-LABEL: Checking a loop in 'no_loads_stores_8'
-; CHECK: The Smallest and Widest types: 4294967295 / 8 bits
+; CHECK: The Smallest and Widest types: 8 / 32 bits
 ; CHECK: Selecting VF: 16
 
 define float @no_loads_stores_8() {

fhahn · 2025-04-16T07:29:36Z

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

                        RdxDesc.getMinWidthCastToRecurrenceTypeInBits(),
                        RdxDesc.getRecurrenceType()->getScalarSizeInBits()));
+      MaxWidth = std::max<unsigned>(


This change does also change how MaxWidth is calculated, previously MaxWidth was what MinWidth is now?

Granted, it seems to make sense to change this as well, although I am not sure if this was done intentionally when support for in-loop reductions originally

This change does also change how MaxWidth is calculated, previously MaxWidth was what MinWidth is now?

Yes, that's right.

I am not sure if this was done intentionally when support for in-loop reductions originally

I wonder if MaxWidth got confused with the resulting maximum vector width (/factor) at some point. The original patch that added this was https://reviews.llvm.org/D113973.

The tests added in that patch contain two different types, particularly for testing this purpose, although the CHECK lines (e.g. CHECK: The Smallest and Widest types: 4294967295 / 16 bits) seemed wrong, but after this change it is what I would expect it to be.

Yes, it seems like the new behavior matches the expectations.

SamTebbs33

LGTM, thank you.

fhahn

LGTM, thanks

fhahn · 2025-04-16T20:51:48Z

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

                        RdxDesc.getMinWidthCastToRecurrenceTypeInBits(),
                        RdxDesc.getRecurrenceType()->getScalarSizeInBits()));
+      MaxWidth = std::max<unsigned>(


Yes, it seems like the new behavior matches the expectations.

llvm-ci · 2025-04-17T13:03:07Z

LLVM Buildbot has detected a new failure on builder arc-builder running on arc-worker while building llvm at step 6 "test-build-unified-tree-check-all".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/3/builds/14674

Here is the relevant piece of the build log for the reference

Step 6 (test-build-unified-tree-check-all) failure: 1200 seconds without output running [b'ninja', b'check-all'], attempting to kill
...
..............................................................................................................................................................
----------------------------------------------------------------------
Ran 158 tests in 2.494s

OK
10.643 [31/18/30] Linking CXX executable unittests/Debuginfod/DebuginfodTests
11.126 [30/18/31] Linking CXX executable unittests/DebugInfo/PDB/DebugInfoPDBTests
11.746 [29/18/32] Linking CXX executable unittests/ExecutionEngine/JITLink/JITLinkTests
11.894 [28/18/33] Linking CXX executable unittests/ExecutionEngine/ExecutionEngineTests
12.283 [27/18/34] Linking CXX executable unittests/InterfaceStub/InterfaceStubTests
command timed out: 1200 seconds without output running [b'ninja', b'check-all'], attempting to kill
process killed by signal 9
program finished with exit code -1
elapsedTime=1213.276590

sdesmalen-arm requested review from fhahn, SamTebbs33 and david-arm April 15, 2025 12:19

llvmbot added vectorizers llvm:transforms labels Apr 15, 2025

fhahn reviewed Apr 16, 2025

View reviewed changes

SamTebbs33 approved these changes Apr 16, 2025

View reviewed changes

fhahn approved these changes Apr 16, 2025

View reviewed changes

sdesmalen-arm merged commit f9c01b5 into llvm:main Apr 17, 2025
14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[LV] Fix '-1U' bits for smallest type in getSmallestAndWidestTypes #135783

[LV] Fix '-1U' bits for smallest type in getSmallestAndWidestTypes #135783

Uh oh!

sdesmalen-arm commented Apr 15, 2025

Uh oh!

llvmbot commented Apr 15, 2025 •

edited

Loading

Uh oh!

fhahn Apr 16, 2025

Uh oh!

sdesmalen-arm Apr 16, 2025

Uh oh!

fhahn Apr 16, 2025

Uh oh!

SamTebbs33 left a comment

Uh oh!

fhahn left a comment

Uh oh!

fhahn Apr 16, 2025

Uh oh!

Uh oh!

llvm-ci commented Apr 17, 2025

Uh oh!

Uh oh!

[LV] Fix '-1U' bits for smallest type in getSmallestAndWidestTypes #135783

[LV] Fix '-1U' bits for smallest type in getSmallestAndWidestTypes #135783

Uh oh!

Conversation

sdesmalen-arm commented Apr 15, 2025

Uh oh!

llvmbot commented Apr 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fhahn Apr 16, 2025

Choose a reason for hiding this comment

Uh oh!

sdesmalen-arm Apr 16, 2025

Choose a reason for hiding this comment

Uh oh!

fhahn Apr 16, 2025

Choose a reason for hiding this comment

Uh oh!

SamTebbs33 left a comment

Choose a reason for hiding this comment

Uh oh!

fhahn left a comment

Choose a reason for hiding this comment

Uh oh!

fhahn Apr 16, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

llvm-ci commented Apr 17, 2025

Uh oh!

Uh oh!

llvmbot commented Apr 15, 2025 •

edited

Loading