[AMDGPU] Set Size to 4 for V_MOV_B64_PSEUDO and S_MOV_B64_IMM_PSEUDO #70376

rampitec · 2023-10-26T20:31:55Z

These are not fixed-size instructions, so the immediate size is added separately. The minimal opcode size has been 4 since the introduction of the V_MOV_B64 instruction: a real instruction can be as small as 4 bytes when it uses an inline immediate. Otherwise this is NFCI.

llvmbot · 2023-10-26T20:32:58Z

@llvm/pr-subscribers-backend-amdgpu

Author: Stanislav Mekhanoshin (rampitec)

Full diff: https://github.com/llvm/llvm-project/pull/70376.diff

1 Files Affected:

(modified) llvm/lib/Target/AMDGPU/SIInstructions.td (+2-2)

diff --git a/llvm/lib/Target/AMDGPU/SIInstructions.td b/llvm/lib/Target/AMDGPU/SIInstructions.td
index 567f1b812c1808c..bab74c170ab399d 100644
--- a/llvm/lib/Target/AMDGPU/SIInstructions.td
+++ b/llvm/lib/Target/AMDGPU/SIInstructions.td
@@ -132,7 +132,7 @@ def V_MOV_B64_PSEUDO : VPseudoInstSI <(outs VReg_64:$vdst),
   let isAsCheapAsAMove = 1;
   let isMoveImm = 1;
   let SchedRW = [Write64Bit];
-  let Size = 16; // Needs maximum 2 v_mov_b32 instructions 8 byte long each.
+  let Size = 4;
   let UseNamedOperandTable = 1;
 }
 
@@ -149,7 +149,7 @@ def S_MOV_B64_IMM_PSEUDO : SPseudoInstSI <(outs SReg_64:$sdst),
   let isAsCheapAsAMove = 1;
   let isMoveImm = 1;
   let SchedRW = [WriteSALU, Write64Bit];
-  let Size = 16; // Needs maximum 2 s_mov_b32 instructions 8 byte long each.
+  let Size = 4;
   let Uses = [];
   let UseNamedOperandTable = 1;
 }

Sisyph · 2023-10-26T21:36:19Z

llvm/lib/Target/AMDGPU/SIInstructions.td

@@ -132,7 +132,7 @@ def V_MOV_B64_PSEUDO : VPseudoInstSI <(outs VReg_64:$vdst),
  let isAsCheapAsAMove = 1;
  let isMoveImm = 1;
  let SchedRW = [Write64Bit];
-  let Size = 16; // Needs maximum 2 v_mov_b32 instructions 8 byte long each.
+  let Size = 4;


What is the exact meaning of Size?
In Target.td it says "// Size of encoded instruction", but can we make that more specific? Is it the minimum size?

rampitec · 2023-10-26

This is somewhat strange for a pseudo, which is never encoded as is, but SIInstrInfo::getInstSizeInBytes() uses it as a baseline size. If the instruction is not fixed size (and this one is not), it then adds the literal size to the base. The current value of 16 already counts the literal, so the literal gets counted a second time. Given that this is a pseudo for expansion, I'd prefer to use the minimal possible size. The 16 was written when we had no V_MOV_B64 and the pseudo was always expanded into a pair of V_MOV_B32, but even in that case it should be 8.
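The double counting described above can be illustrated with a small, self-contained model. This is only a sketch under stated assumptions: `InstrModel`, `LiteralBytes` and `modelInstSizeInBytes` are hypothetical names invented for illustration, the literal is assumed to be a single 32-bit (4-byte) value, and none of this is the actual SIInstrInfo::getInstSizeInBytes() code.

```cpp
// Hypothetical, simplified model of the size computation discussed above.
// These are NOT LLVM types; they only mirror the reasoning in the thread.
struct InstrModel {
  unsigned BaseSize; // value of the TableGen `Size` field, in bytes
  bool IsFixedSize;  // true if no literal can follow the encoding
  bool HasLiteral;   // true if an operand is not an inline constant
};

// Assumption: one trailing 32-bit literal costs 4 bytes.
constexpr unsigned LiteralBytes = 4;

// Base size plus the literal, unless the instruction is fixed size.
constexpr unsigned modelInstSizeInBytes(const InstrModel &I) {
  if (I.IsFixedSize)
    return I.BaseSize;
  return I.HasLiteral ? I.BaseSize + LiteralBytes : I.BaseSize;
}
```

In this model, a pseudo with a base of 16 and a literal operand is reported as 20 bytes, because the literal is included in the base and then added again. With a base of 4, the same pseudo is reported as 8 bytes with a literal and 4 bytes with an inline immediate.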

jayfoad · 2023-10-27

> What is the exact meaning of Size?

Good question. I think TargetInstrInfo::getInstSizeInBytes probably has to return a maximum size, since it is used by (for example) the BranchRelaxation pass. The exact meaning of the TableGen Size field is probably target-dependent.

Sisyph · 2023-10-27

It seems like getInstSizeInBytes will return the true size, neither min nor max. And I agree that the TableGen Size is an intermediate, target-defined value. We should set Size consistently on the AMDGPU target. I think it is the minimum size. There does not appear to be a natural place to document it, but perhaps a comment on SIInstrInfo::getInstSizeInBytes would do. Please consider that a nit; otherwise the patch looks good.

rampitec · 2023-10-27

By the time we run BranchRelaxation, these post-RA expandable pseudos are already gone. That is, it should really return a correct size, but only at a certain point of lowering.
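To make the BranchRelaxation concern from this thread concrete, here is a minimal sketch of why a size-based range check needs sizes that are not underestimated. The helpers `spanBytes` and `branchInRange` and the `MaxReach` parameter are hypothetical and chosen for illustration; this is not how the BranchRelaxation pass is implemented, and the limit used is not the real AMDGPU branch range.

```cpp
#include <numeric>
#include <vector>

// Hypothetical helper: total bytes spanned by a run of instructions,
// given one size estimate per instruction.
inline unsigned long long spanBytes(const std::vector<unsigned> &Sizes) {
  return std::accumulate(Sizes.begin(), Sizes.end(), 0ULL);
}

// Hypothetical range check: MaxReach is an assumed, illustrative limit.
inline bool branchInRange(const std::vector<unsigned> &Sizes,
                          unsigned long long MaxReach) {
  return spanBytes(Sizes) <= MaxReach;
}
```

If three instructions are estimated at 4 bytes each but really occupy 8 bytes each, a check against an assumed reach of 12 bytes passes on the estimates and fails on the real sizes. That is harmless only if, as noted above, the pseudos with a minimal Size have already been expanded by the time the check runs.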

rampitec requested review from jayfoad, arsenm and Sisyph October 26, 2023 20:31

llvmbot added the backend:AMDGPU label Oct 26, 2023

Sisyph reviewed Oct 26, 2023

Sisyph approved these changes Oct 27, 2023

rampitec merged commit 3e6d6f2 into llvm:main Oct 27, 2023

rampitec deleted the pseudo-size branch October 27, 2023 17:22
