Description
Bugzilla Link | 50647 |
Resolution | INVALID |
Resolved on | Jul 13, 2021 08:48 |
Version | unspecified |
OS | All |
CC | @DavidSpickett,@zygoloid |
Extended Description
I came across a code generation issue related to register allocation.
Essentially, if you have two C variables, one being uninitialized, and you use one of them in regular C code (in the example a shift right), then use both of them as inputs to an inline asm block, the two variables collide on a register. This issue does not appear in similar X86 code or in the ARM gcc compiler. However, if you initialize the variable this bug goes away, but introduces an extra mov that is unnecessary for the correctness of the code.
You can see the below example on compiler explorer here (https://godbolt.org/z/7nKhfj9E9)
The following is code that reproduces the issue
void func(unsigned long long n)
{
#ifndef FIX
unsigned long long b;
#else
//this fixes the bug but introduces an extra, extraneous mov
//the ideal code would not include this extra mov
//this is what gcc does with or without initializing the variable
unsigned long long b = 0;
#endif
//if this is not here, bug does not occur
//one of the C variables must be used to trigger the bug
n = n >> 7;
//_n should be a loop counter
//_b should be the jump address
//but _n and _b map to the same register in arm clang
//they should map to different registers, as seen in arm gcc
__asm__ volatile (
"top%=: \n\t"
"adrp %[_b], lbl%=@PAGE \n\t"
"add %[_b], %[_b], lbl%=@PAGEOFF \n\t"
"br %[_b] \n\t"
"lbl%=: \n\t"
"sub %[_n], %[_n], #​0x1 \n\t"
"cmp %[_n], #​0x0 \n\t"
"b.ne top%= \n\t"
:
: [_n] "r" (n), [_b] "r" (b)
);
}
The generated assembly is as follows
func:
lsr x8, x0, #7
top0:
adrp x8, lbl0@PAGE
add x8, x8, lbl0@PAGEOFF
br x8
lbl0:
sub x8, x8, #1
cmp x8, #0
b.ne top0
ret
Note that x0 is 'n', which after being shifted is stored in x8. But then x8 is used for both 'n' and 'b'.
I would expect the generated assembly to be
func:
lsr x9, x0, #7
top0:
adrp x8, lbl0@PAGE
add x8, x8, lbl0@PAGEOFF
br x8
lbl0:
sub x9, x9, #1
cmp x9, #0
b.ne top0
ret