Rewrite `pointer::with_addr` implementation #110318

WaffleLapkin · 2023-04-14T10:03:53Z

This commit changes the impl from (essentially)

ptr.wrapping_byte_offset(new_addr - self.addr())

to

ptr.wrapping_byte_sub(self.addr()).wrapping_byte_add(new_addr)

While being essentially the same (a-a+b vs a+(b-a)) new implementation generation 2 GEPs, instead of an arithmetic + 1 GEP, which turns out to be easier for LLVM to optimize and reason about.
_{(ptradd instead of getelementptr when .-.)}

Idea by @scottmcm, should fix the problem I've found while working on #110243, which I documented in the LLVM issue: llvm/llvm-project#62093 (TL;DR: the code for .map_addr(|a| a << 2) does not get optimized properly).

Here is a comparison specifically for addr << 2: https://rust.godbolt.org/z/T4oWfx4sb
LLVM-IR comparison between different impls: https://rust.godbolt.org/z/WhKa7b8dd (the new impl is technically more llvm-ir, but it looks like this form is easier to optimize/reason about/etc at least atm)

This commit changes the impl from (essentially) ```rust ptr.wrapping_byte_offset(new_addr - self.addr()) ``` to ```rust ptr.wrapping_byte_sub(self.addr()).wrapping_byte_add(new_addr) ``` While being essentially the same (`a-a+b` vs `a+(b-a)`) new implementation generation 2 GEPs, instead of an arithmetic + 1 GEP, which turns out to be easier for LLVM to optimize and reason about. (`ptradd` instead of `getelementptr` when .-.)

rustbot · 2023-04-14T10:04:00Z

r? @m-ou-se

(rustbot has picked a reviewer for you, use r? to override)

nikic · 2023-04-14T14:20:13Z

Upstream patch: https://reviews.llvm.org/D148341

I'm not sure this is a good idea -- I expect that this will optimize worse in cases where you aren't doing something extremely weird.

Also worth noting that this formulation is not CHERI compatible.

scottmcm · 2023-04-14T15:28:29Z

library/core/src/ptr/const_ptr.rs

+        // Note that we are using two offsets, instead of precomputing offset
+        // with `addr - self.addr()`. This is because it is easier for LLVM to
+        // optimize such code.
+        self.wrapping_byte_sub(self.addr()).wrapping_byte_add(addr)


As I was playing yesterday, I noticed another possible implementation:

Suggested change

self.wrapping_byte_sub(self.addr()).wrapping_byte_add(addr)

self.mask(0).wrapping_byte_add(addr)

That's again just two IR instructions (https://rust.godbolt.org/z/EeraoxKnh), and it has the bonus of never doing a ptrtoint.

Any thoughts on whether that's reasonable or horrible, @nikic?

That's very horrible. ptrmask has about zero optimization support.

WaffleLapkin · 2023-04-14T17:05:51Z

@nikic why is it not CHERI compatible, does it not allow to temporary create invalid pointers? 🤔

nikic · 2023-04-14T17:08:43Z

@nikic why is it not CHERI compatible, does it not allow to temporary create invalid pointers? thinking

I don't think so, but probably @jrtc27 can clarify.

jrtc27 · 2023-04-14T17:24:48Z

This will take the pointer temporarily way out of bounds, which is not supported on CHERI due to the bounds compression scheme in use. Depending on what IR this maps to I would expect LLVM to be able to perform the transformation you want on non-CHERI architectures for you.

WaffleLapkin · 2023-04-14T17:29:35Z

This will take the pointer temporarily way out of bounds, which is not supported on CHERI due to the bounds compression scheme in use.

Well, TIL. Does this also mean that no pointer tagging is ever possible on CHERI?

Anyway, I'm not particularly attached to this change. IMO it is a bit nicer code-wise and looks more logical to me. But if it adds problems for CHERI and even LLVM, we can just close this (especially given the candidate patch which should probably fix the original issue).

jrtc27 · 2023-04-14T17:33:16Z

This will take the pointer temporarily way out of bounds, which is not supported on CHERI due to the bounds compression scheme in use.

Well, TIL. Does this also mean that no pointer tagging is ever possible on CHERI?

Anyway, I'm not particularly attached to this change. IMO it is a bit nicer code-wise and looks more logical to me. But if it adds problems for CHERI and even LLVM, we can just close this (especially given the candidate patch which should probably fix the original issue).

Pointer tagging works provided you use low bits. Some implementations may also provide a way to set high bits without affecting bounds (e.g. as seen in Morello) but that is not portable across CHERI implementations. If you want to start performing arbitrary transformations on addresses and expect to be able to recover a usable pointer then CHERI cannot do that in its current form due to the compression scheme employed (we used to have an uncompressed variant that could, but quadrupling the pointer size is rather less acceptable than merely doubling).

WaffleLapkin · 2023-04-14T17:35:52Z

Pointer tagging works provided you use low bits.

Can't this also move the pointer out of the allocation? Or is the change of value in low bits small enough that it doesn't break compression?

jrtc27 · 2023-04-14T17:42:45Z

Pointer tagging works provided you use low bits.

Can't this also move the pointer out of the allocation?

Yes.

Or is the change of value in low bits small enough that it doesn't break compression?

Yes. Note I said way out of bounds before, not just out of bounds. The architecture gives you 1/8th range either side (one direction is in fact 1/4, though I forget if above or below), with a minimum for small allocations that varies based on the number of bits of precision used (think mantissa width for floating point), but is on the order of KiB for all current 64-bit implementations.

(I’m ignoring Microsoft’s CHERIoT here which is far stricter and thus more hostile to code that likes to play games with pointers, but can get away with it due to being used in an embedded context where you have much greater control over the code being compiled and people are more willing to adapt to weird and wonderful architectures)

WaffleLapkin · 2023-04-14T17:51:14Z

@jrtc27 I see, thanks for the explanation! <3

WaffleLapkin · 2023-04-17T10:53:03Z

Closing as per the concerns above, given that LLVM issue was fixed.

WaffleLapkin added T-libs Relevant to the library team, which will review and decide on the PR/issue. A-raw-pointers Area: raw pointers, MaybeUninit, NonNull A-strict-provenance Area: Strict provenance for raw pointers labels Apr 14, 2023

rustbot assigned m-ou-se Apr 14, 2023

rustbot added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Apr 14, 2023

This comment was marked as resolved.

Sign in to view

scottmcm reviewed Apr 14, 2023

View reviewed changes

WaffleLapkin closed this Apr 17, 2023

WaffleLapkin deleted the with_dumber_addr branch April 17, 2023 10:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Rewrite `pointer::with_addr` implementation #110318

Rewrite `pointer::with_addr` implementation #110318

Uh oh!

WaffleLapkin commented Apr 14, 2023

Uh oh!

rustbot commented Apr 14, 2023

Uh oh!

This comment was marked as resolved.

nikic commented Apr 14, 2023

Uh oh!

scottmcm Apr 14, 2023

Uh oh!

nikic Apr 14, 2023

Uh oh!

WaffleLapkin commented Apr 14, 2023

Uh oh!

nikic commented Apr 14, 2023

Uh oh!

jrtc27 commented Apr 14, 2023

Uh oh!

WaffleLapkin commented Apr 14, 2023

Uh oh!

jrtc27 commented Apr 14, 2023

Uh oh!

WaffleLapkin commented Apr 14, 2023

Uh oh!

jrtc27 commented Apr 14, 2023

Uh oh!

WaffleLapkin commented Apr 14, 2023

Uh oh!

WaffleLapkin commented Apr 17, 2023

Uh oh!

Uh oh!

	self.wrapping_byte_sub(self.addr()).wrapping_byte_add(addr)
	self.mask(0).wrapping_byte_add(addr)

Rewrite pointer::with_addr implementation #110318

Rewrite pointer::with_addr implementation #110318

Uh oh!

Conversation

WaffleLapkin commented Apr 14, 2023

Uh oh!

rustbot commented Apr 14, 2023

Uh oh!

This comment was marked as resolved.

nikic commented Apr 14, 2023

Uh oh!

scottmcm Apr 14, 2023

Choose a reason for hiding this comment

Uh oh!

nikic Apr 14, 2023

Choose a reason for hiding this comment

Uh oh!

WaffleLapkin commented Apr 14, 2023

Uh oh!

nikic commented Apr 14, 2023

Uh oh!

jrtc27 commented Apr 14, 2023

Uh oh!

WaffleLapkin commented Apr 14, 2023

Uh oh!

jrtc27 commented Apr 14, 2023

Uh oh!

WaffleLapkin commented Apr 14, 2023

Uh oh!

jrtc27 commented Apr 14, 2023

Uh oh!

WaffleLapkin commented Apr 14, 2023

Uh oh!

WaffleLapkin commented Apr 17, 2023

Uh oh!

Uh oh!

Rewrite `pointer::with_addr` implementation #110318

Rewrite `pointer::with_addr` implementation #110318