Adding an unreachable branch helps optimize the code when matching on `x % N` #93514

Consider this example:
```rust
pub fn parse(x: usize) -> usize {
    match x % 5 {
        0 => f1(x),
        1 => f2(x),
        2 => f3(x),
        3 => f4(x),
        4 => f5(x),
        // 5 | 6 | 7 => loop{},
        _ => loop{},
    }
}
```
It currently generates LLVM IR similar to this:
```llvm
start:
  %_2 = urem i64 %x, 5, !dbg !10
  switch i64 %_2, label %bb12 [
    i64 0, label %bb2
    i64 1, label %bb4
    i64 2, label %bb6
    i64 3, label %bb8
    i64 4, label %bb10
  ], !dbg !11
; ...
bb12:
  br label %bb12, !dbg !32
```
Even though the default branch is unreachable (`x % 5` can never be greater than 4), it is still generated.

However, un-commenting the `5 | 6 | 7 => loop{}` line (another unreachable arm that does the same thing as the default) turns the default branch into `unreachable`:
```llvm
start:
  %_2 = urem i64 %x, 5, !dbg !10
  switch i64 %_2, label %bb14 [
    i64 0, label %bb2
    i64 1, label %bb4
    i64 2, label %bb6
    i64 3, label %bb8
    i64 4, label %bb10
  ], !dbg !11
; ...
bb14:
  unreachable
```

So, adding an **unreachable** branch that does **the same as the default** helps optimize the code.
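
For anyone who wants to reproduce this locally, here is a minimal self-contained sketch of the workaround. The bodies of `f1`..`f5` are hypothetical stand-ins (the snippet above does not show them), and `#[inline(never)]` is only there so that each arm stays a separate call in the IR.

```rust
// Hypothetical stand-ins for `f1`..`f5`; their real bodies are not part of
// the snippet above. `#[inline(never)]` keeps each arm a distinct call.
#[inline(never)]
fn f1(x: usize) -> usize { x + 1 }
#[inline(never)]
fn f2(x: usize) -> usize { x + 2 }
#[inline(never)]
fn f3(x: usize) -> usize { x + 3 }
#[inline(never)]
fn f4(x: usize) -> usize { x + 4 }
#[inline(never)]
fn f5(x: usize) -> usize { x + 5 }

pub fn parse(x: usize) -> usize {
    match x % 5 {
        0 => f1(x),
        1 => f2(x),
        2 => f3(x),
        3 => f4(x),
        4 => f5(x),
        // Never taken at runtime: `x % 5` is always in 0..=4.
        // Listing the values up to 7 explicitly is the workaround
        // described above; it does not change behavior.
        5 | 6 | 7 => loop {},
        _ => loop {},
    }
}
```

Compiling this with `--emit=llvm-ir` and optimizations enabled, with and without the `5 | 6 | 7` arm, should let you compare the default branch of the `switch`. The exact output depends on the compiler version.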

Some notes:
- These differences then propagate to the generated assembly, i.e. it still contains the unreachable branch when `5 | 6 | 7` is commented out
- [godbolt link](https://godbolt.org/z/o6hPW16MW)
- Range patterns (like `5..=7`) do **not** help
- Making the default branch `5 | 6 | 7 | _` doesn't help either
- Similar things happen for any `N` in `x % N` that is not a power of two: adding unreachable arms until the largest listed value is a power of two minus one helps optimize the code
  - [`N = 3`](https://godbolt.org/z/5ezr3PP51) (`3 =>`)
  - [`N = 9`](https://godbolt.org/z/o8rdcaK95) (`9 | 10 | 11 | 12 | 13 | 14 | 15 =>`)
  - [`N = 14`](https://godbolt.org/z/76xexbr4M) (`14 | 15 =>`)
- The `loop{}`s can be replaced with anything else, for example `unreachable!()` or `panic!()`; the effect is still the same
  - with panic-like arms, failing to remove the panicking branch seems like a big deal
  - I used `loop{}`s to make the diffs less noisy
- `if`/`else if` chains behave identically to `match` here
- clang seems to handle this situation just fine in every case: [godbolt link](https://godbolt.org/z/xf47WYE8q)
  - it also makes the last reachable branch the default in the LLVM IR, instead of generating an `unreachable` one
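
The padding rule from the `N = 3`, `N = 9` and `N = 14` examples above can be sketched as a small helper. The names `padding_values` and `padding_pattern` are made up for illustration, and the rule only restates the pattern observed in this issue; it is not a documented compiler guarantee.

```rust
/// Values that have to be listed as an extra (unreachable) arm for `x % n`,
/// following the pattern observed above: everything from `n` up to the
/// next power of two minus one. Empty when `n` is already a power of two.
/// `padding_values` is a hypothetical helper name, not an existing API.
fn padding_values(n: usize) -> Vec<usize> {
    assert!(n > 0, "`x % 0` is not a valid operation");
    // `next_power_of_two` returns the smallest power of two >= n.
    (n..n.next_power_of_two()).collect()
}

/// Renders the values as a Rust or-pattern, e.g. `5 | 6 | 7`.
fn padding_pattern(n: usize) -> String {
    padding_values(n)
        .iter()
        .map(|v| v.to_string())
        .collect::<Vec<_>>()
        .join(" | ")
}
```

For example, `padding_pattern(9)` yields `9 | 10 | 11 | 12 | 13 | 14 | 15`, matching the arm used in the `N = 9` link, while `padding_pattern(8)` yields an empty string because 8 is already a power of two.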
