Confusing debugger line numbers emitted by compiler around lambdas (change of behavior since Scala 2)

## Compiler version
3.1.2 (but valid for all Scala 3 versions)

## Minimized code

```Scala
object main {
  def main(args: Array[String]): Unit = {
    Array(1).foreach { n =>
      val x = 123
      println(n)
    }
  }
}
```

## Inspecting the bytecode produced by Scala 2 (matches between Scala 2.12 and Scala 2.13)
```
  public main([Ljava/lang/String;)V
    // parameter final  args
   L0
    GETSTATIC scala/collection/ArrayOps$.MODULE$ : Lscala/collection/ArrayOps$;
   L1
    LINENUMBER 3 L1
    GETSTATIC scala/Predef$.MODULE$ : Lscala/Predef$;
    ICONST_1
    NEWARRAY T_INT
    DUP
    ICONST_0
    ICONST_1
    IASTORE
    INVOKEVIRTUAL scala/Predef$.intArrayOps ([I)Ljava/lang/Object;
<---------- Line added by me, for emphasis, INVOKEDYNAMIC instruction is a part of the L1 label above
    INVOKEDYNAMIC apply$mcVI$sp()Lscala/runtime/java8/JFunction1$mcVI$sp; [
      // handle kind 0x6 : INVOKESTATIC
      java/lang/invoke/LambdaMetafactory.altMetafactory(Ljava/lang/invoke/MethodHandles$Lookup;Ljava/lang/String;Ljava/lang/invoke/MethodType;[Ljava/lang/Object;)Ljava/lang/invoke/CallSite;
      // arguments:
      (I)V, 
      // handle kind 0x6 : INVOKESTATIC
      main$.$anonfun$main$1(I)V, 
      (I)V, 
      1
    ]
    INVOKEVIRTUAL scala/collection/ArrayOps$.foreach$extension (Ljava/lang/Object;Lscala/Function1;)V
    RETURN
   L2
    LOCALVARIABLE this Lmain$; L0 L2 0
    LOCALVARIABLE args [Ljava/lang/String; L0 L2 1
    MAXSTACK = 6
    MAXLOCALS = 2
```
Notice the line added by me in the output. It signifies that the INVOKEDYNAMIC instruction which runs the lambda expression is part of the L1 label, where the array is initialized. This means that the creation of the array and the INVOKEDYNAMIC instruction are part of the _same_ label. This is important later.

## Inspecting the bytecode produced by Scala 3
```
  public main([Ljava/lang/String;)V
    // parameter final  args
   L0
    LINENUMBER 2 L0
    LINENUMBER 3 L0
    GETSTATIC scala/Predef$.MODULE$ : Lscala/Predef$;
    ICONST_1
    NEWARRAY T_INT
    DUP
    ICONST_0
    ICONST_1
    IASTORE
    INVOKEVIRTUAL scala/Predef$.intArrayOps ([I)Ljava/lang/Object;
    ASTORE 2
    GETSTATIC scala/collection/ArrayOps$.MODULE$ : Lscala/collection/ArrayOps$;
    ALOAD 2
   L1
    LINENUMBER 5 L1
    ALOAD 0
<---------- Line added by me, for emphasis, INVOKEDYNAMIC instruction is a part of the L1 label above
    INVOKEDYNAMIC apply$mcVI$sp(Lmain$;)Lscala/runtime/java8/JFunction1$mcVI$sp; [
      // handle kind 0x6 : INVOKESTATIC
      java/lang/invoke/LambdaMetafactory.altMetafactory(Ljava/lang/invoke/MethodHandles$Lookup;Ljava/lang/String;Ljava/lang/invoke/MethodType;[Ljava/lang/Object;)Ljava/lang/invoke/CallSite;
      // arguments:
      (I)V, 
      // handle kind 0x7 : INVOKESPECIAL
      main$.main$$anonfun$1(I)V, 
      (I)V, 
      1
    ]
    INVOKEVIRTUAL scala/collection/ArrayOps$.foreach$extension (Ljava/lang/Object;Lscala/Function1;)V
    RETURN
   L2
    LOCALVARIABLE this Lmain$; L0 L2 0
    LOCALVARIABLE args [Ljava/lang/String; L0 L2 1
    MAXSTACK = 5
    MAXLOCALS = 3
```
Notice the line added by me in the output. It signifies that the INVOKEDYNAMIC instruction which runs the lambda expression is part of the L1 label, but in this case, the array is initialized in the L0 label. Thus, the creation of the label and the INVOKEDYNAMIC instruction are parts of _different_ labels, contrasting the Scala 2 case.

Now, this is not an inherent problem in and of itself.

The problem comes from the fact that the different Labels (L0 and L1 in the Scala 3 case) have different LINENUMBERs attached to them (2 and 3 for L0 and 5 for L1). Thus, the INVOKEDYNAMIC instruction is attached to LINENUMBER 5.

Compared to the Scala 2 case, the INVOKEDYNAMIC instruction does not have its own LINENUMBER. (Also notice that the whole bytecode of the `main` method does not mention LINENUMBER 5 anywhere. This will be important later.)

This distinction can clearly be seen in a debugger. I will give examples in both IntelliJ IDEA and Metals and they both have the same behavior, resulting in bad UX.

## IntelliJ IDEA Debugger screenshot with Scala 2.13.8
<img width="1566" alt="Screen Shot 2022-05-04 at 11 04 24" src="https://user-images.githubusercontent.com/98332773/166652182-f7ae2fe5-8667-415b-b131-cd21e888f341.png">

Notice that the debugger is stopped on Line 5, which is in the stack frame of the `$anonfun$main$1` method (the lambda body) and that the `n = 1` and x = 123` local variables are visible.

## Metals Debugger screenshot with Scala 2.13
<img width="1566" alt="Screen Shot 2022-05-04 at 11 06 44" src="https://user-images.githubusercontent.com/98332773/166652502-643583db-3ab9-4bbf-b8f1-ec8f518fd59c.png">
Same exact situation in Metals, Line 5, stack frame of `$anonfun$main$1` method and `n = 1` and `x = 123` local variables visible.

## IntelliJ IDEA Debugger screenshots with Scala 3.1.2
<img width="1566" alt="Screen Shot 2022-05-04 at 11 08 57" src="https://user-images.githubusercontent.com/98332773/166652911-b1e7980a-99db-4b7e-9d87-07c1cf7896a1.png">

Notice that the debugger is stopped on Line 5, but this Line 5 is in the stack frame of the `main` method (stemming from the LINENUMBER 5 emitted in the bytecode above). The local variables `n = 1` and `x = 123` are _not_ visible. (The red message that the local variable `n` cannot be found stems from this fact, but its presentation is otherwise a filtering bug in the UI and can be ignored).

_**Resuming**_ the program gets us where we want to be (and used to be with Scala 2). Line 5 of `main$$anonfun$1` (the lambda body) with `n = 1` and `x = 123` local variables visible.
<img width="1566" alt="Screen Shot 2022-05-04 at 11 12 00" src="https://user-images.githubusercontent.com/98332773/166653349-284c8431-349d-41cc-b356-2168311ea188.png">

## Metals Debugger screenshots with Scala 3.1.2
<img width="1566" alt="Screen Shot 2022-05-04 at 11 14 20" src="https://user-images.githubusercontent.com/98332773/166653715-43e64b64-26c7-48e1-94d4-c6c38cbe0d83.png">

Notice that the debugger is stopped on Line 5, but again, in the `main` method, with `n = 1` and `x = 123` again, _not_ visible.

<img width="1566" alt="Screen Shot 2022-05-04 at 11 17 56" src="https://user-images.githubusercontent.com/98332773/166654188-be8d0a6a-d947-4814-973d-b231745e2f16.png">

Again, _**resuming**_ the program brings us to the proper breakpoint in the lambda body. Line 5 of `main$$anonfun$1` with `n = 1` and `x = 123` local variables visible.

## What if we stop on Line 4?

Line 4 is not an ambiguous line (it only exists in the lambda body bytecode) and thus the debugger stops fine on it on the first try. Both in IntelliJ and in Metals. In this case, we're correctly in the stack frame of `main$$anonfun$1` and the lambda argument `n = 1` is visible (the `x = 123` local variable is not initialized yet, and thus not visible, this is by design).

<img width="1566" alt="Screen Shot 2022-05-04 at 11 20 51" src="https://user-images.githubusercontent.com/98332773/166654694-39380759-3616-4e5b-91f1-c2109359d169.png">

<img width="1566" alt="Screen Shot 2022-05-04 at 11 23 12" src="https://user-images.githubusercontent.com/98332773/166654977-6383df2e-5f3e-4ac5-854b-38758e8a40f0.png">

## Conclusion
Only the final line of a multi-line lambda is ambiguous, and stopping on it actually stops on 2 breakpoints, with the second one matching the user intentions more closely, resulting in bad UX. From a source code perspective, the final line of a lambda body should be the same as any other, and IMO, not ambiguous in the outer context.

## Discussion
I'm not saying that the emitted LINENUMBER is irrelevant and can be dropped, I frankly don't have the whole picture to make a call like that. I would like to know some of the reasoning behind that decision and work with you to arrive at a solution that would benefit the end user the most, in all debuggers.

Thanks for reading and thanks in advance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Confusing debugger line numbers emitted by compiler around lambdas (change of behavior since Scala 2) #15098

Compiler version

Minimized code

Inspecting the bytecode produced by Scala 2 (matches between Scala 2.12 and Scala 2.13)

Inspecting the bytecode produced by Scala 3

IntelliJ IDEA Debugger screenshot with Scala 2.13.8

Metals Debugger screenshot with Scala 2.13

IntelliJ IDEA Debugger screenshots with Scala 3.1.2

Metals Debugger screenshots with Scala 3.1.2

What if we stop on Line 4?

Conclusion

Discussion

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Confusing debugger line numbers emitted by compiler around lambdas (change of behavior since Scala 2) #15098

Description

Compiler version

Minimized code

Inspecting the bytecode produced by Scala 2 (matches between Scala 2.12 and Scala 2.13)

Inspecting the bytecode produced by Scala 3

IntelliJ IDEA Debugger screenshot with Scala 2.13.8

Metals Debugger screenshot with Scala 2.13

IntelliJ IDEA Debugger screenshots with Scala 3.1.2

Metals Debugger screenshots with Scala 3.1.2

What if we stop on Line 4?

Conclusion

Discussion

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions