Transition rustc Parser to proc_macro token model

Currently, there are two different approaches for dealing with composite tokens like `>>` in rustc.

1. Keep tokens in composed form, and split into pieces, `>` and `>`, when necessary.
2. Keep tokens decomposed, with jointness information, and join tokens when necessary.

At the moment, the first approach is used by the parser, and the second approach is used by the proc_macro API. It would be good to move the parser to the decomposed approach as well, as it is somewhat more natural, more future-compatible (one can introduce new tokens) and having two of a thing is bad in itself!

Here are some relevant bits of the code that handle composed model:

* Composed tokens as produced by [rustc_lexer](https://github.com/rust-lang/rust/blob/71e2882973e63b9ddc837a61ac8631e6451d31a9/src/librustc_lexer/src/lib.rs#L271-L281)
* Composed tokens preserved by the [token cooking](https://github.com/rust-lang/rust/blob/71e2882973e63b9ddc837a61ac8631e6451d31a9/src/libsyntax/parse/lexer/mod.rs#L306)
* Here's the bit when we produce [a TokenTree](https://github.com/rust-lang/rust/blob/71e2882973e63b9ddc837a61ac8631e6451d31a9/src/libsyntax/parse/lexer/tokentrees.rs#L207-L210), consumed by the parser. Note that, although we are tracking jointness here, the tokens are composed.
* Here's the bit of the parser which [decomposes](https://github.com/rust-lang/rust/blob/71e2882973e63b9ddc837a61ac8631e6451d31a9/src/libsyntax/parse/parser.rs#L700-L736) tokens on the fly.

Here are the bits relevant to decomposed model:

* Gluing tokens in [TokenStreamBuilder](https://github.com/rust-lang/rust/blob/71e2882973e63b9ddc837a61ac8631e6451d31a9/src/libsyntax/tokenstream.rs#L412-L429)
* [Token::glue](https://github.com/rust-lang/rust/blob/71e2882973e63b9ddc837a61ac8631e6451d31a9/src/libsyntax/parse/token.rs#L554-L612)

Note that the `tt` matcher in `macro_rules` eats one composed token, and this is affects language specification.
That is, when we transition to decomposed model, we'll need to fix [this code](https://github.com/rust-lang/rust/blob/71e2882973e63b9ddc837a61ac8631e6451d31a9/src/libsyntax/ext/tt/macro_parser.rs#L903-L905) to eat one *composed* token to maintain backwards compatibility.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Transition rustc Parser to proc_macro token model #63689

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Transition rustc Parser to proc_macro token model #63689

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions