Handle filemaps with length 0 in codemap lookups #11904

nrc · 2014-01-29T08:53:01Z

No description provided.

sanxiyn · 2014-01-29T13:05:06Z

This needs a test.

brson · 2014-02-11T00:40:29Z

@nick29581 Needs a test case to capture the problem here.

nrc · 2014-02-11T04:38:34Z

Sorry, this has kind of been on the back burner. The bad news is the patch got a lot larger since rebasing. The other bad news is that I haven't managed to get a more minimal test case than running DXR on librustdoc. I'm not actually sure how to create a filemap with length 0 - I see them when running DXR indexing, but I can't create them in a minimal test case.

I guess I might be able to come up with a unit test for this.

jdm · 2014-02-11T13:20:43Z

It might be something to do with macros. That sounds like something @michaelwoerister came across when looking up spans for debug symbols.

sanxiyn · 2014-02-11T17:27:26Z

I think a unit test is acceptable. Regarding "running DXR on librustdoc", I think it should be filed as a separate issue. LGTM otherwise.

michaelwoerister · 2014-02-11T17:34:06Z

@jdm I can't remember encountering this as an issue. There has recently been a small problem with invalid filemaps from include_str!() but that was with the start position always being set to zero, not the length. I can't help you out on this one.

… to bytepos_to_charpos.

nrc · 2014-02-19T01:26:17Z

Now with tests. (Also renamed bytepos_to_local_charpos to reflect what it actually does).

alexcrichton · 2014-02-19T08:34:05Z

src/libsyntax/codemap.rs

-    // located in
-    fn bytepos_to_local_charpos(&self, bpos: BytePos) -> CharPos {
+    // Converts an absolute BytePos to a CharPos relative to the codemap.
+    fn bytepos_to_charpos(&self, bpos: BytePos) -> CharPos {


Just confirming, but the documentation and name has changed, but neither has the functionality.

Shouldn't we rather fix the return value? i.e. CharPos(bpos.to_uint() - map.start_pos - total_extra_bytes)

I have not changed the behaviour. As far as I can tell, the behaviour was non-local.

I don't see why we should change the behaviour - the users seem to expect the current behaviour and it is a private fn. Also, it is not that simple to change because start_pos does not take account of any extra bytes. I guess you could change the total_extra_bytes to only count on the current line, but perhaps that won't work. Given that nothing is broken other than the name, I don't think we should change it.

I think there might be a bug in there. bytepos_to_local_charpos() is used in just two places, and then the difference between the two results is used. Both values are off by the same value (filemap.start_pos) so the difference is correct again.

The char-pos calculated by the function can't be global to the whole codemap because it's it does not take into account any filemap before the current one.

Ah good point. I'll try and come up with a better fix in another PR.

Update regex-syntax to support new word boundry assertions From the regex v1.10.0 release notes [1]: This is a new minor release of regex that adds support for start and end word boundary assertions. [...] The new word boundary assertions are: • \< or \b{start}: a Unicode start-of-word boundary (\W|\A on the left, \w on the right). • \> or \b{end}: a Unicode end-of-word boundary (\w on the left, \W|\z on the right)). • \b{start-half}: half of a Unicode start-of-word boundary (\W|\A on the left). • \b{end-half}: half of a Unicode end-of-word boundary (\W|\z on the right). [1]: https://github.com/rust-lang/regex/blob/master/CHANGELOG.md#1100-2023-10-09 changelog: [`regex`]: add support for start and end word boundary assertions ("\<", "\b{start}", etc.) introduced in regex v0.10

Fix bug with zero-length filemaps and rename bytepos_to_local_charpos…

418eea1

… to bytepos_to_charpos.

alexcrichton reviewed Feb 19, 2014
View reviewed changes

bors added a commit that referenced this pull request Feb 19, 2014

auto merge of #11904 : nick29581/rust/0filemap, r=alexcrichton

1228fb0

bors merged commit 418eea1 into rust-lang:master Feb 19, 2014

nrc deleted the 0filemap branch February 24, 2014 00:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle filemaps with length 0 in codemap lookups #11904

Handle filemaps with length 0 in codemap lookups #11904

nrc commented Jan 29, 2014

sanxiyn commented Jan 29, 2014

brson commented Feb 11, 2014

nrc commented Feb 11, 2014

jdm commented Feb 11, 2014

sanxiyn commented Feb 11, 2014

michaelwoerister commented Feb 11, 2014

nrc commented Feb 19, 2014

alexcrichton Feb 19, 2014

michaelwoerister Feb 19, 2014

nrc Feb 19, 2014

michaelwoerister Feb 19, 2014

nrc Feb 19, 2014

Handle filemaps with length 0 in codemap lookups #11904

Handle filemaps with length 0 in codemap lookups #11904

Conversation

nrc commented Jan 29, 2014

sanxiyn commented Jan 29, 2014

brson commented Feb 11, 2014

nrc commented Feb 11, 2014

jdm commented Feb 11, 2014

sanxiyn commented Feb 11, 2014

michaelwoerister commented Feb 11, 2014

nrc commented Feb 19, 2014

alexcrichton Feb 19, 2014

Choose a reason for hiding this comment

michaelwoerister Feb 19, 2014

Choose a reason for hiding this comment

nrc Feb 19, 2014

Choose a reason for hiding this comment

michaelwoerister Feb 19, 2014

Choose a reason for hiding this comment

nrc Feb 19, 2014

Choose a reason for hiding this comment