@@ -28,11 +28,12 @@ them), and generate various reports for analysis, for example:
28
28
<br />
29
29
30
30
Detailed instructions and examples are documented in the
31
- [ Rust Unstable Book (under _ source-based-code-coverage_ )] [ unstable-book-sbcc ] .
31
+ [ Rust Unstable Book (under
32
+ _ compiler-flags/instrument-coverage_ )] [ unstable-book-instrument-coverage ] .
32
33
33
34
[ llvm-instrprof-increment ] : https://llvm.org/docs/LangRef.html#llvm-instrprof-increment-intrinsic
34
- [ Coverage Map ] : https://llvm.org/docs/CoverageMappingFormat.html
35
- [ unstable-book-sbcc ] : https://doc.rust-lang.org/nightly/unstable-book/compiler-flags/source-based-code -coverage.html
35
+ [ coverage map ] : https://llvm.org/docs/CoverageMappingFormat.html
36
+ [ unstable-book-instrument-coverage ] : https://doc.rust-lang.org/nightly/unstable-book/compiler-flags/instrument -coverage.html
36
37
37
38
## Rust symbol mangling
38
39
@@ -82,7 +83,7 @@ a span of code ([`CodeRegion`][code-region]). It counts the number of times a
82
83
branch is executed, and also specifies the exact location of that code span in
83
84
the Rust source code.
84
85
85
- Note that many of these ` Coverage ` statements will * not * be converted into
86
+ Note that many of these ` Coverage ` statements will _ not _ be converted into
86
87
physical counters (or any other executable instructions) in the final binary.
87
88
Some of them will be (see ` CoverageKind:: ` [ ` Counter ` ] [ counter-coverage-kind ] ),
88
89
but other counters can be computed on the fly, when generating a coverage
@@ -111,7 +112,7 @@ fn some_func(flag: bool) {
111
112
In this example, four contiguous code regions are counted while only
112
113
incrementing two counters.
113
114
114
- CFG analysis is used to not only determine * where * the branches are, for
115
+ CFG analysis is used to not only determine _ where _ the branches are, for
115
116
conditional expressions like ` if ` , ` else ` , ` match ` , and ` loop ` , but also to
116
117
determine where expressions can be used in place of physical counters.
117
118
@@ -150,50 +151,53 @@ MIR `Statement` into some backend-specific action or instruction.
150
151
match statement . kind {
151
152
...
152
153
mir :: StatementKind :: Coverage (box ref coverage ) => {
153
- self . codegen_coverage (& mut bx , coverage . clone ());
154
+ self . codegen_coverage (& mut bx , coverage . clone (), statement . source_info . scope );
154
155
bx
155
156
}
156
157
```
157
158
158
-
159
159
` codegen_coverage() ` handles each ` CoverageKind ` as follows:
160
160
161
- * For all ` CoverageKind ` s, Coverage data (counter ID, expression equation
161
+ - For all ` CoverageKind ` s, Coverage data (counter ID, expression equation
162
162
and ID, and code regions) are passed to the backend's ` Builder ` , to
163
163
populate data structures that will be used to generate the crate's
164
164
"Coverage Map". (See the [ ` FunctionCoverage ` ] [ function-coverage ] ` struct ` .)
165
- * For ` CoverageKind::Counter ` s, an instruction is injected in the backend
165
+ - For ` CoverageKind::Counter ` s, an instruction is injected in the backend
166
166
IR to increment the physical counter, by calling the ` BuilderMethod `
167
167
[ ` instrprof_increment() ` ] [ instrprof-increment ] .
168
168
169
169
``` rust
170
- pub fn codegen_coverage (& self , bx : & mut Bx , coverage : Coverage ) {
170
+ pub fn codegen_coverage (& self , bx : & mut Bx , coverage : Coverage , scope : SourceScope ) {
171
+ ...
172
+ let instance = ... // the scoped instance (current or inlined function)
171
173
let Coverage { kind , code_region } = coverage ;
172
174
match kind {
173
175
CoverageKind :: Counter { function_source_hash , id } => {
174
- if let Some (code_region ) = code_region {
175
- bx . add_coverage_counter (self . instance, id , code_region );
176
- }
176
+ ...
177
+ bx . add_coverage_counter (instance , id , code_region );
177
178
...
178
179
bx . instrprof_increment (fn_name , hash , num_counters , index );
179
180
}
180
181
CoverageKind :: Expression { id , lhs , op , rhs } => {
181
- bx . add_coverage_counter_expression (self . instance, id , lhs , op , rhs , code_region );
182
+ bx . add_coverage_counter_expression (instance , id , lhs , op , rhs , code_region );
182
183
}
183
184
CoverageKind :: Unreachable => {
184
- ...
185
+ bx . add_coverage_unreachable (
186
+ instance ,
187
+ code_region . expect (...
185
188
```
186
- _ code snippet trimmed for brevity_
189
+
190
+ _code snippet abbreviated for brevity_
187
191
188
192
> The function name `instrprof_increment ()` is taken from the LLVM intrinsic
189
- call of the same name ([ ` llvm.instrprof.increment ` ] [ llvm-instrprof-increment ] ),
190
- and uses the same arguments and types; but note that, up to and through this
191
- stage (even though modeled after LLVM's implementation for code coverage
192
- instrumentation), the data and instructions are not strictly LLVM-specific.
193
+ > call of the same name ([`llvm . instrprof. increment`][llvm - instrprof - increment ]),
194
+ > and uses the same arguments and types ; but note that , up to and through this
195
+ > stage (even though modeled after LLVM 's implementation for code coverage
196
+ > instrumentation ), the data and instructions are not strictly LLVM - specific .
193
197
>
194
198
> But since LLVM is the only Rust - supported backend with the tooling to
195
- process this form of coverage instrumentation, the backend for ` Coverage `
196
- statements is only implemented for LLVM, at this time.
199
+ > process this form of coverage instrumentation , the backend for `Coverage `
200
+ > statements is only implemented for LLVM , at this time .
197
201
198
202
[backend - lowering - mir ]: backend / lowering - mir . md
199
203
[codegen - statement ]: https : // doc.rust-lang.org/nightly/nightly-rustc/rustc_codegen_ssa/mir/struct.FunctionCx.html#method.codegen_statement
@@ -221,25 +225,29 @@ properly-configured variables in LLVM IR, according to very specific
221
225
details of the [_LLVM Coverage Mapping Format_ ][coverage - mapping - format ]
222
226
(Version 4 ). [^ llvm - and - covmap - versions ]
223
227
224
- [ ^ llvm-and-covmap-versions ] : The Rust compiler (as of <!-- date: 2021-01 -->
225
- January 2021) supports _ LLVM Coverage Mapping Format_ Version 4 (the most
226
- up-to-date version of the format, at the time of this writing) for improved
227
- compatibility with other LLVM-based compilers (like _ Clang_ ), and to take
228
- advantage of some format optimizations. Version 4 was introduced in _ LLVM 11_ ,
229
- which is currently the default LLVM version for Rust. Note that the Rust
230
- compiler optionally supports some earlier LLVM versions, prior to _ LLVM 11_ . If
231
- ` rustc ` is configured to use an incompatible version of LLVM, compiling with `-Z
232
- instrument-coverage` will generate an error message.
228
+ [^ llvm - and - covmap - versions ]:
229
+ The Rust compiler (as of <! -- date : 2021 - 01 - ->
230
+ January 2021 ) supports _LLVM Coverage Mapping Format_ Version 4 (the most
231
+ up - to - date version of the format , at the time of this writing ) for improved
232
+ compatibility with other LLVM - based compilers (like _Clang_ ), and to take
233
+ advantage of some format optimizations . Version 4 was introduced in
234
+ _LLVM 11_ , which is currently the default LLVM version for Rust . Note that
235
+ the Rust compiler optionally supports some earlier LLVM versions , prior to
236
+ _LLVM 11_ . If `rustc ` is configured to use an incompatible version of LLVM ,
237
+ compiling with `- Z instrument- coverage` will generate an error message.
233
238
234
239
```rust
235
240
pub fn finalize<'ll, 'tcx>(cx: & CodegenCx <'ll , 'tcx >) {
241
+ ...
242
+ if ! tcx. sess. instrument_coverage_except_unused_functions() {
243
+ add_unused_functions(cx);
244
+ }
245
+
236
246
let mut function_coverage_map = match cx . coverage_context () {
237
247
Some (ctx ) => ctx . take_function_coverage_map (),
238
248
None => return ,
239
249
};
240
250
...
241
- add_unreachable_coverage (tcx , & mut function_coverage_map );
242
-
243
251
let mut mapgen = CoverageMapGenerator :: new ();
244
252
245
253
for (instance , function_coverage ) in function_coverage_map {
@@ -248,66 +256,61 @@ pub fn finalize<'ll, 'tcx>(cx: &CodegenCx<'ll, 'tcx>) {
248
256
mapgen . write_coverage_mapping (expressions , counter_regions , coverage_mapping_buffer );
249
257
});
250
258
```
259
+
251
260
_ code snippet trimmed for brevity_
252
261
253
- One notable step, performed by ` mapgen::finalize() ` before processing the
254
- ` Instance ` s and their ` FunctionCoverage ` s, is the call to
255
- [ ` add_unreachable_functions() ` ] [ add-unreachable-coverage ] .
262
+ One notable first step performed by ` mapgen::finalize() ` is the call to
263
+ [ ` add_unused_functions() ` ] [ add-unused-functions ] .
256
264
257
265
When finalizing the coverage map, ` FunctionCoverage ` only has the ` CodeRegion ` s and counters for
258
266
the functions that went through codegen; such as public functions and "used" functions
259
- (functions referenced by other "used" or public items). Any other functions (considered unused
260
- or "Unreachable") were still parsed and processed through the MIR stage.
267
+ (functions referenced by other "used" or public items). Any other functions (considered unused)
268
+ were still parsed and processed through the MIR stage.
261
269
262
- The set of unreachable functions is computed via the set difference of all MIR
270
+ The set of unused functions is computed via the set difference of all MIR
263
271
` DefId ` s (` tcx ` query ` mir_keys ` ) minus the codegenned ` DefId ` s
264
- (` tcx ` query ` collect_and_partition_mono_items ` ). ` add_unreachable_functions() `
265
- computes the set of unreachable functions, queries the ` tcx ` for the
266
- previously-computed ` CodeRegions ` , for each unreachable MIR, and adds those code
267
- regions to one of the non-generic codegenned functions (non-generic avoids
268
- potentially injecting the unreachable coverage multiple times for multiple
269
- instantiations).
272
+ (` tcx ` query ` codegened_and_inlined_items ` ). ` add_unused_functions() `
273
+ computes the set of unused functions, queries the ` tcx ` for the
274
+ previously-computed ` CodeRegions ` , for each unused MIR, synthesizes an
275
+ LLVM function (with no internal statements, since it will not be called),
276
+ and adds a new ` FunctionCoverage ` , with ` Unreachable ` code regions.
270
277
271
278
[ compile-codegen-unit ] : https://doc.rust-lang.org/nightly/nightly-rustc/rustc_codegen_llvm/base/fn.compile_codegen_unit.html
272
279
[ coverageinfo-finalize ] : https://doc.rust-lang.org/nightly/nightly-rustc/rustc_codegen_llvm/context/struct.CodegenCx.html#method.coverageinfo_finalize
273
280
[ mapgen-finalize ] : https://doc.rust-lang.org/nightly/nightly-rustc/rustc_codegen_llvm/coverageinfo/mapgen/fn.finalize.html
274
281
[ coverage-mapping-format ] : https://llvm.org/docs/CoverageMappingFormat.html
275
- [ add-unreachable-coverage ] : https://doc.rust-lang.org/nightly/nightly-rustc/rustc_codegen_llvm/coverageinfo/mapgen/fn.add_unreachable_coverage .html
282
+ [ add-unused-functions ] : https://doc.rust-lang.org/nightly/nightly-rustc/rustc_codegen_llvm/coverageinfo/mapgen/fn.add_unused_functions .html
276
283
277
284
## Testing LLVM Coverage
278
285
279
286
Coverage instrumentation in the MIR is validated by a ` mir-opt ` test:
280
287
[ ` instrument-coverage ` ] [ mir-opt-test ] .
281
288
282
289
More complete testing of end-to-end coverage instrumentation and reports are done
283
- in the ` run-make-fulldeps ` tests, with sample Rust programs (to be instrumented)
284
- in the [ ` coverage ` ] [ coverage-test-samples ] directory, and the actual tests and expected
290
+ in the ` run-make ` tests, with sample Rust programs (to be instrumented) in the
291
+ [ ` coverage ` ] [ coverage-test-samples ] directory, and the actual tests and expected
285
292
results in [ ` coverage-reports ` ] .
286
293
287
- In addition to testing the final result, two intermediate results are also validated
288
- to catch potential regression errors early: Minimum ` CoverageSpan ` s computed during
289
- the ` InstrumentCoverage ` MIR pass are saved in ` mir_dump ` [ Spanview] [ spanview-debugging ]
290
- files and compared to expected results in [ ` coverage-spanview ` ] .
291
-
292
294
Finally, the [ ` coverage-llvmir ` ] test compares compiles a simple Rust program with
293
295
` -Z instrument-coverage ` and compares the compiled program's LLVM IR to expected
294
296
LLVM IR instructions and structured data for a coverage-enabled program, including
295
297
various checks for Coverage Map-related metadata and the LLVM intrinsic calls to
296
298
increment the runtime counters.
297
299
298
300
Expected results for both the ` mir-opt ` tests and the ` coverage* ` tests under
299
- ` run-make-fulldeps ` can be refreshed by running:
301
+ ` run-make ` can be refreshed by running:
300
302
301
303
``` shell
302
- $ ./x.py test src/test/< test-type> --blessed
304
+ $ ./x.py test mir-opt --blessed
305
+ $ ./x.py test src/test/run-make/coverage --blessed
303
306
```
304
307
305
308
[ mir-opt-test ] : https://github.com/rust-lang/rust/blob/master/src/test/mir-opt/instrument_coverage.rs
306
- [ coverage-test-samples ] : https://github.com/rust-lang/rust/tree/master/src/test/run-make-fulldeps /coverage
307
- [ `coverage-reports` ] : https://github.com/rust-lang/rust/tree/master/src/test/run-make-fulldeps /coverage-reports
308
- [ `coverage-spanview` ] : https://github.com/rust-lang/rust/tree/master/src/test/run-make-fulldeps /coverage-spanview
309
+ [ coverage-test-samples ] : https://github.com/rust-lang/rust/tree/master/src/test/run-make/coverage
310
+ [ `coverage-reports` ] : https://github.com/rust-lang/rust/tree/master/src/test/run-make/coverage-reports
311
+ [ `coverage-spanview` ] : https://github.com/rust-lang/rust/tree/master/src/test/run-make/coverage-spanview
309
312
[ spanview-debugging ] : compiler-debugging.md#viewing-spanview-output
310
- [ `coverage-llvmir` ] : https://github.com/rust-lang/rust/tree/master/src/test/run-make-fulldeps /coverage-llvmir
313
+ [ `coverage-llvmir` ] : https://github.com/rust-lang/rust/tree/master/src/test/run-make/coverage-llvmir
311
314
312
315
## Implementation Details of the ` InstrumentCoverage ` MIR Pass
313
316
@@ -352,11 +355,12 @@ with the following steps:
352
355
- ` inject_intermediate_expression() ` , called for each intermediate expression
353
356
returned from ` make_bcb_counters() `
354
357
355
- [ ^ intermediate-expressions ] : Intermediate expressions are sometimes required
356
- because ` Expression ` s are limited to binary additions or subtractions. For
357
- example, ` A + (B - C) ` might represent an ` Expression ` count computed from three
358
- other counters, ` A ` , ` B ` , and ` C ` , but computing that value requires an
359
- intermediate expression for ` B - C ` .
358
+ [ ^ intermediate-expressions ] :
359
+ Intermediate expressions are sometimes required
360
+ because ` Expression ` s are limited to binary additions or subtractions. For
361
+ example, ` A + (B - C) ` might represent an ` Expression ` count computed from three
362
+ other counters, ` A ` , ` B ` , and ` C ` , but computing that value requires an
363
+ intermediate expression for ` B - C ` .
360
364
361
365
[ instrumentor ] : https://doc.rust-lang.org/nightly/nightly-rustc/rustc_mir/transform/coverage/struct.Instrumentor.html
362
366
[ coverage-graph ] : https://doc.rust-lang.org/nightly/nightly-rustc/rustc_mir/transform/coverage/graph/struct.CoverageGraph.html
@@ -396,20 +400,21 @@ contrast with the [`SimplifyCfg`][simplify-cfg] MIR pass, this step does
396
400
not alter the MIR itself, because the ` CoverageGraph ` aggressively simplifies
397
401
the CFG, and ignores nodes that are not relevant to coverage. For example:
398
402
399
- * The BCB CFG ignores (excludes) branches considered not relevant
400
- to the current coverage solution. It excludes unwind-related code[ ^ 78544 ]
401
- that is injected by the Rust compiler but has no physical source
402
- code to count, which allows a ` Call ` -terminated BasicBlock
403
- to be merged with its successor, within a single BCB.
404
- * A ` Goto ` -terminated ` BasicBlock ` can be merged with its successor
405
- *** as long as*** it has the only incoming edge to the successor ` BasicBlock ` .
406
- * Some BasicBlock terminators support Rust-specific concerns--like borrow-checking--that are
407
- not relevant to coverage analysis. ` FalseUnwind ` , for example, can be treated the same as
408
- a ` Goto ` (potentially merged with its successor into the same BCB).
409
-
410
- [ ^ 78544 ] : (Note, however, that Issue [ #78544 ] [ rust-lang/rust#78544 ] considers
411
- providing future support for coverage of programs that intentionally
412
- ` panic ` , as an option, with some non-trivial cost.)
403
+ - The BCB CFG ignores (excludes) branches considered not relevant
404
+ to the current coverage solution. It excludes unwind-related code[ ^ 78544 ]
405
+ that is injected by the Rust compiler but has no physical source
406
+ code to count, which allows a ` Call ` -terminated BasicBlock
407
+ to be merged with its successor, within a single BCB.
408
+ - A ` Goto ` -terminated ` BasicBlock ` can be merged with its successor
409
+ ** _ as long as_ ** it has the only incoming edge to the successor ` BasicBlock ` .
410
+ - Some BasicBlock terminators support Rust-specific concerns--like borrow-checking--that are
411
+ not relevant to coverage analysis. ` FalseUnwind ` , for example, can be treated the same as
412
+ a ` Goto ` (potentially merged with its successor into the same BCB).
413
+
414
+ [ ^ 78544 ] :
415
+ (Note, however, that Issue [ #78544 ] [ rust-lang/rust#78544 ] considers
416
+ providing future support for coverage of programs that intentionally
417
+ ` panic ` , as an option, with some non-trivial cost.)
413
418
414
419
The BCB CFG is critical to simplifying the coverage analysis by ensuring graph path-based
415
420
queries (` is_dominated_by() ` , ` predecessors ` , ` successors ` , etc.) have branch (control flow)
@@ -418,10 +423,11 @@ significance.
418
423
To visualize the ` CoverageGraph ` , you can generate a _ graphviz_ ` *.dot `
419
424
file with the following ` rustc ` flags:[ ^ graphviz-dark-mode ]
420
425
421
- [ ^ graphviz-dark-mode ] : This image also applies ` -Z graphviz-dark-mode ` , to
422
- produce a Graphviz document with "dark mode" styling. If you use a dark mode or
423
- theme in your development environment, you will probably want to use this
424
- option so you can review the graphviz output without straining your vision.
426
+ [ ^ graphviz-dark-mode ] :
427
+ This image also applies ` -Z graphviz-dark-mode ` , to
428
+ produce a Graphviz document with "dark mode" styling. If you use a dark mode or
429
+ theme in your development environment, you will probably want to use this
430
+ option so you can review the graphviz output without straining your vision.
425
431
426
432
``` shell
427
433
$ rustc -Z instrument-coverage -Z dump-mir=InstrumentCoverage \
@@ -448,19 +454,19 @@ directional edges (the arrows) leading from each node to its `successors()`.
448
454
The nodes contain information in sections:
449
455
450
456
1 . The gray header has a label showing the BCB ID (or _ index_ for looking up
451
- its ` BasicCoverageBlockData ` ).
457
+ its ` BasicCoverageBlockData ` ).
452
458
2 . The first content section shows the assigned ` Counter ` or ` Expression ` for
453
- each contiguous section of code. (There may be more than one ` Expression `
454
- incremented by the same ` Counter ` for discontiguous sections of code representing
455
- the same sequential actions.) Note the code is represented by the line and
456
- column ranges (for example: ` 52:28-52:33 ` , representing the original source
457
- line 52, for columns 28-33). These are followed by the MIR ` Statement ` or
458
- ` Terminator ` represented by that source range. (How these coverage regions
459
- are determined is discussed in the following section.)
459
+ each contiguous section of code. (There may be more than one ` Expression `
460
+ incremented by the same ` Counter ` for discontiguous sections of code representing
461
+ the same sequential actions.) Note the code is represented by the line and
462
+ column ranges (for example: ` 52:28-52:33 ` , representing the original source
463
+ line 52, for columns 28-33). These are followed by the MIR ` Statement ` or
464
+ ` Terminator ` represented by that source range. (How these coverage regions
465
+ are determined is discussed in the following section.)
460
466
3 . The final section(s) show the MIR ` BasicBlock ` s (by ID/index and its
461
- ` TerminatorKind ` ) contained in this BCB. The last BCB is separated out because
462
- its ` successors() ` determine the edges leading out of the BCB, and into
463
- the ` leading_bb() ` (first ` BasicBlock ` ) of each successor BCB.
467
+ ` TerminatorKind ` ) contained in this BCB. The last BCB is separated out because
468
+ its ` successors() ` determine the edges leading out of the BCB, and into
469
+ the ` leading_bb() ` (first ` BasicBlock ` ) of each successor BCB.
464
470
465
471
Note, to find the ` BasicCoverageBlock ` from a final BCB ` Terminator ` 's
466
472
successor ` BasicBlock ` , there is an index and helper
@@ -572,7 +578,7 @@ incoming edges. Given the following graph, for example, the count for
572
578
573
579
In this situation, BCB node ` B ` may require an edge counter for its
574
580
"edge from A", and that edge might be computed from an ` Expression ` ,
575
- ` Counter(A) - Counter(C) ` . But an expression for the BCB _ node_ ` B `
581
+ ` Counter(A) - Counter(C) ` . But an expression for the BCB _ node_ ` B `
576
582
would be the sum of all incoming edges:
577
583
578
584
``` text
0 commit comments