Improve debug names index fetching global variables performance #70231

jeffreytan81 · 2023-10-25T17:26:41Z

While using dwarf5 .debug_names in internal large targets, we noticed a performance issue (around 10 seconds delay) while lldb-vscode tries to show scopes for a compile unit. Profiling shows the bottleneck is inside DebugNamesDWARFIndex::GetGlobalVariables which linearly search all index entries belongs to a compile unit.

This patch improves the performance by using the compile units list to filter first before checking index entries. This significantly improves the performance (drops from 10 seconds => under 1 second) in the split dwarf situation because each compile unit has its own named index.

clayborg

Looks good, just rename the "has_match_cu" to "cu_matches" and this is good to go.

clayborg · 2023-10-25T17:33:50Z

lldb/source/Plugins/SymbolFile/DWARF/DebugNamesDWARFIndex.cpp

@@ -130,6 +130,17 @@ void DebugNamesDWARFIndex::GetGlobalVariables(
  uint64_t cu_offset = cu.GetOffset();
  bool found_entry_for_cu = false;
  for (const DebugNames::NameIndex &ni: *m_debug_names_up) {
+    // Check if this name index contains an entry for the given CU.
+    bool has_match_cu = false;


I would rename this maybe to "cu_matches".

github-actions · 2023-10-25T17:41:02Z

✅ With the latest revision this PR passed the C/C++ code formatter.

llvmbot · 2023-10-25T18:46:45Z

@llvm/pr-subscribers-lldb

Author: None (jeffreytan81)

Changes

While using dwarf5 .debug_names in internal large targets, we noticed a performance issue (around 10 seconds delay) while lldb-vscode tries to show scopes for a compile unit. Profiling shows the bottleneck is inside DebugNamesDWARFIndex::GetGlobalVariables which linearly search all index entries belongs to a compile unit.

This patch improves the performance by using the compile units list to filter first before checking index entries. This significantly improves the performance (drops from 10 seconds => under 1 second) in the split dwarf situation because each compile unit has its own named index.

Full diff: https://github.com/llvm/llvm-project/pull/70231.diff

1 Files Affected:

(modified) lldb/source/Plugins/SymbolFile/DWARF/DebugNamesDWARFIndex.cpp (+13-2)

diff --git a/lldb/source/Plugins/SymbolFile/DWARF/DebugNamesDWARFIndex.cpp b/lldb/source/Plugins/SymbolFile/DWARF/DebugNamesDWARFIndex.cpp
index 292ea2806c59dc7..4fc3866a3b608fd 100644
--- a/lldb/source/Plugins/SymbolFile/DWARF/DebugNamesDWARFIndex.cpp
+++ b/lldb/source/Plugins/SymbolFile/DWARF/DebugNamesDWARFIndex.cpp
@@ -129,8 +129,19 @@ void DebugNamesDWARFIndex::GetGlobalVariables(
     DWARFUnit &cu, llvm::function_ref<bool(DWARFDIE die)> callback) {
   uint64_t cu_offset = cu.GetOffset();
   bool found_entry_for_cu = false;
-  for (const DebugNames::NameIndex &ni: *m_debug_names_up) {
-    for (DebugNames::NameTableEntry nte: ni) {
+  for (const DebugNames::NameIndex &ni : *m_debug_names_up) {
+    // Check if this name index contains an entry for the given CU.
+    bool cu_matches = false;
+    for (uint32_t i = 0; i < ni.getCUCount(); ++i) {
+      if (ni.getCUOffset(i) == cu_offset) {
+        cu_matches = true;
+        break;
+      }
+    }
+    if (!cu_matches)
+      continue;
+
+    for (DebugNames::NameTableEntry nte : ni) {
       uint64_t entry_offset = nte.getEntryOffset();
       llvm::Expected<DebugNames::Entry> entry_or = ni.getEntry(&entry_offset);
       for (; entry_or; entry_or = ni.getEntry(&entry_offset)) {

bulbazord

This patch looks fine to me in idea, but next time please leave your review up for longer so others have time to take a look as well.

bulbazord · 2023-10-25T18:59:58Z

lldb/source/Plugins/SymbolFile/DWARF/DebugNamesDWARFIndex.cpp

+    bool cu_matches = false;
+    for (uint32_t i = 0; i < ni.getCUCount(); ++i) {
+      if (ni.getCUOffset(i) == cu_offset) {
+        cu_matches = true;
+        break;
+      }
+    }
+    if (!cu_matches)
+      continue;


It'd be great if we could use some kind of find_if instead of manually walking the CU Offsets.

dwblaikie · 2023-10-25T19:08:22Z

lldb/source/Plugins/SymbolFile/DWARF/DebugNamesDWARFIndex.cpp

+    for (uint32_t i = 0; i < ni.getCUCount(); ++i) {
+      if (ni.getCUOffset(i) == cu_offset) {


If NameIndex exposed the CUOffsets as a range (which seems pretty easy/reasonable for it to do - ah, because it requires potentially applying relocations it'd probably require a custom iterator - maybe a mapped iterator would be adequate & easy to do) then this could be written as:

if (llvm::none_of(ni.CUOffsets(), [&](uint64_t off) { return off == cu_offset; })) continue;

I /think/ CUOffsets() would look something roughly like this:

auto CUOffsets() const { assert(TU < Hdr.CompUnitCount); const unsigned SectionOffsetSize = dwarf::getDwarfOffsetByteSize(Hdr.Format); uint64_t Offset = CUsBase + SectionOffsetSize * CU; auto R = /* Guess we need some sort of generator to produce the values [CUsBase, CUsBase+SectionOffsetSize*Hdr.CompUnitCount) in SectionOffsetSize increments... some enhancement to llvm::seq that takes a stride size would be suitable */ return llvm::map_range(llvm::seq(CUsBase, [&](uint64_t Offset) { return Section.AccelSection.getRelocatedValue(SectionOffsetSize, &Offset); }); }

Oh, also, if you kept the result (more like a llvm::find_if as @bulbazord was suggesting, rather than my llvm::none_of here) of this search, you could save a small amount of time (no need to indirect through the index and reapply relocations to get the CU offset) by using getCUInedx() and comparing that to the index you would've found in this search - down on line 139/150

jeffreytan81 · 2023-10-25T20:36:04Z

Sounds good. Will keep the PR alive at least half day or one day before merging in future.

Improve debug names index fetching global variables performance

c8d9a1f

jeffreytan81 requested review from labath, clayborg, dwblaikie and ayermolo October 25, 2023 17:26

clayborg approved these changes Oct 25, 2023

View reviewed changes

Address formatter/review feedback

19b7055

jeffreytan81 marked this pull request as ready for review October 25, 2023 18:45

jeffreytan81 requested a review from JDevlieghere as a code owner October 25, 2023 18:45

llvmbot added the lldb label Oct 25, 2023

jeffreytan81 merged commit d4e6e40 into llvm:main Oct 25, 2023

bulbazord reviewed Oct 25, 2023

View reviewed changes

dwblaikie reviewed Oct 25, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve debug names index fetching global variables performance #70231

Improve debug names index fetching global variables performance #70231

Uh oh!

jeffreytan81 commented Oct 25, 2023

Uh oh!

clayborg left a comment

Uh oh!

clayborg Oct 25, 2023

Uh oh!

github-actions bot commented Oct 25, 2023 •

edited

Loading

Uh oh!

llvmbot commented Oct 25, 2023

Uh oh!

bulbazord left a comment

Uh oh!

bulbazord Oct 25, 2023

Uh oh!

dwblaikie Oct 25, 2023

Uh oh!

dwblaikie Oct 25, 2023

Uh oh!

jeffreytan81 commented Oct 25, 2023

Uh oh!

Uh oh!

		for (uint32_t i = 0; i < ni.getCUCount(); ++i) {
		if (ni.getCUOffset(i) == cu_offset) {

Improve debug names index fetching global variables performance #70231

Improve debug names index fetching global variables performance #70231

Uh oh!

Conversation

jeffreytan81 commented Oct 25, 2023

Uh oh!

clayborg left a comment

Choose a reason for hiding this comment

Uh oh!

clayborg Oct 25, 2023

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Oct 25, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Oct 25, 2023

Uh oh!

bulbazord left a comment

Choose a reason for hiding this comment

Uh oh!

bulbazord Oct 25, 2023

Choose a reason for hiding this comment

Uh oh!

dwblaikie Oct 25, 2023

Choose a reason for hiding this comment

Uh oh!

dwblaikie Oct 25, 2023

Choose a reason for hiding this comment

Uh oh!

jeffreytan81 commented Oct 25, 2023

Uh oh!

Uh oh!

github-actions bot commented Oct 25, 2023 •

edited

Loading