enhancement: supported deduping spans within block builder by zhxiaogg · Pull Request #6539 · grafana/tempo

zhxiaogg · 2026-02-23T20:59:57Z

What this PR does:

Block builder: deduplicate spans within traces during block creation and track removed duplicates via tempo_block_builder_spans_deduped_total metric

Which issue(s) this PR fixes:
Fixes #6516

Test

The metric can be verified in the Prometheus UI when testing with the single-binary example.

Checklist

Tests updated
Documentation added
CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

javiermolinar · 2026-02-25T09:00:26Z

+		for _, ss := range rs.ScopeSpans {
+			unique := ss.Spans[:0]
+			for _, s := range ss.Spans {
+				token := util.SpanIDAndKindToToken(s.SpanId, int(s.Kind))


I wonder if we should use this hashing algorithm instead:

tempo/pkg/model/trace/combine.go

Line 27 in 2ad734b

func tokenForID(h hash.Hash64, buffer []byte, kind int32, b []byte) token {

It seems more correct, with fewer chances of collisions.

thanks, changed to use the alternative one you suggested!

stoewer

To avoid heap allocations and traversing each trace twice, I'd suggest removing dedupeTrace() and implementing its functionality inside the loop that updates timestamp bounds. Roughly like this:

seen := make(map[uint64]struct{}, 1024) // initialize seen before the outer `for entries := range seq` loop

...
for _, rs := range tr.ResourceSpans {
    for _, ss := range rs.ScopeSpans {
        unique := ss.Spans[:0]
        for _, s := range ss.Spans {
            // dedup and update timestamps 
        }
        ss.Spans = unique
    }
}
clear(seen)

While this is less readable than having a separate dedup function, it might still be worth it for performance reasons. What do you think?
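
A self-contained sketch of the single-pass idea above, for illustration only. The `Span`, `ScopeSpans`, `ResourceSpans`, and `Trace` types are hypothetical stand-ins for the tempopb types, `Span.ID` stands in for the (span ID, kind) token, and `dedupeAndBounds` is an assumed name rather than anything in the PR.

```go
package main

import "fmt"

// Hypothetical stand-ins for the tempopb types; only the fields used here.
type Span struct {
	ID        uint64 // stands in for the (span ID, kind) token
	StartNano uint64
	EndNano   uint64
}
type ScopeSpans struct{ Spans []*Span }
type ResourceSpans struct{ ScopeSpans []*ScopeSpans }
type Trace struct{ ResourceSpans []*ResourceSpans }

// dedupeAndBounds removes duplicate spans in place and, in the same pass,
// tracks the earliest start and latest end of the spans that are kept.
// seen is owned by the caller and is cleared before returning.
func dedupeAndBounds(tr *Trace, seen map[uint64]struct{}) (removed uint32, start, end uint64) {
	first := true
	for _, rs := range tr.ResourceSpans {
		for _, ss := range rs.ScopeSpans {
			unique := ss.Spans[:0] // reuse the backing array
			for _, s := range ss.Spans {
				if _, dup := seen[s.ID]; dup {
					removed++
					continue
				}
				seen[s.ID] = struct{}{}
				unique = append(unique, s)
				if first || s.StartNano < start {
					start = s.StartNano
				}
				if first || s.EndNano > end {
					end = s.EndNano
				}
				first = false
			}
			ss.Spans = unique
		}
	}
	clear(seen) // ready for the next trace
	return removed, start, end
}

func main() {
	seen := make(map[uint64]struct{}, 1024)
	tr := &Trace{ResourceSpans: []*ResourceSpans{{ScopeSpans: []*ScopeSpans{{Spans: []*Span{
		{ID: 1, StartNano: 10, EndNano: 20},
		{ID: 2, StartNano: 15, EndNano: 50},
		{ID: 1, StartNano: 10, EndNano: 20}, // duplicate of the first span
	}}}}}}
	removed, start, end := dedupeAndBounds(tr, seen)
	kept := len(tr.ResourceSpans[0].ScopeSpans[0].Spans)
	fmt.Printf("removed=%d start=%d end=%d kept=%d\n", removed, start, end, kept)
	// prints "removed=1 start=10 end=50 kept=2"
}
```

Because `seen` covers the whole trace, a duplicate is caught even when the two copies sit in different scope or resource groups.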

stoewer · 2026-02-25T09:26:37Z

+	var deduped uint32
+	for _, rs := range tr.ResourceSpans {
+		for _, ss := range rs.ScopeSpans {
+			unique := ss.Spans[:0]


Nice in-place dedup 👍

stoewer · 2026-02-25T09:33:40Z

+// dedupeTrace removes duplicate spans in-place from tr, deduplicating by span ID and kind.
+// Returns the number of removed duplicate spans.
+func dedupeTrace(tr *tempopb.Trace) uint32 {
+	seen := make(map[uint64]struct{})


There is an opportunity here to save lots of smaller heap allocations:

Initialize seen with a reasonable starting size to avoid allocations and rehashing as the map grows

Reuse seen across multiple deduplications (call clear(seen) before each reuse)

thanks, all changed including the following comments.
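
One way to check whether reusing the map pays off is to compare allocation counts with `testing.AllocsPerRun` from the standard library. The sketch below is an illustration under assumptions: `countUniqueFresh` and `countUniqueReused` are hypothetical helpers, and the exact numbers depend on the Go version and map implementation, so no output is claimed here.

```go
package main

import (
	"fmt"
	"testing"
)

// countUniqueFresh allocates a new, unsized map on every call.
func countUniqueFresh(ids []uint64) int {
	seen := make(map[uint64]struct{})
	for _, id := range ids {
		seen[id] = struct{}{}
	}
	return len(seen)
}

// countUniqueReused clears and refills a caller-owned, presized map.
func countUniqueReused(ids []uint64, seen map[uint64]struct{}) int {
	clear(seen)
	for _, id := range ids {
		seen[id] = struct{}{}
	}
	return len(seen)
}

func main() {
	ids := make([]uint64, 512)
	for i := range ids {
		ids[i] = uint64(i % 400) // 400 unique IDs, 112 duplicates
	}
	seen := make(map[uint64]struct{}, 1024)

	fresh := testing.AllocsPerRun(100, func() { countUniqueFresh(ids) })
	reused := testing.AllocsPerRun(100, func() { countUniqueReused(ids, seen) })
	fmt.Printf("allocs per run: fresh=%.0f reused=%.0f\n", fresh, reused)
}
```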

stoewer · 2026-02-25T09:35:18Z

 			}

+			// Deduplicate spans within the trace
+			i.dedupedSpans += dedupeTrace(tr)


Each trace is traversed twice: once in dedupeTrace() and again at L118 to update the timestamp bounds. Maybe those can be combined.

stoewer · 2026-02-25T09:39:16Z

 }

+// DedupedSpans returns the total number of duplicate spans that were removed
+// across all traces. The iterator must be exhausted before this can be accessed.


The iterator must be exhausted before this can be accessed

Just for the sake of being a bit more defensive, what do you think about enforcing this by returning an error when liveTracesIter.DedupedSpans() is called before it's exhausted?
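
A minimal sketch of the defensive accessor suggested here. Everything in it is hypothetical: `dedupIter`, `newDedupIter`, and `errNotExhausted` are illustrative names, batches of `uint64` stand in for traces, and the real `liveTracesIter` may track exhaustion differently.

```go
package main

import (
	"errors"
	"fmt"
)

var errNotExhausted = errors.New("iterator not exhausted")

// dedupIter is a hypothetical iterator that dedupes each batch of IDs
// and only exposes the total once iteration has finished.
type dedupIter struct {
	batches   [][]uint64
	pos       int
	deduped   uint32
	exhausted bool
	seen      map[uint64]struct{}
}

func newDedupIter(batches [][]uint64) *dedupIter {
	return &dedupIter{batches: batches, seen: make(map[uint64]struct{}, 1024)}
}

// Next returns the next batch with duplicates removed in place,
// or false once all batches have been consumed.
func (it *dedupIter) Next() ([]uint64, bool) {
	if it.pos >= len(it.batches) {
		it.exhausted = true
		return nil, false
	}
	batch := it.batches[it.pos]
	it.pos++
	clear(it.seen)
	unique := batch[:0]
	for _, id := range batch {
		if _, dup := it.seen[id]; dup {
			it.deduped++
			continue
		}
		it.seen[id] = struct{}{}
		unique = append(unique, id)
	}
	return unique, true
}

// DedupedSpans returns an error instead of a partial count when it is
// called before the iterator has been exhausted.
func (it *dedupIter) DedupedSpans() (uint32, error) {
	if !it.exhausted {
		return 0, errNotExhausted
	}
	return it.deduped, nil
}

func main() {
	it := newDedupIter([][]uint64{{1, 1, 2}, {3, 3, 3}})
	if _, err := it.DedupedSpans(); err != nil {
		fmt.Println("early call:", err)
	}
	for {
		if _, ok := it.Next(); !ok {
			break
		}
	}
	n, err := it.DedupedSpans()
	fmt.Println("after exhaustion:", n, err)
}
```

Returning an error makes misuse visible at the call site, at the cost of one extra check for callers that already drain the iterator.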

javiermolinar

Looks good to me

zhxiaogg force-pushed the dedupe-spans-within-block-builder branch from 3ff9b6f to 49bbfd4 Compare February 24, 2026 19:58

zhxiaogg marked this pull request as ready for review February 24, 2026 20:13

zhxiaogg requested review from carles-grafana, electron0zero, ie-pham, javiermolinar, mapno, mattdurham, mdisibio, oleg-kozlyuk-grafana, ruslan-mikhailov, stoewer, yvrhdn and zalegrala as code owners February 24, 2026 20:13

zhxiaogg enabled auto-merge (squash) February 24, 2026 20:13

javiermolinar reviewed Feb 25, 2026

stoewer reviewed Feb 25, 2026

zhxiaogg added 5 commits February 25, 2026 07:48

enhancement: supported deduping spans within block builder

e851e16

CHANGELOG.md

dbf23c3

minor refactor

8ca3f0a

use uint32 for the metric value

b9da6c4

address comments related to perf/accuracy consideration

63f569f

zhxiaogg force-pushed the dedupe-spans-within-block-builder branch from 3e9676d to 63f569f Compare February 25, 2026 15:49

safe guard the iter.DedupedSpans() before exhausting the iter

faa368b

zhxiaogg requested review from javiermolinar and stoewer February 25, 2026 16:49

javiermolinar approved these changes Feb 26, 2026

zhxiaogg merged commit 91948b8 into grafana:main Feb 26, 2026
40 of 41 checks passed

zhxiaogg deleted the dedupe-spans-within-block-builder branch February 26, 2026 15:49

zalegrala pushed a commit to zalegrala/tempo that referenced this pull request Feb 27, 2026

enhancement: supported deduping spans within block builder (grafana#6539)

2d8dc99

zhxiaogg merged 6 commits into grafana:main from zhxiaogg:dedupe-spans-within-block-builder
