Skip to content

Alert: high error rate in vulture#5206

Merged
ruslan-mikhailov merged 3 commits intografana:mainfrom
ruslan-mikhailov:vulture-alert
Jun 4, 2025
Merged

Alert: high error rate in vulture#5206
ruslan-mikhailov merged 3 commits intografana:mainfrom
ruslan-mikhailov:vulture-alert

Conversation

@ruslan-mikhailov
Copy link
Copy Markdown
Contributor

@ruslan-mikhailov ruslan-mikhailov commented Jun 3, 2025

What this PR does: adds alert for high error rate reported by vulture

Which issue(s) this PR fixes:
Fixes #

Checklist

  • Tests updated
  • Documentation added
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

@ruslan-mikhailov
Copy link
Copy Markdown
Contributor Author

+ %f instead of %s for better float number in output

Copy link
Copy Markdown
Contributor

@javiermolinar javiermolinar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Comment thread operations/tempo-mixin/runbook.md Outdated

This alert fires when Tempo vulture detects a high error rate (above the configured threshold) while validating write or read paths. It indicates there are problems with trace processing or storage.

If a trace write failed once, it will continue to fail on read operations as well. In that case, restarting vulture will start over with new traces.
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If a trace write failed once, it will continue to fail on read operations as well. In that case, restarting vulture will start over with new traces.

Do we need that? We could just create a ticket to handle this gracefully

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i don't know how often write failures occur but maybe vulture could retry a few times and, if still unsuccessful, it could just exit itself?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removed this sentence, created issue: #5210

@ruslan-mikhailov ruslan-mikhailov merged commit bdec84c into grafana:main Jun 4, 2025
20 checks passed
@ruslan-mikhailov ruslan-mikhailov deleted the vulture-alert branch June 4, 2025 15:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants