feat(metrics): consecutiveSoftErrors #5502
Conversation
Hi @AndrewCharlesHay. Thanks for your PR. I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
Very interesting, thanks @AndrewCharlesHay 👍
This comment was marked as off-topic.
The linter is fixed; it's worth rebasing.
/lgtm
@AndrewCharlesHay It LGTM. Can you fix the linter? After that, we should be good to go.
Will do
/lgtm
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: ivankatliarchuk The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Not sure how easy this is to reproduce, but it seems we have a flaky test: https://prow.k8s.io/view/gs/kubernetes-ci-logs/pr-logs/pull/kubernetes-sigs_external-dns/5498/pull-external-dns-unit-test/1931045324659363840
Hi @AndrewCharlesHay. I'm not sure what the difference is. It works on GitHub Actions and fails in Prow. Not 100% related: kubernetes-sigs/prow#450
What does it do?
New Metric Added: external_dns_controller_consecutive_soft_errors
Metric Registration: the consecutiveSoftErrors metric is registered with the existing set of metrics.
Enhanced Error Handling in Controller Loop: a softErrorCount counter tracks consecutive soft errors.
Benefit:
These changes provide improved observability for persistent, non-fatal issues in the controller’s reconciliation loop, making it easier to detect and troubleshoot recurring problems.
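The counting behaviour described above can be illustrated with a minimal Go sketch. It is not the code from this PR: the gauge type, the errSoft and errHard sentinels, and the observe method are hypothetical stand-ins, and the rules it applies (increment on a soft error, reset to zero on a successful run, leave the gauge untouched on any other error) are an assumption about what "consecutive" means here. Only the names softErrorCount and consecutiveSoftErrors come from the description.

```go
package main

import (
	"errors"
	"fmt"
)

// errSoft is a hypothetical sentinel standing in for the project's soft-error marker.
var errSoft = errors.New("soft error")

// errHard is a hypothetical non-recoverable error, used only for illustration.
var errHard = errors.New("hard error")

// gauge is a stand-in for a Prometheus gauge; only Set is modeled.
type gauge struct{ value float64 }

func (g *gauge) Set(v float64) { g.value = v }

// controller holds the counter and the gauge that mirrors it.
type controller struct {
	softErrorCount        int
	consecutiveSoftErrors *gauge
}

// observe records the outcome of one reconciliation attempt.
// It returns true when the error is not a soft error (assumed fatal).
func (c *controller) observe(err error) (fatal bool) {
	switch {
	case err == nil:
		c.softErrorCount = 0 // a successful run ends the streak
	case errors.Is(err, errSoft):
		c.softErrorCount++ // the streak of soft errors grows
	default:
		return true // any other error: gauge left untouched
	}
	c.consecutiveSoftErrors.Set(float64(c.softErrorCount))
	return false
}

func main() {
	c := &controller{consecutiveSoftErrors: &gauge{}}
	results := []error{
		fmt.Errorf("rate limited: %w", errSoft),
		fmt.Errorf("timeout: %w", errSoft),
		nil,
		fmt.Errorf("rate limited: %w", errSoft),
	}
	for i, err := range results {
		c.observe(err)
		fmt.Printf("run %d: consecutive soft errors = %v\n", i+1, c.consecutiveSoftErrors.value)
	}
}
```

With these four outcomes the gauge reads 1, 2, 0, 1. A gauge rather than a monotonic counter fits this use because the value has to drop back to zero once a run succeeds, so an alert can fire on a sustained streak instead of on a lifetime total.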
Motivation
#5499