Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Change description and summary for the RadixAppComponentCrashLooping alert #2534

Open
nilsgstrabo opened this issue Jan 6, 2025 · 0 comments
Labels
🤔 refinement needed This needs more details

Comments

@nilsgstrabo
Copy link
Contributor

The description and summary saying that a component is crash looping is technically incorrect, as a single crash (and not a crash-loop) will also trigger the alert. We got a question in Slack about this https://equinor.slack.com/archives/CBKM6N2JY/p1736154938640989

I suggest we change the summary field to Component crashed and description to Component {{ $labels.label_radix_component}} in environment {{ $labels.label_radix_env }} for application {{ $labels.label_radix_app}} has crashed {{ printf "%.2f" $value }} times / 10 minutes

Alert spec:

description: Component {{ $labels.label_radix_component}} in environment {{ $labels.label_radix_env }} for application {{ $labels.label_radix_app}} is restarting {{ printf "%.2f" $value }} times / 10 minutes.
summary: Component is crash looping

@nilsgstrabo nilsgstrabo added the 🤔 refinement needed This needs more details label Jan 6, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
🤔 refinement needed This needs more details
Projects
None yet
Development

No branches or pull requests

1 participant