-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Minio Sloth rules #254
Conversation
a835970
to
1b55637
Compare
component/class/defaults.yml
Outdated
@@ -39,7 +39,7 @@ parameters: | |||
appcat: | |||
registry: ghcr.io | |||
repository: vshn/appcat | |||
tag: v4.37.0 | |||
tag: sli_minio |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Will change this once the SLI exporter is released
@@ -168,6 +168,16 @@ parameters: | |||
# If the alert is pending for more than 5m this indicates a real problem. | |||
for: 6m | |||
ticket_alert: {} | |||
minio: | |||
uptime: | |||
objective: 99.9 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've used the same values as redis/postgres, any objections?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
since we haven't got any special requirements, I'd keep it as default
@@ -420,6 +430,7 @@ parameters: | |||
helmChartVersion: ${appcat:charts:minio:version} | |||
grpcEndpoint: ${appcat:grpcEndpoint} | |||
defaultPlan: standard-1 | |||
sla: 99.25 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've used the same values as redis/postgres, any objections?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's keep the same value across other services for now.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
since we haven't got any special requirements, I'd keep it as default
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
aprooved
@@ -168,6 +168,16 @@ parameters: | |||
# If the alert is pending for more than 5m this indicates a real problem. | |||
for: 6m | |||
ticket_alert: {} | |||
minio: | |||
uptime: | |||
objective: 99.9 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
since we haven't got any special requirements, I'd keep it as default
@@ -420,6 +430,7 @@ parameters: | |||
helmChartVersion: ${appcat:charts:minio:version} | |||
grpcEndpoint: ${appcat:grpcEndpoint} | |||
defaultPlan: standard-1 | |||
sla: 99.25 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
since we haven't got any special requirements, I'd keep it as default
@@ -58,17 +58,25 @@ local prometheusRule(name) = | |||
spec: patchedRules, | |||
}; | |||
|
|||
local getEvents(serviceName) = { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I love this refactor :D
@@ -71,7 +71,7 @@ spec: | |||
sloth_window: 1d | |||
record: slo:sli_error:ratio_rate1d | |||
- expr: | | |||
((sum(rate(appcat_probes_seconds_count{reason!="success", service="VSHNRedis", ha="true"}[3d]) or 0*rate(appcat_probes_seconds_count{service="VSHNRedis"}[3d])) by (service, namespace, name, organization, sla) or vector(0)) - scalar(appcat:cluster:maintenance) > 0 or sum(0*rate(appcat_probes_seconds_count{service="VSHNRedis"}[3d])) by (service, namespace, name, organization, sla)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
good catch
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Some things to look at
@@ -420,6 +430,7 @@ parameters: | |||
helmChartVersion: ${appcat:charts:minio:version} | |||
grpcEndpoint: ${appcat:grpcEndpoint} | |||
defaultPlan: standard-1 | |||
sla: 99.25 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's keep the same value across other services for now.
8f500a6
to
713106c
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
ec0d5d2
to
17e8ab0
Compare
17e8ab0
to
435eb4b
Compare
435eb4b
to
be4f4a0
Compare
Checklist
changelog.
The PR has a meaningful description that sums up the change. It will be
linked in the changelog.
bug
,enhancement
,documentation
,change
,breaking
,dependency
as they show up in the changelog.