Questions tagged [prometheus-alertmanager]
The Alertmanager handles alerts sent by client applications such as the Prometheus server. It takes care of deduplicating, grouping, and routing them to the correct receiver integration such as email, PagerDuty, or OpsGenie. It also takes care of silencing and inhibition of alerts.
prometheus-alertmanager
770
questions
0
votes
0
answers
22
views
Multi-tenant Loki Ruler not sending alerts to Mimir AlertManager
I have an AlertRule set up in my Loki-Distributed instance via the Loki ruler as below:
ruler:
  directories:
    fake:
      rules.txt: |
        groups:
          - name: mimir_loki_test
        ...
-1
votes
0
answers
19
views
Script to Deploy Grafana Dashboard with PromQL and Alert Queries [closed]
I'm looking to build a script that can deploy a Grafana dashboard with PromQL queries and alert queries all in one go. The idea is to automate the deployment process so that I can simply call the ...
0
votes
0
answers
26
views
Why is Alertmanager re-sending notifications after (group_interval + repeat_interval) duration?
Alertmanager notifications are delayed. Alertmanager re-sends notifications after (group_interval + repeat_interval) time. We expect the notifications to be re-sent after the repeat_interval time ...
0
votes
0
answers
24
views
How to configure a Grafana Alertmanager datasource with a kube-prometheus-stack Helm deployment
I am using the Prometheus Alertmanager that gets deployed by the kube-prometheus-stack Helm deployment. Now I want to connect Grafana and tried to configure an Alertmanager datasource, which fails with ...
0
votes
0
answers
12
views
How does repeat_interval work in alertmanager?
I am having trouble making sense of some alerting behavior I'm seeing and it seems connected to repeat_interval. Can't seem to find a simple answer to my question.
Question: Will repeat_interval fire ...
0
votes
0
answers
44
views
Add a template under Alertmanager receivers in kube-prometheus-stack values.yaml
I am trying to define a template to use without having to create a full ConfigMap within the values.yaml for Alertmanager, since it is such a small template. I saw in a few examples people using ...
0
votes
0
answers
32
views
Spring Boot Prometheus PushGatewayManager: Unable to push metrics due to SocketTimeoutException
I'm encountering an issue with my Spring Boot application where it fails to push metrics to Prometheus Pushgateway. The issue happens intermittently.
The error message indicates a ...
0
votes
0
answers
20
views
Prometheus systemd expression based on a list of services
I have several VMs each exporting (using node-exporter) the status of its systemd services. I would like to have an alerting rule that looks something like:
- alert: service_down
  expr: ...
0
votes
0
answers
21
views
Managing Time-to-Live (TTL) for Alerts in Prometheus
We're collecting EUR/USD prices every minute on Prometheus and setting up various alerts, such as percentage changes over the last 10 minutes and the last hour.
The data flow works seamlessly; however,...
0
votes
0
answers
17
views
Notes "New" or "Continuing" alert for Alertmanager
I am doing a fairly simple monitoring setup based on Prometheus and Alertmanager (main branch).
Receiver: Telegram chat. As Alertmanager supports Telegram as a receiver since v0.21, I would like to ...
0
votes
0
answers
27
views
Show individual container as Prometheus target
I am running a server with several Docker containers (unfortunately I cannot use Podman, because this is mailcow, which does not fully support Podman yet) in one docker-compose file.
I have added cadvisor to the ...
0
votes
0
answers
19
views
Is there a way to suppress 'resolve' messages sent to PagerDuty?
Our PagerDuty is integrated with Alertmanager, and we want to stop a specific alert from auto-resolving. To my knowledge, the best way to do that would be changing the Alertmanager configuration so ...
0
votes
0
answers
16
views
How to detect and create an alarm for a Hudi job failure using hoodie metrics via Prometheus
Problem: When using the multi delta streamer for Kafka ingestion across many tables, the job succeeds even if ingestion fails for one of the tables. There is no way to check for success/failure for a particular ...
0
votes
1
answer
23
views
Categorisation of alerts in Prometheus and Grafana
I want to categorise alerts in Prometheus and Grafana: if CPU usage is between 70 and 75, then medium; if CPU usage is between 75 and 80, medium; and if usage is greater than 80, then critical ...
0
votes
0
answers
17
views
Alertmanager does not fire alarm
I use the following config for the Alertmanager of AWS Prometheus, and it works:
alertmanager_config: |
  global:
    resolve_timeout: 60s
  route:
    receiver: default
    group_by: ['alertname']
  ...