Skip to main content

Questions tagged [prometheus-alertmanager]

The Alertmanager handles alerts sent by client applications such as the Prometheus server. It takes care of deduplicating, grouping, and routing them to the correct receiver integration such as email, PagerDuty, or OpsGenie. It also takes care of silencing and inhibition of alerts.

prometheus-alertmanager
0 votes
0 answers
22 views

Multi-tenant Loki Ruler not sending alerts to Mimir AlertManager

I have an AlertRule set up in my Loki-Distributed instance via the Loki ruler as below: ruler: directories: fake: rules.txt: | groups: - name: mimir_loki_test ...
Golide's user avatar
  • 961
-1 votes
0 answers
19 views

Script to Deploy Grafana Dashboard with PromQL and Alert Queries [closed]

I'm looking to build a script that can deploy a Grafana dashboard with PromQL queries and alert queries all in one go. The idea is to automate the deployment process so that I can simply call the ...
Samantha V's user avatar
0 votes
0 answers
26 views

Why alertmanager is re-sending notifications after (group_interval + repeat_interval) duration?

Alertmanager notifications are delayed. Alertmanager re-sends notifications after (group_interval + repeat_interval) time. We are expecting the notifications to be re-sent after repeat-interval time ...
Aruna Thuse's user avatar
0 votes
0 answers
24 views

how to configure Grafana alertmanager datasource with kube-prometheus-stack helm deployment

I am using the prometheus alertmanager that gets deployed by the kube-prometheus-stack helm deployment. Now I want to connect Grafana and tried to configure a alertmanager datasource what fails with ...
Herr Hempel's user avatar
0 votes
0 answers
12 views

How does repeat_interval work in alertmanager?

I am having trouble making sense of some alerting behavior I'm seeing and it seems connected to repeat_interval. Can't seem to find a simple answer to my question. Question: Will repeat_interval fire ...
shek's user avatar
  • 215
0 votes
0 answers
44 views

Add a template under alert manager receivers in Kube-prometheus-stack values.yaml

I am trying to define a template to use without having to create a full ConfigMap within the values.yaml for alert manager since it is such a small template. I saw in a few examples people using ...
Sh3perd's user avatar
  • 69
0 votes
0 answers
32 views

Spring Boot Prometheus PushGatewayManager: Unable to push metrics due to SocketTimeoutException

I'm encountering an issue with my Spring Boot application where it fails to push metrics to Prometheus Pushgateway. The issue is happening intermettently The error message indicates a ...
Dhruv's user avatar
  • 43
0 votes
0 answers
20 views

Prometheus systemd expression based on a list of services

I have several VMs each exporting (using node-exporter) the status of its systemd services. I would like to have an alerting rule that looks something like: - alert: service_down expr: ...
Corel's user avatar
  • 623
0 votes
0 answers
21 views

Managing Time-to-Live (TTL) for Alerts in Prometheus

We're collecting EUR/USD prices every minute on Prometheus and setting up various alerts, such as percentage changes over the last 10 minutes and the last hour. The data flow works seamlessly; however,...
more's user avatar
  • 133
0 votes
0 answers
17 views

Notes "New" or "Continuing" alert for Alertmanager

I am doing a fairly simple monitoring setup based on Prometheus and Alertmanager (main branch). Reciver - Telgram chat. As alertmanager supports Telegram as a reciver since v0.21, i would like to ...
neversure's user avatar
0 votes
0 answers
27 views

show individual container as Prometheus target

I am running a server with several docker (unfortunately I cannot use podman because it is mailcow and this does not fully support podman yet) in one docker-compose file. I have added cadvisor to the ...
LeifSec's user avatar
  • 85
0 votes
0 answers
19 views

Is there a way to suppress 'resolve' messages sent to pager duty

Our pager duty is integrated to alert manager, and we want to stop an specific alert from auto resolving. To my knowledge, the best way to do that would be changing the alert manager configuration so ...
Jose Antonio Vidal Sanchez's user avatar
0 votes
0 answers
16 views

How to detect and create a alarm for a hudi job failure using hoodie metrics via Prometheus

Problem: While using multi delta streamer for kafka ingestion, out of many tables, if one of the table ingestion fails, job succeeds. There is no way to check for success/failure for a particular ...
Roobal Jindal's user avatar
0 votes
1 answer
23 views

Categorisation of alerts in Prometheus and grafana

I want to do categorisation of alerts in Prometheus and grafana if cpu usage is between 70 and 75 then medium , if cpu usage is between 75 and 80 medium and usage is greater than 80 then critical ...
Akash Kotkar's user avatar
0 votes
0 answers
17 views

Alertmanager does not fire alarm

I use the following config for alertmanager of AWS Prometheus, and it works: alertmanager_config: | global: resolve_timeout: 60s route: receiver: default group_by: ['alertname'] ...
dn2024's user avatar
  • 1

15 30 50 per page
1
2 3 4 5
52