All Questions

-1 votes
0 answers
19 views

Script to Deploy Grafana Dashboard with PromQL and Alert Queries [closed]

I'm looking to build a script that can deploy a Grafana dashboard with PromQL queries and alert queries all in one go. The idea is to automate the deployment process so that I can simply call the ...
Samantha V's user avatar
0 votes
0 answers
24 views

How to configure Grafana Alertmanager data source with kube-prometheus-stack Helm deployment

I am using the Prometheus Alertmanager that gets deployed by the kube-prometheus-stack Helm deployment. Now I want to connect Grafana and tried to configure an Alertmanager data source, which fails with ...
Herr Hempel's user avatar
0 votes
0 answers
32 views

Spring Boot Prometheus PushGatewayManager: Unable to push metrics due to SocketTimeoutException

I'm encountering an issue with my Spring Boot application where it fails to push metrics to Prometheus Pushgateway. The issue happens intermittently. The error message indicates a ...
Dhruv's user avatar
  • 43
0 votes
0 answers
20 views

Prometheus systemd expression based on a list of services

I have several VMs each exporting (using node-exporter) the status of its systemd services. I would like to have an alerting rule that looks something like: - alert: service_down expr: ...
Corel's user avatar
  • 623
0 votes
0 answers
21 views

Managing Time-to-Live (TTL) for Alerts in Prometheus

We're collecting EUR/USD prices every minute on Prometheus and setting up various alerts, such as percentage changes over the last 10 minutes and the last hour. The data flow works seamlessly; however,...
more's user avatar
  • 133
0 votes
0 answers
17 views

Notes "New" or "Continuing" alert for Alertmanager

I am doing a fairly simple monitoring setup based on Prometheus and Alertmanager (main branch). Receiver: Telegram chat. As Alertmanager has supported Telegram as a receiver since v0.21, I would like to ...
neversure's user avatar
0 votes
0 answers
27 views

Show individual container as Prometheus target

I am running a server with several Docker containers (unfortunately I cannot use Podman because it is mailcow, which does not fully support Podman yet) in one docker-compose file. I have added cAdvisor to the ...
LeifSec's user avatar
  • 85
0 votes
0 answers
16 views

How to detect and create an alarm for a Hudi job failure using hoodie metrics via Prometheus

Problem: While using the multi delta streamer for Kafka ingestion, out of many tables, if ingestion for one of the tables fails, the job still succeeds. There is no way to check for success/failure for a particular ...
Roobal Jindal's user avatar
0 votes
1 answer
23 views

Categorisation of alerts in Prometheus and Grafana

I want to categorise alerts in Prometheus and Grafana: if CPU usage is between 70 and 75 then medium, if CPU usage is between 75 and 80 then medium, and if usage is greater than 80 then critical ...
Akash Kotkar's user avatar
0 votes
1 answer
63 views

How to configure Prometheus Alertmanager for alerting when any container goes down?

I am using Prometheus Alertmanager to monitor dozens of hosts and hundreds of containers on these hosts. I need to receive notifications when any container goes down. I understand from the ...
alis's user avatar
  • 1
0 votes
0 answers
25 views

Slack alerts into Webex Team

I'm trying to migrate Prometheus Alertmanager alerting from using slack_configs (ref1) into the webex_configs (ref2). Is there any possibility to adjust the message sent to Webex with structure used ...
ddano's user avatar
  • 1
0 votes
1 answer
58 views

Adding node exporter info to Prometheus query

I am running multiple Docker stacks all providing the infrastructure for a product. Internally those Docker stacks are monitored using Prometheus and alert to a Teams channel. I want all Docker stacks ...
jonathan-dev's user avatar
0 votes
1 answer
147 views

PromQL query to find 99th percentile request latency

I'm looking at a PromQL query we're using to fire an alert when the latency of a certain service goes beyond 400ms, but I'm unable to understand how it works, or even if it is correct at all - The ...
Soham Dixit's user avatar
0 votes
1 answer
157 views

Custom alerts sent to Alertmanager not showing up in Grafana

I am currently using a version of the kube-prometheus-stack Helm chart to deploy Grafana, Prometheus and Alertmanager. I made a Go program to send custom alerts to Alertmanager (this feature will be ...
quenting's user avatar
1 vote
0 answers
68 views

In Prometheus, is it possible that at the same timestamp the defined PromQL is evaluated as true but the alert is not firing?

I recently encountered a problem that I struggled with for several days... I've defined a PromQL expression in expr in a rule YAML file, and for is set to 0s (firing immediately). Global evaluation_interval is 1m and query....
Conifers's user avatar
  • 370
