Prometheus Troubleshooting Toolkit

Top Prometheus errors

Start with the most common production issues and troubleshooting paths.

alertmanager failed to join cluster

Fix Alertmanager 'failed to join cluster': open port 9094 TCP+UDP, set --cluster.advertise-address, and stop duplicate notifica…

Alert Stuck 'Pending' and Never Firing

Fix Prometheus alerts stuck in Pending or missing from /alerts: tune for and evaluation_interval, verify the expression returns…

binary expression must contain only scalar and instant vector types

Fix PromQL 'binary expression must contain only scalar and instant vector types' errors: wrap range vectors in rate(), use scal…

compaction failed

Fix Prometheus 'compaction failed' errors: remove corrupt blocks, free disk space, recover from unclean shutdowns, and restore…

Error loading config (--config.file=/etc/prometheus/prometheus.yml)

Fix Prometheus 'Error loading config' and HTTP 400 reload failures: validate YAML with promtool, enable web lifecycle, and reso…

duplicate sample for timestamp

Fix Prometheus 'duplicate sample for timestamp' errors: dedupe exporters exposing repeated series, add unique instance/job labe…

found multiple scrape configs with job name

Fix Prometheus 'found multiple scrape configs with job name' errors: locate colliding job_names across included files, dedupe s…

Empty query result

Fix Prometheus 'Empty query result' and 'No data' when a metric should exist: label typos, stale series, stopped targets, lookb…

Best Prometheus prompts

Use these prompts to turn symptoms, logs, and config into a structured troubleshooting plan.

Free Prometheus tools

Validate, troubleshoot, or analyze your configuration before production changes.

AI Alert Rule Generator

Turn a plain-English SLO into a ready-to-ship Prometheus alerting rule.

Open the tool

AI Incident Response Assistant

Paste an alert and metrics, get a structured investigation plan.

Start triage

Prometheus runbook

Use a repeatable checklist for production troubleshooting.

A checklist for scrape, query, and alerting problems.

1 Check targets and their health (up, last scrape, error)
2 Validate scrape configs and relabeling
3 Review alert rules and their evaluation state
4 Test the PromQL directly in the expression browser
5 Inspect remote-write status and queue backpressure

Get the runbook checklist

Alertmanager Grouping Timers: group_wait, group_interval, and repeat_interval The $__rate_interval Trap: Why Grafana rate() Panels Lie When You Zoom metric_relabel_configs as a Cardinality Firewall Native Histograms vs Classic Buckets: Getting Quantiles You Can Trust OpenTelemetry Collector Backpressure: memory_limiter, batch, and Queues Protecting the Prometheus Read Path: max-samples, timeout, and Concurrency

Prometheus Troubleshooting Toolkit

Top Prometheus errors

alertmanager failed to join cluster

Alert Stuck 'Pending' and Never Firing

binary expression must contain only scalar and instant vector types

compaction failed

Error loading config (--config.file=/etc/prometheus/prometheus.yml)

duplicate sample for timestamp

found multiple scrape configs with job name

Empty query result

Best Prometheus prompts

SLO Error Budget & Multi-Window Burn Rate Alerts

Grafana Loki + Prometheus Correlation

Prometheus Alert Rule Generator

Alertmanager Routing Tree Matcher Design Review

Free Prometheus tools

AI Alert Rule Generator

AI Incident Response Assistant

Prometheus runbook

Top Prometheus errors

alertmanager failed to join cluster

Alert Stuck 'Pending' and Never Firing

binary expression must contain only scalar and instant vector types

compaction failed

Error loading config (--config.file=/etc/prometheus/prometheus.yml)

duplicate sample for timestamp

found multiple scrape configs with job name

Empty query result

Best Prometheus prompts

SLO Error Budget & Multi-Window Burn Rate Alerts

Grafana Loki + Prometheus Correlation

Prometheus Alert Rule Generator

Alertmanager Routing Tree Matcher Design Review

Free Prometheus tools

AI Alert Rule Generator

AI Incident Response Assistant

Prometheus runbook

Related Prometheus guides