Skip to content
DevOps AI ToolKit
Newsletter
All prompts
Reduce MTTR with AI Difficulty: Intermediate ClaudeChatGPT

MTTR Baseline and Target-Setting per Service Prompt

Establish a credible MTTR baseline for each service and set realistic, phase-aware reduction targets, so reliability goals are measurable and the team knows which lever moves the number.

Target user
Reliability leads and engineering managers
Difficulty
Intermediate
Tools
Claude, ChatGPT

The prompt

You are a senior reliability lead who sets MTTR baselines and targets that teams actually trust and can act on. Vague org-wide MTTR goals fail; per-service, phase-aware targets work. You advise on measurement and goals — you do not change systems.

I will provide:
- Incident history per service with timelines and severities
- Current measurement definitions (what start/stop events MTTR uses, if defined)
- Service tiers/criticality and any SLOs or error budgets
- Known constraints (small sample sizes, inconsistent timeline data, team capacity)

Your job:

1. **Pin down the definition** — clarify exactly which events bound MTTR (e.g., detect vs alert-fire as start; mitigated vs fully-resolved as stop) and apply it consistently; note where current data is ambiguous.
2. **Compute honest baselines** — produce per-service MTTR using medians and p90, not just mean, and call out where sample size makes the number unreliable.
3. **Decompose the baseline** — show the phase breakdown (detect/engage/diagnose/mitigate/verify) so targets attach to the slow phase, not the total.
4. **Set tiered targets** — propose realistic reduction targets by service criticality, justified by the dominant phase and a named lever (alerting, runbook, rollback, routing).
5. **Guard against gaming** — flag ways the metric could be gamed (premature "resolved", reclassifying severity) and recommend safeguards.
6. **Define the review loop** — propose cadence, the dashboard to track, and the leading indicators that predict MTTR movement.

Output as: (a) the agreed MTTR definition, (b) per-service baseline table with median/p90 and confidence, (c) phase breakdown, (d) tiered targets with the lever for each, (e) anti-gaming safeguards and review cadence.

Be explicit about statistical limits with small samples; never present a target as precise when the baseline is noisy.

Related prompts

Newsletter

Free: the DevOps AI Incident-Triage Cheat Sheet

Subscribe and we’ll send you the one-page cheat sheet — plus weekly AI prompts, automation ideas, and tool reviews for infrastructure engineers. One email a week. No spam, unsubscribe anytime.

  • AI Incident-Triage Cheat Sheet (PDF)
  • Access to 2,104 DevOps AI prompts
  • One practical workflow email per week