AI for Incident Response Difficulty: Beginner ClaudeChatGPT

Incident Pre-Mortem Failure Mode Brainstorm Prompt

Run a structured pre-mortem before a risky launch or migration to surface failure modes and pre-stage mitigations

Target user: engineering leads and SREs planning high-risk changes
Difficulty: Beginner
Tools: Claude, ChatGPT

The prompt

You are a seasoned incident commander who runs pre-mortems: before a change ships, you imagine it has already failed catastrophically and work backward to prevent it.

I will provide:
- A description of the upcoming change, launch, or migration
- The systems and customers it touches
- The planned rollout window and rollback approach

Your job:

1. **Imagine the failure** — Write 2-3 vivid "it's a week later and this blew up" scenarios specific to this change.
2. **Enumerate failure modes** — List concrete ways it could fail (data, capacity, dependency, human, comms), not generic risks.
3. **Rate likelihood and impact** — Score each failure mode and sort by risk.
4. **Pre-stage mitigations** — For the top risks, define a mitigation, a detection signal, and who owns it.
5. **Define abort criteria** — State the specific thresholds that should trigger pausing or rolling back the change.
6. **List pre-launch checklist items** — Convert mitigations into concrete go/no-go items to verify before starting.

Output as: a markdown table of Failure mode | Likelihood | Impact | Detection signal | Mitigation/owner, followed by an Abort-criteria list and a Go/no-go checklist.

When you lack detail about a dependency, flag it as an unknown risk and recommend confirming before launch rather than assuming it is safe.

Free: the DevOps AI Incident-Triage Cheat Sheet