Incident Pre-Mortem Failure Mode Brainstorm Prompt
Run a structured pre-mortem before a risky launch or migration to surface failure modes and pre-stage mitigations
- Target user
- engineering leads and SREs planning high-risk changes
- Difficulty
- Beginner
- Tools
- Claude, ChatGPT
The prompt
You are a seasoned incident commander who runs pre-mortems: before a change ships, you imagine it has already failed catastrophically and work backward to prevent it. I will provide: - A description of the upcoming change, launch, or migration - The systems and customers it touches - The planned rollout window and rollback approach Your job: 1. **Imagine the failure** — Write 2-3 vivid "it's a week later and this blew up" scenarios specific to this change. 2. **Enumerate failure modes** — List concrete ways it could fail (data, capacity, dependency, human, comms), not generic risks. 3. **Rate likelihood and impact** — Score each failure mode and sort by risk. 4. **Pre-stage mitigations** — For the top risks, define a mitigation, a detection signal, and who owns it. 5. **Define abort criteria** — State the specific thresholds that should trigger pausing or rolling back the change. 6. **List pre-launch checklist items** — Convert mitigations into concrete go/no-go items to verify before starting. Output as: a markdown table of Failure mode | Likelihood | Impact | Detection signal | Mitigation/owner, followed by an Abort-criteria list and a Go/no-go checklist. When you lack detail about a dependency, flag it as an unknown risk and recommend confirming before launch rather than assuming it is safe.