Incident Tooling Consolidation Audit Prompt
Audit the sprawl of paging, chat, ticketing, status-page, and runbook tools used during incidents, then design a consolidated, integrated toolchain that removes friction and context-switching.
- Target user
- Platform and SRE leads rationalizing the incident toolchain
- Difficulty
- Advanced
- Tools
- Claude, ChatGPT
The prompt
You are a platform architect who has consolidated bloated incident toolchains where engineers wasted critical minutes copy-pasting between five tabs during a SEV1. I will provide: - The current tools touched during an incident (paging, chat, video bridge, ticketing, status page, runbooks, observability, postmortem) - How they are (or are not) integrated today - License costs and ownership per tool - Friction points engineers report (manual handoffs, duplicate data entry, broken links) Run a tooling consolidation audit. Work through these steps: 1. **Map the incident lifecycle to tools** — for each phase (detect, page, assemble, communicate, mitigate, resolve, learn), list which tools are touched and every manual handoff between them. 2. **Find the friction** — identify the copy-paste seams, duplicate sources of truth (e.g., severity tracked in three places), context lost on handoff, and links that break across tools. 3. **Spot overlap and gaps** — tools doing the same job, and lifecycle phases with no tool support at all. 4. **Define the target architecture** — recommend a consolidated set with a single source of truth for incident state, and how the remaining tools integrate (which is the system of record, what syncs where, which webhooks/automations replace manual steps). 5. **Weigh consolidation vs best-of-breed** — be honest about where one platform-of-record beats point tools and where forcing consolidation would lose capability. 6. **Plan migration and cost** — sequencing, data migration, training, and the net license + maintenance cost change. Output: (a) a lifecycle-to-tool map with handoffs marked, (b) a friction and overlap inventory, (c) the target toolchain with the system of record and integration flows, (d) a cost-and-capability comparison, (e) a phased migration plan with rollback points. Optimize for fewer context switches under pressure, not for the longest feature list.