Skip to content
CloudOps
Newsletter
All guides
Post Mortems with AI By James Joyner IV · · 10 min read

Writing External RCA Reports for Enterprise Customers With AI

Enterprise customers demand RCA reports after outages. Learn how to write a credible external root cause analysis fast, with AI drafting and humans owning every word.

  • #incident-response
  • #postmortem
  • #communication
  • #rca

The email arrived two hours after we’d resolved a SEV1: “Per our SLA, we require a formal root cause analysis within five business days.” Our biggest customer’s procurement team was asking, and the document we sent would land in front of their leadership as evidence of whether we were a vendor worth keeping. That’s a very different artifact from the internal postmortem we’d already written — and writing it well, under a deadline, while still raw from the incident, is one of the more stressful pieces of incident communication there is.

An external RCA is a customer-facing document that explains what went wrong, why, and what you’re doing about it. Unlike an internal postmortem, it’s a trust instrument. The customer isn’t reading it to learn engineering details; they’re reading it to decide whether your outage was a fluke they can forgive or a pattern that means they should start looking at alternatives. Getting it right matters commercially, and getting it written fast matters because the deadline is contractual. This is a place where AI drafting genuinely helps — as long as humans own every word that goes out.

How an external RCA differs from a postmortem

Your internal postmortem is blameless, technical, and frank — it names the gory details because its only job is to help your team learn. An external RCA is a different document with a different audience and different stakes. It needs to be honest without being self-flagellating, technical enough to be credible but not so technical it loses a non-engineer, and forward-looking in a way that rebuilds confidence. It should never throw an individual engineer under the bus, never speculate about causes you haven’t confirmed, and never make commitments your team can’t keep.

The structure customers expect: a clear summary of impact and duration, a factual timeline, the confirmed root cause explained accessibly, and — the part they care most about — concrete steps you’re taking to prevent recurrence. That last section is the trust rebuild. A customer can forgive an outage; what they can’t forgive is the sense that you don’t understand it or won’t fix it.

Where AI accelerates the draft

You already have the raw material: the internal postmortem, the incident timeline, the channel history. The work of the external RCA is largely translation — taking the frank internal account and reshaping it for an external, less-technical, trust-focused audience. That translation is synthesis, and it’s exactly what a model does fast. Feed it the internal postmortem and ask for a customer-facing version, and you’ve cut the most time-consuming part of meeting the deadline.

The AI Incident Response Assistant can take your internal materials and produce a structured external RCA draft — impact summary, accessible timeline, plain-language root cause, and a remediation section — in the time it’d take to stare at a blank page. The prompt I use: “Convert this internal postmortem into a customer-facing RCA. Audience is a non-technical procurement and leadership team. Be honest and factual, explain the cause in accessible terms, do not name individuals, do not speculate beyond confirmed facts, and frame remediation concretely.” Every constraint there is a guardrail against a specific way external RCAs go wrong.

Pro Tip: Ask the AI to flag any statement in its draft that it inferred rather than found explicitly in your source material. External RCAs live and die on accuracy — a single claim that turns out wrong destroys the document’s credibility and your customer’s trust. Having the model mark its inferences gives your reviewer a punch-list of exactly what to verify before sending.

The human review that has to happen

Here is the non-negotiable part: an external RCA is a high-stakes legal and commercial document, and no model gets to be the final author. The AI produces a fast, well-structured draft. Then a human — ideally with input from someone who understands the customer relationship and the legal exposure — reviews every line. They check that every claimed fact is true, that nothing overcommits, that the tone fits this specific customer, and that nothing in it creates liability the business can’t accept.

This isn’t bureaucratic caution; it’s the core of why the document exists. The RCA is your word to a customer about what happened and what you’ll do. A model can’t weigh whether “we will implement X by Q3” is a promise your team can keep, or whether a particular phrasing admits more fault than is wise, or whether this customer needs reassurance or detail. Those are human judgments tied to relationships, contracts, and consequences the model can’t see. AI drafts the document; a human owns it, signs off on it, and is accountable for it.

A draft that needed exactly that review

On that five-day-deadline RCA, the AI draft was genuinely good — well-structured, accurate to the postmortem, accessible. But the review caught two things that would have hurt us. First, the draft’s remediation section confidently stated a fix would be “fully deployed within two weeks,” a timeline pulled from an optimistic line in the internal notes that engineering hadn’t actually committed to. Sending that would have set up a broken promise. Second, the tone was slightly too casual for this particular customer’s formal leadership culture.

Both were easy human fixes — soften the timeline to what we could actually commit, formalize the tone — but neither was something the model could have known to fix, because both required context that lived outside the documents: engineering’s real capacity and the customer’s culture. The AI got us a strong draft in minutes; the human review made it safe to send. That’s the division working exactly as it should.

The line that never moves

Worth stating plainly: the AI here only drafts and translates. It reads internal materials and produces an external draft. It sends nothing, commits to nothing, and touches no system. The decision about what to tell a customer, what to promise, and when to hit send is entirely human, because those decisions carry consequences a model can’t be accountable for. This is the same principle as everywhere else in incident response — AI for synthesis and communication drafts, humans for the decisions and the actions — and it matters most precisely when the document is going to someone who can fire you as a vendor.

Pro Tip: Keep a small library of past external RCAs that went well, and feed one as a style reference when generating the next. The model will match the tone, structure, and level of commitment your team has already vetted as appropriate, which gets you a draft that’s not just accurate but already in your house style — far less for the human reviewer to reshape.

Turning an outage into retained trust

A well-handled external RCA can leave a customer more confident in you than before the incident, because it demonstrates competence, honesty, and seriousness about fixing things — qualities they couldn’t otherwise observe. A badly handled one, late or inaccurate or evasive, does the opposite and can cost you the account. The difference is often just whether you had the bandwidth to write it well under deadline.

Let the AI carry the drafting load so your team has the time and energy for the part that actually protects the relationship: the careful human review that makes the document true, safe, and right for this customer. Explore more incident response communication practice, and find RCA and customer-comms drafting prompts in the prompt library and prompt packs.

Free download · 368-page PDF

Download the Free 500-Prompt DevOps AI Toolkit

500 battle-tested, copy-paste AI prompts engineered by a senior systems engineer — every one with fill-in placeholders and safety/back-out notes. Drop your email and it's yours.

  • 500 prompts: Linux · Kubernetes · Terraform · OpenStack · GitLab · Docker · Monitoring · Incident Response
  • Instant PDF download — yours free, forever
  • Plus one practical AI-workflow email a week (no spam)

Single opt-in · unsubscribe anytime · no spam.