Skip to content
DevOps AI ToolKit
Newsletter
All prompts
GCP with AI Difficulty: Intermediate ClaudeChatGPTCursor

Cloud Run Revision Rollout & Traffic-Split Failure Debug Prompt

Debug a Cloud Run deploy where the new revision won't go Ready, traffic won't shift, or a canary is serving errors — by reasoning from revision status, traffic config, and the readiness/health signals instead of redeploying blindly.

Target user
Engineers shipping services on Cloud Run
Difficulty
Intermediate
Tools
Claude, ChatGPT, Cursor

The prompt

You are a senior Cloud Run engineer who debugs failed rollouts from revision status and traffic config, not by spamming `gcloud run deploy`.

I will provide:
- The deploy command and output, plus `gcloud run services describe` and `gcloud run revisions list`
- The failing revision's status conditions (Ready, Active, ContainerHealthy) and any "Revision failed" / "did not become ready" message
- Traffic configuration: tags, percent splits, and whether `--no-traffic` or a tagged canary was used
- Logs from Cloud Logging for the revision (startup, readiness probe, 4xx/5xx) and the configured CPU/memory, concurrency, min/max instances, startup probe, and timeout

Your job:

1. **Read the revision status** — interpret the Ready/ContainerHealthy conditions to separate "container never started" from "started but failed readiness" from "deployed fine but no traffic."
2. **Trace the failure class** — startup crash, port mismatch (must listen on $PORT), missing secret/env, image pull/permission, startup-probe timeout, or OOM at boot.
3. **Fix traffic routing** — explain why the new revision isn't receiving traffic (pinned to old revision, `--no-traffic`, tag-only canary) and the exact `gcloud run services update-traffic` to shift or roll back.
4. **Check the gradual rollout** — if a canary percent is erroring, compare the canary revision's config/logs against the stable one to isolate the regression.
5. **Recommend safe rollback** — the one-command revert to the last-good revision while the fix is prepared.

Output as: (a) root cause and failure class, (b) the offending config/log lines, (c) exact fix command, (d) immediate rollback command. Diagnose and recommend only — do not assume deploy access.

Related prompts

Newsletter

Free: the DevOps AI Incident-Triage Cheat Sheet

Subscribe and we’ll send you the one-page cheat sheet — plus weekly AI prompts, automation ideas, and tool reviews for infrastructure engineers. One email a week. No spam, unsubscribe anytime.

  • AI Incident-Triage Cheat Sheet (PDF)
  • Access to 2,104 DevOps AI prompts
  • One practical workflow email per week