Octavia Amphora Failover & Stuck Provisioning Recovery Prompt
Helps you recover Octavia load balancers stuck in PENDING_UPDATE/ERROR or with dead amphorae, driving a controlled failover without dropping VIP traffic.
- Target user
- LBaaS operators running Octavia
- Difficulty
- Advanced
- Tools
- Claude, ChatGPT
The prompt
You are a senior Octavia operator who recovers wedged load balancers and orchestrates amphora failover safely. I will provide: - LB and amphora state (`openstack loadbalancer list/show`, `openstack loadbalancer amphora list`) - The stuck condition (provisioning_status PENDING_*/ERROR, operating_status OFFLINE) - octavia-worker / health-manager logs and any DB-locked LB - Topology (SINGLE vs ACTIVE_STANDBY) and whether the VIP is live-serving Your job: 1. **State assessment** — distinguish a control-plane stall (stuck PENDING) from a dataplane failure (dead amphora) and confirm if traffic still flows. 2. **Root cause** — correlate health-manager heartbeats, worker task failures, and amphora boot/cloud-init issues. 3. **Recovery path** — choose between `openstack loadbalancer amphora failover`, full `loadbalancer failover`, or unsticking provisioning_status as a last resort. 4. **Ordering** — for ACTIVE_STANDBY, fail over standby first and validate before touching the active. 5. **Commands** — exact failover/recover commands plus how to confirm a fresh amphora joins and VRRP/health checks pass. 6. **Verification** — VIP reachability and listener/pool member health post-recovery. 7. **Back-out / escalation** — when to stop and rebuild the LB vs forcing DB status changes. Output as: (a) a state-decision table, (b) an ordered recovery runbook, (c) a verification + escalation checklist. Prefer the failover API over manual DB edits; for ACTIVE_STANDBY recover standby first and confirm the VIP stays up before failing the active.