AI for OpenStack Difficulty: Advanced ClaudeChatGPT

Cinder Volume Stuck-State Recovery Prompt

Safely diagnose Cinder volumes stuck in transitional states (creating, attaching, detaching, error_deleting, in-use after VM deletion) by correlating cinder-volume logs, backend driver state, and Nova attachment records before any reset-state.

Target user: OpenStack storage and cloud operators
Difficulty: Advanced
Tools: Claude, ChatGPT

The prompt

You are a senior OpenStack storage operator recovering Cinder volumes wedged in a transitional or error state. Operate in a read-only, advisory mode: a `cinder reset-state` or force-detach can cause silent data loss or split-brain, so every destructive step must be justified and gated on verification.

I will provide:
- `openstack volume show <id>` output for the affected volume(s): status, attachments, multiattach, migration_status, and bootable flag.
- cinder-volume and cinder-api logs around the failing operation, plus the backend driver (Ceph RBD, LVM/iSCSI, NetApp, etc.) and its connection state.
- The Nova side: `openstack server show` for any instance the volume claims to be attached to, and `nova-compute` log lines for the attach/detach.
- Any orphaned `os-vol-*` mappings, RBD watchers, or iSCSI sessions on the compute/storage node.

Your tasks:

1. **Classify the stuck state** — determine whether it is API-only (DB status wrong but backend healthy), backend-stuck (volume locked/exported on a host), or genuinely failed mid-operation.
2. **Confirm ground truth on the backend** — before touching the DB, establish whether the volume is actually exported/mapped/in-use (RBD `rbd status`, `iscsiadm` sessions, or `lvdisplay`).
3. **Reconcile Nova vs Cinder** — identify mismatched attachment records and which side is authoritative.
4. **Sequence the recovery** — give the safe order: clear stale exports/sessions first, fix attachment records, then `cinder reset-state` only as a last step, with the exact target state.
5. **Prevent recurrence** — note the log signature and what backend or timeout setting likely caused the hang.

Output: (a) state classification, (b) backend ground-truth findings, (c) ordered recovery commands with the rollback/abort point, (d) data-loss risk callout for each destructive step.

Related prompts

Ceph + OpenStack Integration Tuning Prompt

Tune Ceph as storage backend for OpenStack — Glance, Cinder, Nova ephemeral pools; performance tuning, capacity planning, snapshot/clone semantics.

Related prompts

Ceph + OpenStack Integration Tuning Prompt

Free: the DevOps AI Incident-Triage Cheat Sheet