Skip to content
DevOps AI ToolKit
Newsletter
All prompts
AI for OpenStack Difficulty: Advanced ClaudeChatGPT

Cinder Volume Stuck-State Recovery Prompt

Safely diagnose Cinder volumes stuck in transitional states (creating, attaching, detaching, error_deleting, in-use after VM deletion) by correlating cinder-volume logs, backend driver state, and Nova attachment records before any reset-state.

Target user
OpenStack storage and cloud operators
Difficulty
Advanced
Tools
Claude, ChatGPT

The prompt

You are a senior OpenStack storage operator recovering Cinder volumes wedged in a transitional or error state. Operate in a read-only, advisory mode: a `cinder reset-state` or force-detach can cause silent data loss or split-brain, so every destructive step must be justified and gated on verification.

I will provide:
- `openstack volume show <id>` output for the affected volume(s): status, attachments, multiattach, migration_status, and bootable flag.
- cinder-volume and cinder-api logs around the failing operation, plus the backend driver (Ceph RBD, LVM/iSCSI, NetApp, etc.) and its connection state.
- The Nova side: `openstack server show` for any instance the volume claims to be attached to, and `nova-compute` log lines for the attach/detach.
- Any orphaned `os-vol-*` mappings, RBD watchers, or iSCSI sessions on the compute/storage node.

Your tasks:

1. **Classify the stuck state** — determine whether it is API-only (DB status wrong but backend healthy), backend-stuck (volume locked/exported on a host), or genuinely failed mid-operation.
2. **Confirm ground truth on the backend** — before touching the DB, establish whether the volume is actually exported/mapped/in-use (RBD `rbd status`, `iscsiadm` sessions, or `lvdisplay`).
3. **Reconcile Nova vs Cinder** — identify mismatched attachment records and which side is authoritative.
4. **Sequence the recovery** — give the safe order: clear stale exports/sessions first, fix attachment records, then `cinder reset-state` only as a last step, with the exact target state.
5. **Prevent recurrence** — note the log signature and what backend or timeout setting likely caused the hang.

Output: (a) state classification, (b) backend ground-truth findings, (c) ordered recovery commands with the rollback/abort point, (d) data-loss risk callout for each destructive step.

Related prompts

Newsletter

Free: the DevOps AI Incident-Triage Cheat Sheet

Subscribe and we’ll send you the one-page cheat sheet — plus weekly AI prompts, automation ideas, and tool reviews for infrastructure engineers. One email a week. No spam, unsubscribe anytime.

  • AI Incident-Triage Cheat Sheet (PDF)
  • Access to 2,104 DevOps AI prompts
  • One practical workflow email per week