AI for OpenStack
Troubleshoot Nova, Neutron, Cinder, RabbitMQ, and Keystone with AI-assisted workflows.
Prompts
- Advanced
Cinder Volume Stuck-State Recovery Prompt
Safely diagnose Cinder volumes stuck in transitional states (creating, attaching, detaching, error_deleting, in-use after VM deletion) by correlating cinder-volume logs, backend driver state, and Nova attachment records before any reset-state.
- Claude
- ChatGPT
Open prompt - Intermediate
Glance Image Upload & Store Failure Triage Prompt
Diagnose Glance image problems — uploads stuck in saving/queued, images that won't boot due to bad properties or format, store backend errors (Ceph/Swift/file), and signature/conversion failures — before re-uploading or deleting image data.
- Claude
- ChatGPT
Open prompt - Intermediate
Keystone Token & Auth Failure Triage Prompt
Diagnose Keystone authentication and authorization failures (401/403, expired Fernet tokens, broken role assignments, federation/LDAP lookup errors) by tracing the auth request through token validation, catalog, and policy enforcement.
- Claude
- ChatGPT
Open prompt - Advanced
Neutron Port Binding Failed Debug Prompt
Diagnose Neutron ports stuck in BINDING_FAILED or DOWN by tracing the ML2 mechanism driver negotiation, the L2 agent state on the compute host, and the segment/physnet mapping that the vif_type resolution depends on.
- Claude
- ChatGPT
Open prompt - Advanced
Nova-compute Host Health Recovery Prompt
Triage an unhealthy nova-compute host reporting as down in the service list — distinguishing a dead nova-compute service, a hung libvirt/qemu, an AMQP heartbeat problem, or a wedged hypervisor — and recover it without endangering running instances.
- Claude
- ChatGPT
Open prompt - Advanced
Nova Live Migration Failure Debug Prompt
Debug failed or stuck Nova live migrations — pre-check rejections, instances stuck in MIGRATING, libvirt 'migration job' errors, and post-migration cleanup left on the source host — across shared and block (non-shared) storage scenarios.
- Claude
- ChatGPT
Open prompt - Advanced
Nova NoValidHost Scheduler Triage Prompt
Diagnose why nova-scheduler returns NoValidHost for a boot/migrate request by walking the filter chain, Placement allocation candidates, and host aggregate metadata to find which constraint eliminated every compute host.
- Claude
- ChatGPT
Open prompt - Advanced
Oslo.messaging RabbitMQ Backlog Triage Prompt
Diagnose OpenStack control-plane slowness or stuck operations caused by RabbitMQ/oslo.messaging issues: ballooning reply/notification queues, partitioned clusters, stale agent consumers, and RPC timeouts across Nova/Neutron/Cinder.
- Claude
- ChatGPT
Open prompt - Advanced
Placement Inventory & Allocation Reconcile Prompt
Audit and reconcile the OpenStack Placement service for resource-provider inventory drift, leaked/orphaned allocations, and incorrect allocation ratios that cause phantom 'full' hypervisors or scheduling failures.
- Claude
- ChatGPT
Open prompt - Advanced
Cinder NetApp & NFS Driver Mount/Export Debug Prompt
Diagnose Cinder NFS/NetApp backend failures where volumes won't attach because of stale exports, mount option drift, or export-policy mismatches.
- Claude
- ChatGPT
Open prompt - Advanced
Glance Cinder-Store & Image-Volume Cache Tuning Prompt
Tune the Glance cinder store and Cinder's image-volume cache so booting from image skips redundant downloads and uses backend-side cloning.
- Claude
- ChatGPT
Open prompt - Advanced
Heat Nested Stack & ResourceGroup Index Design Prompt
Design maintainable Heat templates using nested stacks, ResourceGroup, conditions, and index_var so scaling and rolling updates don't replace the wrong resources.
- Claude
- ChatGPT
- Cursor
Open prompt - Advanced
Keystone Fernet to JWS Token Provider Migration Prompt
Plan a safe cutover from the Fernet token provider to JWS (asymmetric) tokens across a multi-node Keystone deployment without invalidating live sessions.
- Claude
- ChatGPT
Open prompt - Advanced
Magnum Cluster Autoscaler & Node Group Rolling Upgrade Prompt
Design a safe node-group rolling upgrade and cluster-autoscaler config for Magnum Kubernetes clusters so node replacements drain pods without dropping workloads.
- Claude
- ChatGPT
Open prompt - Advanced
Neutron OVN BGP EVPN L3 Gateway Design Prompt
Design and validate an OVN-BGP-Agent EVPN/VRF deployment so tenant networks are advertised into a fabric without overlapping VLAN sprawl.
- Claude
- ChatGPT
Open prompt - Advanced
Nova SR-IOV & PCI Passthrough Whitelist Debug Prompt
Diagnose why SR-IOV VFs or passthrough PCI devices fail to attach, get the wrong NUMA affinity, or never show up in Placement inventory.
- Claude
- ChatGPT
- Cursor
Open prompt - Advanced
Nova vTPM & Emulated Secure Boot Flavor Design Prompt
Design Nova flavors and image properties for vTPM and UEFI Secure Boot so confidential/Windows workloads boot with measured boot and a persistent TPM, validated before fleet use.
- Claude
- ChatGPT
Open prompt - Advanced
Swift Proxy Memcache & Rate-Limit Tuning Prompt
Tune the Swift proxy-server pipeline (memcache, ratelimit, container/account caching) to stop 503 storms and cache stampedes under load.
- Claude
- ChatGPT
Open prompt - Intermediate
Aodh Alarm Evaluation Debug Prompt
Diagnose Aodh alarms that never transition state, fire false positives, or fail to trigger their action URL for auto-scaling and alerting.
- Claude
- ChatGPT
Open prompt - Intermediate
Blazar Host Reservation and Lease Design Prompt
Design Blazar resource reservation — dedicated host leases, instance reservations, and capacity carve-outs without starving general scheduling.
- Claude
- ChatGPT
Open prompt - Intermediate
CloudKitty Rating and Chargeback Pipeline Design Prompt
Design a CloudKitty rating pipeline — collector sources, hashmap/pyscript rules, and per-tenant chargeback reports that reconcile with usage.
- Claude
- ChatGPT
Open prompt - Advanced
Cyborg Accelerator Device Debug Prompt
Diagnose Cyborg GPU/FPGA accelerator attach failures, missing device profiles, and placement resource-provider mismatches for instances.
- Claude
- ChatGPT
Open prompt - Advanced
Mistral Workflow Execution Debug Prompt
Diagnose Mistral workflow executions stuck in RUNNING, tasks erroring on YAQL/Jinja, and action failures against OpenStack services.
- Claude
- ChatGPT
Open prompt - Intermediate
Senlin Cluster Scaling Policy Debug Prompt
Troubleshoot Senlin auto-scaling clusters where scaling policies fail to fire, nodes get stuck in ERROR, or health policy recovery loops.
- Claude
- ChatGPT
Open prompt - Advanced
Tacker VNF Lifecycle Debug Prompt
Troubleshoot Tacker NFV orchestration — VNF instantiate/heal/scale failures, VIM connectivity, and TOSCA/VNFD template errors.
- Claude
- ChatGPT
Open prompt - Advanced
Trove Database Replication and Failover Debug Prompt
Diagnose Trove DBaaS replication lag, broken replica chains, and failed promote/failover operations on MySQL/PostgreSQL instances.
- Claude
- ChatGPT
Open prompt - Advanced
Zun Container Lifecycle Debug Prompt
Troubleshoot Zun containers stuck in Creating/Error, networking failures via Kuryr, and capsule scheduling problems on compute hosts.
- Claude
- ChatGPT
Open prompt - Intermediate
Cinder Scheduler Weigher & Capacity Filter Tuning Prompt
Tune the Cinder scheduler filters and weighers so volumes land on the right backend pool — balancing capacity, allocation ratio, and affinity — instead of clustering on one backend or hitting 'no valid backend'.
- Claude
- ChatGPT
Open prompt - Intermediate
Designate Zone Transfer & Secondary Zone Debug Prompt
Diagnose why Designate secondary zones won't transfer (AXFR/IXFR), NOTIFY isn't received, or zone serials drift from an external master, while keeping the backend nameservers in sync.
- Claude
- ChatGPT
Open prompt - Advanced
Keystone LDAP Identity Backend Debug Prompt
Diagnose why Keystone authentication against an external LDAP/Active Directory identity backend fails, returns wrong group memberships, or is slow, while keeping service accounts in the SQL backend.
- Claude
- ChatGPT
Open prompt - Advanced
Kolla-Ansible TLS Certificate Deployment Review Prompt
Review and plan TLS enablement in a Kolla-Ansible deployment — external/internal API TLS, backend TLS, and certificate rotation — without breaking endpoint URLs or internal service traffic.
- Claude
- ChatGPT
Open prompt - Advanced
Neutron OVN Southbound DB Bloat & Compaction Debug Prompt
Diagnose a bloated or slow OVN Southbound/Northbound database in a Neutron OVN deployment — runaway size, slow ovsdb-server, chassis churn — and compact it safely without disrupting the dataplane.
- Claude
- ChatGPT
Open prompt - Intermediate
Nova Database Archive & Purge Runbook Prompt
Safely shrink a bloated Nova database by archiving deleted rows to shadow tables and purging them, without breaking cells-v2 mappings or losing audit data.
- Claude
- ChatGPT
Open prompt - Advanced
Nova vGPU Mediated Device Flavor Design Prompt
Design Nova compute configuration and flavors for vGPU workloads using mediated (mdev) devices, mapping mdev types to Placement resource providers without stranding GPU capacity.
- Claude
- ChatGPT
Open prompt - Intermediate
Swift Large Object SLO/DLO Strategy Design Prompt
Design a Swift large-object strategy choosing between Static Large Objects (SLO) and Dynamic Large Objects (DLO), segment sizing, and manifest handling for multi-gigabyte uploads.
- Claude
- ChatGPT
Open prompt - Advanced
Cinder Orphaned Attachment Cleanup Prompt
Find and safely clean up orphaned or duplicate Cinder volume attachments left behind by failed deletes, evacuations, or migrations, where attachment records no longer map to live instances.
- Claude
- ChatGPT
Open prompt - Intermediate
Glance Image Stuck in saving or killed Status Recovery Prompt
Recover Glance images wedged in saving, importing, queued, or killed status after a failed upload or import, reconciling DB status with backend store data.
- Claude
- ChatGPT
Open prompt - Intermediate
Keystone Token Validation Latency Debug Prompt
Diagnose slow API calls cloud-wide caused by Keystone token validation latency, covering Fernet overhead, catalog size, caching misses, and auth_token middleware behavior.
- Claude
- ChatGPT
Open prompt - Advanced
Neutron OVN northd Sync Lag Debug Prompt
Diagnose why logical resources in the OVN Northbound DB are not propagating to the Southbound DB or chassis, causing ports that never go ACTIVE or traffic that never programs.
- Claude
- ChatGPT
Open prompt - Advanced
Nova Block Device Mapping and Volume Attach Stuck Recovery Prompt
Recover instances stuck attaching or detaching a Cinder volume, where Nova's block_device_mapping, the Cinder attachment, and the hypervisor's view of the disk have diverged.
- Claude
- ChatGPT
Open prompt - Advanced
Nova Resize Stuck in VERIFY_RESIZE Recovery Prompt
Recover instances stuck in VERIFY_RESIZE, RESIZE_MIGRATING, or RESIZE_PREP after a failed or abandoned resize/cold-migration, restoring clean ACTIVE or original state.
- Claude
- ChatGPT
Open prompt - Advanced
OpenStack AMQP TLS Certificate Rotation Runbook Prompt
Plan and execute rotation of RabbitMQ AMQP TLS certificates across all OpenStack services without dropping RPC connectivity or stranding controllers, computes, and agents.
- Claude
- ChatGPT
Open prompt - Advanced
Placement Allocation-Ratio and Inventory Mismatch Debug Prompt
Resolve cases where Placement inventory, allocation ratios, or reserved values disagree with nova.conf or actual host capacity, causing wrong capacity, NoValidHost, or surprise overcommit.
- Claude
- ChatGPT
Open prompt - Intermediate
RabbitMQ oslo.messaging RPC Timeout Debug Prompt
Triage MessagingTimeout and lost-reply errors between OpenStack services, distinguishing slow workers, broken reply queues, broker overload, and oslo.messaging misconfiguration.
- Claude
- ChatGPT
Open prompt - Intermediate
Cinder Volume QoS & Front-End Throttling Design Prompt
Helps you design Cinder QoS specs and volume-type bindings to cap IOPS/throughput per volume and prevent a single tenant from saturating shared Ceph or LVM backends.
- Claude
- ChatGPT
Open prompt - Advanced
Glance Web-Download & URI Import Hardening Prompt
Helps you safely enable and lock down Glance's web-download / URI image import method so users can pull images from URLs without exposing internal networks to SSRF.
- Claude
- ChatGPT
Open prompt - Advanced
Keystone Hierarchical Project & Nested Quota Design Prompt
Helps you design a Keystone project hierarchy with nested quotas so business units self-manage sub-projects without exceeding a parent allocation.
- Claude
- ChatGPT
Open prompt - Intermediate
Kolla-Ansible Config Override & Reconfigure Runbook Prompt
Helps you apply custom service config via Kolla-Ansible's /etc/kolla/config overrides and run a targeted reconfigure without redeploying the whole control plane.
- Claude
- ChatGPT
Open prompt - Intermediate
Neutron Floating IP Port Forwarding (DNAT) Design Prompt
Helps you design and troubleshoot Neutron floating IP port forwarding so multiple instances share a single public IP across distinct ports under OVN or L3-agent.
- Claude
- ChatGPT
Open prompt - Advanced
Nova Allocation Ratio & Overcommit Tuning Prompt
Helps you safely set per-host and per-aggregate CPU/RAM/disk allocation ratios in Nova so you maximize density without triggering OOM kills or noisy-neighbor problems.
- Claude
- ChatGPT
Open prompt - Intermediate
Nova noVNC & Console Proxy Troubleshooting Prompt
Helps you diagnose broken instance consoles where noVNC/SPICE sessions fail to connect, time out, or show a black screen across the proxy chain.
- Claude
- ChatGPT
Open prompt - Advanced
Octavia Amphora Failover & Stuck Provisioning Recovery Prompt
Helps you recover Octavia load balancers stuck in PENDING_UPDATE/ERROR or with dead amphorae, driving a controlled failover without dropping VIP traffic.
- Claude
- ChatGPT
Open prompt - Advanced
Placement Resource Provider Inventory Reshape Debug Prompt
Helps you diagnose and repair Placement reshape failures where allocations get stranded between root and nested resource providers (NUMA, PCI, VGPU) after a Nova upgrade.
- Claude
- ChatGPT
Open prompt - Advanced
Cinder Volume Replication & DR Failover Design Prompt
Design Cinder cheesecake-style volume replication and host failover/failback so block storage survives a backend or site outage with a tested, ordered recovery runbook.
- Claude
- ChatGPT
Open prompt - Intermediate
Glance Tenant Image Sharing & Quota Design Prompt
Design Glance image visibility, member-based sharing, and per-project quotas so teams share golden images safely without exposing private images or blowing past store capacity.
- Claude
- ChatGPT
Open prompt - Advanced
Heat Software Deployment & Config Agent Debug Prompt
Diagnose stuck Heat SoftwareDeployment resources — os-collect-config, os-refresh-config, and os-apply-config agents, signaling, and the in-instance config hooks that silently hang stack creation.
- Claude
- ChatGPT
Open prompt - Intermediate
Ironic Node Cleaning & Lifecycle Debug Prompt
Diagnose Ironic bare-metal nodes stuck in cleaning, clean-wait, or maintenance — covering the state machine, clean steps, ramdisk logs, and conductor takeover so nodes return to available.
- Claude
- ChatGPT
Open prompt - Advanced
Keystone Multi-Region & Domain-Scoped RBAC Design Prompt
Design Keystone identity for multi-region OpenStack — domains, projects, scoped roles, and the new secure RBAC (reader/member/admin + system scope) so tenancy stays isolated and least-privilege.
- Claude
- ChatGPT
Open prompt - Intermediate
Neutron DHCP Agent & Address Scope Debug Prompt
Diagnose why instances fail to lease IPs, get wrong DNS, or cannot route between subnets — covering Neutron DHCP agents, dnsmasq, address scopes, and subnet pools.
- Claude
- ChatGPT
Open prompt - Advanced
Nova Server Group & Anti-Affinity Scheduling Design Prompt
Design Nova server groups with affinity, anti-affinity, and soft policies so critical workloads spread across hosts, fault domains, and racks without breaking scheduling at scale.
- Claude
- ChatGPT
Open prompt - Advanced
Octavia UDP & SCTP Listener Health Design Prompt
Design Octavia UDP and SCTP load balancers with correct health monitoring, session persistence, and amphora sizing for DNS, VoIP, gaming, and telco signaling workloads.
- Claude
- ChatGPT
Open prompt - Advanced
Swift Account Reaper & Replication Lag Debug Prompt
Diagnose Swift consistency problems — lagging replication, stuck account-reaper deletions, and dispersion gaps — so deleted accounts actually free space and object durability stays intact.
- Claude
- ChatGPT
Open prompt - Advanced
Cinder Generic Volume Group & Consistency Snapshot Design Prompt
Design Cinder generic volume groups and crash-consistent group snapshots so multi-volume applications (databases, clustered apps) can be snapshotted and restored as one atomic unit.
- Claude
- ChatGPT
Open prompt - Beginner
Glance Interoperable Image Import Debug Prompt
Troubleshoot Glance's interoperable image import (web-download, glance-direct, copy-image) when uploads stall in 'importing', plugins fail, or multi-store copies never complete.
- Claude
- ChatGPT
Open prompt - Advanced
Heat Autoscaling Group & Aodh Alarm Design Prompt
Design a Heat OS::Heat::AutoScalingGroup with scale-up/down policies driven by Aodh/Ceilometer alarms, including cooldowns, signaling, and safe stack updates.
- Claude
- ChatGPT
Open prompt - Advanced
Keystone OAuth2.0 & mTLS Client Auth Design Prompt
Design certificate-bound OAuth2.0 client-credentials and mutual-TLS authentication for Keystone so machine clients get scoped tokens without long-lived passwords.
- Claude
- ChatGPT
Open prompt - Intermediate
Neutron Metadata & Config-Drive Debug Prompt
Diagnose why instances fail to fetch metadata (no SSH key, cloud-init hangs at 169.254.169.254) across isolated networks, DVR, and config-drive fallback.
- Claude
- ChatGPT
Open prompt - Intermediate
Nova Flavor Extra-Specs & Aggregate Scheduling Design Prompt
Design a clean flavor catalog with extra-specs and host-aggregate metadata so workloads land on the right hardware (GPU, high-mem, fast-NVMe) via the AggregateInstanceExtraSpecsFilter.
- Claude
- ChatGPT
Open prompt - Intermediate
Octavia Health Monitor & Connection Draining Tuning Prompt
Tune Octavia health monitors, timeouts, and member draining so load balancers fail over fast without flapping or dropping in-flight connections during deploys.
- Claude
- ChatGPT
Open prompt - Advanced
Swift Container Sharding Design Prompt
Plan and execute Swift container database sharding for large accounts — identify hot containers, size shard ranges, run swift-manage-shard-ranges safely, and verify replication without downtime.
- Claude
- ChatGPT
Open prompt - Intermediate
Cinder Volume Migration & Retype Runbook Prompt
Plan safe Cinder volume migrations and retypes — moving volumes between backends or volume types with the right migration policy and minimal attached-instance disruption.
- Claude
- ChatGPT
Open prompt - Intermediate
Glance & Cinder Image-Volume Conversion Prompt
Design boot-from-volume and image↔volume conversion flows across Glance and Cinder — enabling the image-volume cache, format conversion, and fast Ceph clone-backed boots.
- Claude
- ChatGPT
Open prompt - Advanced
Keystone Fernet Key Rotation Rollout Prompt
Design a safe Fernet (and Credential) key rotation schedule and distribution mechanism across multi-node, multi-region Keystone so tokens never silently invalidate.
- Claude
- ChatGPT
Open prompt - Advanced
Neutron BGP Dynamic Routing Design Prompt
Design Neutron dynamic routing with neutron-dynamic-routing/BGP (or OVN BGP agent) to advertise tenant and floating-IP prefixes upstream and retire static routes on the network fabric.
- Claude
- ChatGPT
Open prompt - Intermediate
Nova Host Evacuation & Maintenance Runbook Prompt
Build a safe runbook to drain, patch, and return a Nova compute host to service — choosing between live-migration, cold migration, and evacuate for dead hosts without losing instances.
- Claude
- ChatGPT
Open prompt - Intermediate
Octavia Flavor & Active-Standby Topology Design Prompt
Design Octavia load-balancer flavors and amphora topology — choosing SINGLE vs ACTIVE_STANDBY, compute/network flavors, and VRRP tuning for the right availability and cost per tier.
- Claude
- ChatGPT
Open prompt - Advanced
Placement Aggregates & Traits Allocation Design Prompt
Design Placement resource-provider aggregates, traits, and required/forbidden trait flavor extra-specs so the Nova scheduler lands workloads on the right hardware tiers.
- Claude
- ChatGPT
Open prompt - Advanced
Swift Erasure Coding Storage Policy Design Prompt
Design Swift erasure-coding storage policies — picking EC scheme, fragment/parity counts, and region layout to cut raw-capacity cost while keeping durability and read latency acceptable.
- Claude
- ChatGPT
Open prompt - Advanced
OpenStack Watcher Cluster Optimization Design Prompt
Design and tune OpenStack Watcher audits and action plans to consolidate workloads, rebalance noisy neighbors, and cut power draw without breaking affinity rules or SLAs.
- Claude
- ChatGPT
Open prompt - Advanced
Cinder Volume Encryption & Barbican Key Design Prompt
Design and troubleshoot LUKS-based Cinder volume encryption backed by Barbican — encrypted volume types, key lifecycle, performance impact, and recovery when keys go missing.
- Claude
- ChatGPT
Open prompt - Advanced
Glance Image Signing & Trusted Boot Prompt
Enable and troubleshoot Glance image signature verification with Barbican-stored keys so Nova only boots cryptographically trusted images, plus a signing pipeline for golden images.
- Claude
- ChatGPT
Open prompt - Beginner
Horizon Dashboard Troubleshooting Prompt
Diagnose OpenStack Horizon failures — login loops, blank panels, session/CSRF errors, slow page loads, and missing services — by tracing the Django app, Keystone, and API endpoint chain.
- Claude
- ChatGPT
Open prompt - Intermediate
Keystone Application Credentials & Trusts Design Prompt
Design secure non-human authentication in Keystone using application credentials and trusts — scoping, access rules, expiry, and migrating CI/automation off long-lived passwords.
- Claude
- ChatGPT
Open prompt - Intermediate
Neutron Security Groups & FWaaS v2 Design Prompt
Design tenant-facing security groups and FWaaS v2 firewall policies in Neutron — rule hygiene, stateful vs stateless, OVS conntrack behavior, and debugging silently dropped traffic.
- Claude
- ChatGPT
Open prompt - Advanced
Neutron SR-IOV & Trunk Port Design Prompt
Design Neutron SR-IOV passthrough and trunk-port networking for high-throughput VNFs — PF/VF allocation, sriov-agent config, VLAN-aware VMs, and debugging ports stuck in BUILD.
- Claude
- ChatGPT
Open prompt - Advanced
Nova Cells v2 Scaling & Architecture Prompt
Design and operate Nova Cells v2 for large OpenStack clouds — cell sizing, message-queue and DB partitioning, scheduler/conductor placement, and debugging cross-cell instance issues.
- Claude
- ChatGPT
Open prompt - Advanced
Nova NUMA, CPU Pinning & Huge-Page Flavor Design Prompt
Design Nova flavors and compute configuration for NFV/high-performance workloads — CPU pinning, NUMA topology, huge pages, and emulator-thread isolation — without stranding capacity.
- Claude
- ChatGPT
Open prompt - Intermediate
Octavia TLS Termination & SNI Listener Design Prompt
Design Octavia load-balancer listeners for TLS termination and SNI — Barbican-backed certificates, cipher policy, HTTP/2, and debugging handshake failures across multiple hostnames.
- Claude
- ChatGPT
Open prompt - Advanced
Cinder Over-Subscription & Thin Provisioning Design Prompt
Tune Cinder thin provisioning and over-subscription ratios safely — capacity reporting, max_over_subscription_ratio, reserved space, and scheduler capacity filters — so you maximize density without risking backend full-disk events that freeze every volume.
- Claude
- ChatGPT
Open prompt - Intermediate
Glance Multi-Store & Image Cache Design Prompt
Architect Glance multi-store backends and the image cache — Ceph RBD vs file vs Swift stores, store priorities, copy-image, and per-compute caching — so instance boots are fast and image storage is placed cost-effectively.
- Claude
- ChatGPT
Open prompt - Intermediate
Heat Template (HOT) Authoring Best Practices Prompt
Author maintainable, idempotent Heat Orchestration Templates — parameters, nested stacks, conditions, software deployments, and update policies — that deploy cleanly and update in place without destroying live resources.
- Claude
- ChatGPT
Open prompt - Advanced
Kolla-Ansible Rolling Upgrade Orchestration Prompt
Plan and execute a zero-downtime Kolla-Ansible release-to-release upgrade — image bumps, DB schema sync, control-plane rolling restarts, and per-service upgrade ordering — with verified rollback points at each phase.
- Claude
- ChatGPT
Open prompt - Advanced
ML2/OVS to OVN Migration Planning Prompt
Plan a phased migration from Neutron ML2/OVS (with L3/DHCP agents) to OVN — control-plane mapping, agent-to-OVN-controller equivalence, data-plane cutover, and feature-parity gaps — with a tested rollback at each step.
- Claude
- ChatGPT
Open prompt - Advanced
Neutron L3 Agent & DVR Routing Design Prompt
Design and validate a Neutron L3 routing topology — centralized vs distributed (DVR) vs HA routers, L3 agent placement, network-node bottlenecks, and SNAT/north-south traffic paths — for predictable east-west and external connectivity.
- Claude
- ChatGPT
Open prompt - Intermediate
Nova Compute Service Down Recovery Prompt
Diagnose and recover nova-compute hosts that show as down in the service list — agent crashes, RabbitMQ heartbeat loss, clock skew, hypervisor lockups — without stranding instances or triggering false-positive evacuations.
- Claude
- ChatGPT
Open prompt - Intermediate
Octavia Amphora Image Build & Rotation Prompt
Build, test, and rotate Octavia Amphora images safely — diskimage-builder pipelines, image tagging, certificate rotation, and rolling amphora replacement — so load balancers stay current and secure without dropping tenant traffic.
- Claude
- ChatGPT
Open prompt - Advanced
RabbitMQ Performance Tuning for OpenStack Prompt
Tune RabbitMQ for an OpenStack control plane — queue/HA policies, connection and channel limits, heartbeats, prefetch, memory/flow-control watermarks, and durable vs transient reply queues — so RPC stays fast and the broker never wedges under load.
- Claude
- ChatGPT
Open prompt - Intermediate
Cinder Multi-Backend & Volume-Type Design Prompt
Design Cinder multi-backend storage with sane volume types, extra-specs, QoS, and scheduler hints — so volumes land on the right backend (Ceph, LVM, NetApp) and the cinder-scheduler stops sending them to the wrong place.
- Claude
- ChatGPT
Open prompt - Intermediate
Keystone Service Catalog & Endpoint Debug Prompt
Fix broken service-catalog and endpoint problems — clients hitting internal URLs from outside, wrong regions, missing endpoints, SSL/hostname mismatches, and 'public endpoint not found' errors after a deploy or TLS change.
- Claude
- ChatGPT
Open prompt - Advanced
Neutron OVS Dataplane & Flow Trace Prompt
Trace a packet through the Open vSwitch dataplane — br-int, br-tun, br-ex, VLAN/VXLAN tagging, and OpenFlow tables — to find where tenant traffic is silently dropped in an ML2/OVS deployment.
- Claude
- ChatGPT
Open prompt - Intermediate
Nova Instance Stuck-State Recovery Prompt
Recover instances stuck in ERROR, BUILD, REBOOT, DELETING, or task_state limbo — reconcile the Nova DB state with the actual libvirt domain, and reset state safely without orphaning resources.
- Claude
- ChatGPT
Open prompt - Advanced
Nova Placement & Resource Provider Debug Prompt
Diagnose why the Placement service reports the wrong capacity — phantom allocations, stale resource providers, inventory/allocation-ratio mismatches, and 'No valid host' failures rooted in Placement rather than the Nova scheduler filters.
- Claude
- ChatGPT
Open prompt - Advanced
OpenStack-Ansible Deployment Debug Prompt
Debug failing OpenStack-Ansible (OSA) playbook runs — LXC container issues, inventory/group_vars problems, repo-build and venv failures, and idempotency breaks — to get a clean converge on deploy or upgrade.
- Claude
- ChatGPT
Open prompt - Intermediate
OpenStack Floating IP & SNAT Debug Prompt
Diagnose broken north-south connectivity — floating IPs that don't reach instances, missing SNAT for outbound traffic, and router namespace problems across centralized L3 and DVR deployments.
- Claude
- ChatGPT
Open prompt - Intermediate
OpenStack Quota Management & Reconciliation Prompt
Design and audit project/domain quotas across Nova, Neutron, Cinder, and Placement — fix drifted usage counters, tier quotas sensibly, and stop 'Quota exceeded' errors that don't match real consumption.
- Claude
- ChatGPT
Open prompt - Advanced
Keystone Federation (SAML/OIDC) Debug Prompt
Diagnose Keystone federation — SAML IdP setup, OIDC integration, mapping rules, group mapping, federated user provisioning.
- Claude
- ChatGPT
Open prompt - Intermediate
OpenStack Security Hardening Prompt
Harden OpenStack — service-to-service TLS, secure RBAC, secure defaults audit, CVE response, audit logging.
- Claude
- ChatGPT
Open prompt - Advanced
TripleO / Kolla-Ansible Deployment Debug Prompt
Diagnose OpenStack deployment failures with TripleO (overcloud) or Kolla-Ansible (containerized) — Heat stack failures, container crashes, configuration issues.
- Claude
- ChatGPT
Open prompt - Intermediate
Cinder Backup & Restore Workflow Prompt
Design and operate Cinder backup workflows — backend choice (Ceph/Swift/NFS), incremental backups, cross-AZ restore, retention.
- Claude
- ChatGPT
Open prompt - Advanced
Ceph + OpenStack Integration Tuning Prompt
Tune Ceph as storage backend for OpenStack — Glance, Cinder, Nova ephemeral pools; performance tuning, capacity planning, snapshot/clone semantics.
- Claude
- ChatGPT
Open prompt - Advanced
Masakari Instance HA Debug Prompt
Diagnose Masakari instance HA — host monitor not triggering, instance evacuation not happening, process monitor issues, reserved host strategy.
- Claude
- ChatGPT
Open prompt - Advanced
Nova Live Migration Troubleshooting Prompt
Diagnose Nova live migration failures — shared storage requirements, block migration, network bandwidth, CPU compatibility, error 'migration aborted'.
- Claude
- ChatGPT
Open prompt - Intermediate
Gnocchi & Ceilometer Metering Debug Prompt
Diagnose OpenStack telemetry — Ceilometer pollsters, Gnocchi storage, metric aggregation, missing measurements, retention policy.
- Claude
- ChatGPT
Open prompt - Advanced
Galera/MariaDB Recovery for OpenStack Prompt
Recover Galera/MariaDB cluster used by OpenStack — split-brain, single-node bootstrap, WSREP issues, schema changes on busy cluster.
- Claude
- ChatGPT
Open prompt - Intermediate
Heat Stack Failure Diagnosis Prompt
Diagnose Heat orchestration stack create/update/delete failures — template errors, dependency cycles, partial rollback states, resource-level errors.
- Claude
- ChatGPT
Open prompt - Advanced
OpenStack Capacity Planning Prompt
Plan OpenStack capacity — CPU/RAM/disk oversubscription, growth modeling, hypervisor sizing, Cinder backend planning, network bandwidth.
- Claude
- ChatGPT
Open prompt - Intermediate
Neutron QoS & Rate Limiting Design Prompt
Design Neutron QoS policies — bandwidth limits, DSCP marking, minimum guarantee, OVS vs OVN driver differences.
- Claude
- ChatGPT
Open prompt - Advanced
OpenStack Request-ID Log Trace Prompt
Correlate a single API request across services (nova-api → conductor → scheduler → compute → neutron → cinder) using OpenStack request IDs.
- Claude
- ChatGPT
Open prompt - Advanced
OpenStack Upgrade Pre-Flight Review Prompt
Pre-upgrade safety review of an OpenStack cluster moving release N → N+1 — config drift, deprecated options, DB migrations, breaking changes, service ordering.
- Claude
- ChatGPT
Open prompt - Advanced
OVN Control Plane Deep Dive Prompt
Debug OVN control plane — Northbound/Southbound databases, ovn-northd, ovn-controller, logical flows, raft cluster health.
- Claude
- ChatGPT
Open prompt - Intermediate
Barbican Secret Store Management Prompt
Manage Barbican secrets — secret/container/order model, HSM backend, key rotation, ACLs, Octavia integration.
- Claude
- ChatGPT
Open prompt - Advanced
Nova Scheduler Filter Analysis Prompt
Diagnose why VMs aren't landing on hosts — review scheduler filters, weighers, host aggregates, placement allocations, and capacity.
- Claude
- ChatGPT
Open prompt - Advanced
Ironic Bare Metal Deployment Debug Prompt
Diagnose Ironic deploys — node stuck in cleaning, deploy fails, PXE/iPXE issues, BMC unreachable, RAID config not applied.
- Claude
- ChatGPT
Open prompt - Intermediate
Keystone Token & Policy Audit Prompt
Audit Keystone configuration, role assignments, and policy.yaml for auth failures, token expiration issues, and excessive privilege.
- Claude
- ChatGPT
Open prompt - Advanced
Cinder Volume Troubleshooting Prompt
Diagnose stuck volumes, failed attachments, and backend issues (Ceph/LVM/iSCSI/NFS) in OpenStack Cinder using CLI output and service logs.
- Claude
- ChatGPT
Open prompt - Advanced
Swift Object Storage Ring Management Prompt
Manage Swift rings — add/remove nodes, rebalance, replication health, partition power, dispersion.
- Claude
- ChatGPT
Open prompt - Intermediate
Manila Shared Filesystem Troubleshooting Prompt
Diagnose Manila share issues — share stuck creating, access rules not applying, mount fails, backend (CephFS / NFS / GlusterFS) errors.
- Claude
- ChatGPT
Open prompt - Advanced
Neutron Networking Debug Prompt
Diagnose Neutron networking failures — unreachable VMs, broken security groups, missing floating IPs, OVS/OVN flow issues — from CLI output and agent logs.
- Claude
- ChatGPT
Open prompt - Advanced
Magnum Kubernetes Cluster Debug Prompt
Diagnose Magnum K8s cluster creation/scale failures — cluster template, COE driver, Heat stack interaction, node not joining, certificate issues.
- Claude
- ChatGPT
Open prompt - Intermediate
Designate DNS-as-a-Service Troubleshooting Prompt
Diagnose Designate issues — zones not propagating, BIND/PowerDNS backend issues, pool config, record updates not visible.
- Claude
- ChatGPT
Open prompt - Advanced
Octavia Load Balancer Troubleshooting Prompt
Diagnose Octavia issues — amphora boot failures, listener/pool/health-monitor misconfig, certificate problems, failover, statistics.
- Claude
- ChatGPT
Open prompt - Intermediate
Glance Image Lifecycle Management Prompt
Manage Glance images — store backends, image signing, format conversion, image cache, multi-store, deletion-protection.
- Claude
- ChatGPT
Open prompt - Advanced
OpenStack VM Troubleshooting Prompt
Diagnose Nova VM boot failures, networking issues, and stuck instances using nova/openstack CLI output.
- Claude
- ChatGPT
Open prompt
Guides
- · 10 min read
OpenStack Error Guide: 'ImageUnacceptable' Cinder volume-from-image failure
Cinder rejecting a volume create with ImageUnacceptable? Diagnose image size vs volume size, format, and virtual-size mismatches for boot-from-volume step by step.
Read guide - · 10 min read
OpenStack Error Guide: 'Floating IP pool not found' external network failure
Allocating a floating IP and hitting Floating IP pool not found or ExternalNetworkNotReachable? Diagnose missing external networks and bad pool names step by step.
Read guide - · 10 min read
OpenStack Error Guide: 'InstanceNotFound' nova-compute manager failure
Nova logging InstanceNotFound during periodic tasks or deletes? Diagnose orphaned database rows, stale local instances, and compute-manager drift step by step.
Read guide - · 10 min read
OpenStack Error Guide: 'Node stuck in clean failed' Ironic provisioning failure
Ironic bare-metal node stuck in clean failed or clean wait? Diagnose failed cleaning steps, ramdisk boot issues, and maintenance recovery step by step.
Read guide - · 10 min read
OpenStack Error Guide: 'Volume group not found' Cinder LVM backend failure
Cinder LVM driver failing with Volume group cinder-volumes not found? Diagnose missing VG, lost physical volumes, and backend recovery step by step.
Read guide - · 10 min read
OpenStack Error Guide: 'Placement 409 Conflict' allocation update failure
Nova logging Placement 409 Conflict on inventory or allocation updates? Diagnose generation conflicts, stale resource providers, and concurrent writes step by step.
Read guide - · 10 min read
OpenStack Error Guide: 'Resource CREATE failed: ResourceInError' Heat stack failure
Heat stack failing with Resource CREATE failed: ResourceInError? Diagnose underlying Nova/Cinder errors, status reason chains, and stack recovery step by step.
Read guide - · 10 min read
OpenStack Error Guide: 'neutron-l3-agent router namespace missing' connectivity loss
L3 router namespace qrouter-<id> missing and tenants lost external connectivity? Diagnose dead l3-agent, OVS bridge gaps, and namespace recreation step by step.
Read guide - · 10 min read
OpenStack Error Guide: 'Could not determine a suitable URL for the plugin' endpoint failure
Keystone client raising Could not determine a suitable URL for the plugin? Diagnose missing endpoints, wrong interface, and bad catalog entries step by step.
Read guide - · 10 min read
OpenStack Error Guide: 'Swift 503 unable to connect to memcached' proxy failure
Swift proxy returning 503 Service Unavailable with unable to connect to memcached? Diagnose dead memcached, wrong proxy config, and token-cache loss step by step.
Read guide - · 9 min read
AI Ops in OpenStack Management: A 2026 Practical Guide
Discover the role of AI Ops in OpenStack management. Learn how to integrate AIOps for better automation and faster incident response in 2026.
Read guide - · 10 min read
OpenStack Error Guide: 'Application credentials cannot request a scope' Keystone Auth Failure
Fix the Keystone 'Application credentials cannot request a scope' error: strip OS_PROJECT scope vars, fix clouds.yaml v3applicationcredential auth, recreate app creds.
Read guide - · 10 min read
OpenStack Error Guide: 'Build of instance aborted' Nova Spawn Failure
Fix the Nova 'Build of instance aborted' error: distinguish aborted vs rescheduled failures and trace network, volume/BDM, and image causes in compute logs.
Read guide - · 10 min read
OpenStack Error Guide: Cinder 'No weighed backends available' Scheduler Failure
Fix Cinder 'No valid host was found. No weighed backends available': revive dead cinder-volume, pass CapacityFilter, and align volume_backend_name.
Read guide - · 10 min read
OpenStack Error Guide: 'Exceeded maximum number of retries' Nova Scheduler Exhaustion
Fix Nova 'Exceeded maximum number of retries' build failures: tune max_attempts, read RetryFilter behavior, and find the real per-host failure behind exhaustion.
Read guide - · 10 min read
OpenStack Error Guide: 'Failed to allocate the network(s)' Nova/Neutron Setup Failure
Fix the Nova 'Failed to allocate the network(s), not rescheduling' error: trace vif plugging timeouts, dead L2 agents, ports stuck DOWN, and Neutron outages.
Read guide - · 10 min read
OpenStack Error Guide: 'Filtering removed all hosts' Nova Filter Chain
Fix Nova 'Filtering removed all hosts' / 'Filter returned 0 hosts': read scheduler debug logs, find which filter zeroed the list, and inspect per-filter counts.
Read guide - · 10 min read
OpenStack Error Guide: nova-compute Service State 'down' / Hypervisor Down
Fix the nova-compute 'down' hypervisor state in OpenStack: diagnose dead services, RabbitMQ drops, clock skew, stale placement, and force-down for evacuation.
Read guide - · 10 min read
OpenStack Error Guide: Nova/Glance 'Failed to download image' on Compute
Fix Nova 'Failed to download image' / Glance store NotFound when nova-compute fetches an image: reachability, Ceph RBD auth, image status, and cache space.
Read guide - · 10 min read
OpenStack Error Guide: 'Something went wrong!' Horizon HTTP 500 Internal Server Error
Fix the Horizon 'Something went wrong!' HTTP 500 error: read Apache logs, repair SECRET_KEY, memcached sessions, static assets, ALLOWED_HOSTS, and keystone endpoints.
Read guide - · 10 min read
OpenStack Error Guide: Nova 'Live Migration failure' / Migration Failed
Fix Nova live migration failures in OpenStack: resolve CPU model mismatches, missing shared storage, firewall-blocked libvirt ports, timeouts, and NUMA pinning.
Read guide - · 10 min read
OpenStack Error Guide: 'No allocation candidates returned' Placement API
Fix the Placement 'No allocation candidates returned' error: reconcile inventory vs usage, traits, host/placement aggregate mismatch, and heal stale allocations.
Read guide - · 10 min read
OpenStack Error Guide: 'No valid host was found' Nova Scheduling Failure
Fix the Nova 'No valid host was found' error: diagnose scheduler filters, placement inventory, allocation ratios, anti-affinity groups, and flavor extra_specs.
Read guide - · 10 min read
OpenStack Error Guide: 'Missed heartbeats from client, timeout: 60 seconds' RabbitMQ Connection Churn
Fix oslo.messaging 'Missed heartbeats from client' and 'AMQP server is unreachable' errors in OpenStack: tune heartbeats, eventlet, firewalls, and RabbitMQ HA.
Read guide - · 10 min read
OpenStack Error Guide: Placement 'Resource provider with uuid not found' / 404
Fix the Placement 'resource provider not found' 404 in OpenStack: repair stale RPs after host renames, mismatched hostnames, orphaned allocations, and re-added nodes.
Read guide - · 10 min read
OpenStack Error Guide: Cinder Volume in 'error' / 'error_deleting' Status
Recover Cinder volumes stuck in error, error_deleting, or error_extending: read cinder-volume logs, fix backend driver faults, and reset-state safely.
Read guide - · 12 min read
Building Production-Ready Magnum Cluster Templates in OpenStack
How to design Magnum cluster templates, node groups, and autoscaler config for production Kubernetes, with AI helping you plan update-safe rolling upgrades.
Read guide - · 11 min read
Designing Heat Nested Stacks and ResourceGroups in OpenStack
How to structure Heat templates with nested stacks and ResourceGroups so updates and scale-downs don't replace the wrong resources, with AI predicting the blast radius.
Read guide - · 11 min read
Hardening Glance Image Import With AI in OpenStack
How to lock down Glance interoperable image import, web-download, and conversion so tenants can't smuggle bad images in, with AI helping you audit the policy.
Read guide - · 11 min read
Integrating Cinder With NetApp and NFS Backends in OpenStack
How to configure Cinder NFS and NetApp ONTAP backends, avoid stale-export and mount-option pitfalls, and use AI to diagnose attach failures that look like Nova bugs.
Read guide - · 11 min read
Migrating Keystone From Fernet to JWS Tokens in OpenStack
A node-by-node runbook for cutting Keystone over from Fernet to JWS tokens without dropping live sessions, with AI used to validate every step before you act.
Read guide - · 11 min read
Nova PCI Passthrough and SR-IOV With AI in OpenStack
How to wire up Nova PCI passthrough and SR-IOV device_spec, flavors, and Placement inventory, using AI to cross-check config against the actual hardware.
Read guide - · 10 min read
OpenStack Error Guide: 'The request you have made requires authentication' (HTTP 401)
Fix OpenStack Keystone HTTP 401 'requires authentication' errors: expired or invalid tokens, wrong credentials, clock skew, bad auth_url, and Fernet key rotation.
Read guide - · 10 min read
OpenStack Error Guide: 'Stack CREATE_FAILED' / 'Resource CREATE failed' (Heat orchestration)
Debug Heat 'Stack CREATE_FAILED' and 'Resource CREATE failed' errors: nested Nova/Neutron/Cinder failures, template and parameter mistakes, quotas, timeouts, and dependencies.
Read guide - · 9 min read
OpenStack Error Guide: 'Image stuck in saving' or 'killed' when uploading to Glance
Glance images stuck in saving or flipping to killed, and uploads that fail? Diagnose store backend capacity, permissions, glance-api workers, checksums, and quota step by step.
Read guide - · 11 min read
OpenStack Error Guide: 'Instance failed to spawn' Nova Stuck in BUILD/spawning
Fix Nova 'Instance failed to spawn' and instances stuck in BUILD/spawning: diagnose libvirt/qemu errors, disk space, VIF plug timeouts, SELinux, and CPU flags.
Read guide - · 11 min read
OpenStack Error Guide: 'MessagingTimeout' oslo.messaging / RabbitMQ Unreachable
Fix oslo.messaging MessagingTimeout and 'AMQP server closed connection' errors in OpenStack: diagnose RabbitMQ down, partitions, firewall to 5672, creds, and queue buildup.
Read guide - · 9 min read
OpenStack Error Guide: 'No more IP addresses available on network' (Neutron IP exhaustion)
Resolve the Neutron 'No more IP addresses available' error: exhausted allocation pools, leaked ports, oversized reservations, small CIDRs, and orphaned VM ports.
Read guide - · 10 min read
OpenStack Error Guide: 'binding_failed' Neutron Port Binding Failed
Fix the Neutron PortBindingFailed / binding_failed error in OpenStack: diagnose ML2 mechanism drivers, dead L2 agents, physnet bridge mappings, and MTU.
Read guide - · 9 min read
OpenStack Error Guide: 'Quota exceeded' for cores, RAM, instances, ports, and floating IPs
Hitting Quota exceeded in Nova or Neutron for cores, RAM, instances, ports, or floating IPs? Diagnose quota limits, leaked resources, usage drift, and reconcile drift step by step.
Read guide - · 10 min read
OpenStack Error Guide: 'Volume stuck in creating' and 'failed to attach volume' in Cinder
Cinder volumes stuck in creating or error state, or failing to attach? Diagnose cinder-volume, backend connectivity, scheduler, and iSCSI/multipath root causes step by step.
Read guide - · 11 min read
Scaling the Swift Proxy Tier With Memcache and AI
How to size Swift proxy nodes, tune memcache and ratelimit, and use AI to attribute 503 storms to the right layer instead of just loosening limits.
Read guide - · 11 min read
Tuning Galera Flow Control for OpenStack Databases
How to read Galera flow-control pauses, size the cluster for OpenStack's write patterns, and use AI to diagnose replication stalls before they freeze the API.
Read guide - · 11 min read
Tuning OVN Gateway Chassis and BFD for L3 Failover in OpenStack
How to size OVN gateway chassis, tune BFD timers, and use AI to verify L3 failover behavior so a leaf failure doesn't blackhole your floating IPs.
Read guide - · 11 min read
Automating OpenStack Workflows with Mistral and AI
Mistral turns multi-step OpenStack operations into versioned, retryable workflows. Here is how I author, debug, and run them — with an AI pairing as my fast junior engineer.
Read guide - · 9 min read
Backup-as-a-Service with OpenStack Freezer and AI
Freezer brings scheduled, multi-tenant backup and restore to OpenStack. Here is how I configure jobs, run restores, and use AI to draft the parts I dare not get wrong.
Read guide - · 10 min read
OpenStack Chargeback and Rating with CloudKitty and AI
CloudKitty turns OpenStack usage into invoices and showback reports. Here is how I configure rating rules, debug missing data, and let AI draft the tricky parts.
Read guide - · 9 min read
Customizing and Debugging OpenStack Horizon with AI
Horizon is the dashboard your users actually see. Here is how I customize it, debug the blank-page failures, and use AI to navigate its Django internals safely.
Read guide - · 9 min read
Deploying the Skyline Dashboard for OpenStack with AI
Skyline is OpenStack's modern, faster alternative to Horizon. Here is how I deploy it, wire it to Keystone, debug the gateway, and let AI handle the config grind.
Read guide - · 10 min read
Managing GPUs and Accelerators with OpenStack Cyborg
Cyborg gives OpenStack a way to manage GPUs, FPGAs, and other accelerators. Here is how I configure device profiles, attach them to instances, and debug with AI help.
Read guide - · 10 min read
Monitoring-as-a-Service with OpenStack Monasca and AI
Monasca delivers scalable, multi-tenant monitoring for OpenStack. Here is how I push metrics, build alarm definitions, and let AI draft expressions without breaking prod.
Read guide - · 11 min read
Root Cause Analysis with OpenStack Vitrage and AI
Vitrage correlates alarms into root causes across your OpenStack cloud. Here is how I configure templates, read the entity graph, and use AI to cut through alarm storms.
Read guide - · 10 min read
Validating OpenStack Clouds with Tempest and AI
Tempest is the integration test suite that proves your OpenStack cloud actually works. Here is how I configure it, triage failures, and let AI read the tracebacks for me.
Read guide - · 11 min read
Cinder Volume Backups and Disaster Recovery in OpenStack
Snapshots aren't backups. Here's how to build a real Cinder backup and DR strategy in OpenStack with incremental backups, restores, and AI-assisted runbooks.
Read guide - · 11 min read
Debugging Neutron Floating IPs and NAT in OpenStack
Floating IPs that don't route, DNAT that silently drops, and SNAT egress failures. Here's how to trace OpenStack L3 NAT through routers and namespaces, with AI help.
Read guide - · 20 min read
How to Build a Production-Ready OpenStack Cloud (2026 Guide)
Build a production-ready OpenStack cloud: HA control plane, Kolla-Ansible as code, TLS, networking, storage, backups, monitoring, and a tested upgrade path.
Read guide - · 12 min read
Nova Host Aggregates, NUMA, and CPU Pinning in OpenStack
Performance-sensitive workloads need NUMA awareness and CPU pinning in Nova. Here's how to configure host aggregates, flavors, and pinning, debugged with AI help.
Read guide - · 10 min read
Rate Limiting and Traffic Shaping with Neutron QoS
Neutron QoS policies cap bandwidth, guarantee minimums, and mark DSCP per port. Here's how to apply and debug OpenStack QoS without throttling the wrong tenant, with AI help.
Read guide - · 11 min read
OpenStack Telemetry and Alarming with Ceilometer and Aodh
Ceilometer collects, Gnocchi stores, and Aodh alarms. Here's how to wire OpenStack telemetry end to end and debug alarms that never fire, with AI help.
Read guide - · 11 min read
Orchestrating NFV with OpenStack Tacker and VNFs
Tacker is OpenStack's VNF manager and NFV orchestrator. Here's how to onboard VNF packages, instantiate VNFs, and debug failed deployments with AI assistance.
Read guide - · 10 min read
Resource Reservation with OpenStack Blazar
Blazar adds reservations to OpenStack so users can book hosts and instances ahead of time. Here's how to set up leases, debug allocation failures, and use AI to plan capacity.
Read guide - · 10 min read
Running Containers Directly on OpenStack with Zun
Zun runs containers as first-class OpenStack resources without a Kubernetes layer. Here's how to deploy, network, and debug Zun capsules with AI assistance.
Read guide - · 11 min read
Troubleshooting Swift Object Storage Replication and 503s
Swift looks simple until a ring goes lopsided or replication stalls. Here's how I diagnose 503s, unbalanced rings, and stuck object replication in OpenStack Swift.
Read guide - · 10 min read
AI-Assisted Glance Image and Instance Boot Failure Troubleshooting
Why instances won't boot from a Glance image — disk formats, image properties, virtio drivers, cloud-init — and how AI speeds up triage without your cloud.
Read guide - · 10 min read
AI-Assisted Keystone Token and Policy Debugging in OpenStack
A practical walkthrough of debugging Keystone tokens, scopes, role assignments, and policy.yaml RBAC with AI help — and why the AI never touches your admin token.
Read guide - · 12 min read
AI-Assisted Neutron Security Group and Port Binding Troubleshooting
Tracing binding_failed ports, ML2 agent gaps, and silent security group drops in Neutron, with AI as a fast assistant that never touches production credentials.
Read guide - · 12 min read
Building an AI-Assisted OpenStack On-Call Workflow
A field-tested on-call workflow for OpenStack that uses AI to triage alert storms and draft writeups, while keeping it firmly out of the production control plane.
Read guide - · 11 min read
Using AI to Debug a Nova Scheduler That Won't Place Instances
A seasoned operator's guide to chasing down Nova NoValidHost errors with AI as a co-pilot: scheduler logs, filters, placement candidates, and flavor extra_specs.
Read guide - · 11 min read
Diagnosing RabbitMQ Queue Buildup and Partitions in OpenStack with AI
How I use AI to triage RabbitMQ queue buildup, network partitions, stale reply queues, and oslo.messaging heartbeat timeouts in OpenStack control planes.
Read guide - · 11 min read
Reading OpenStack Placement Resource Inventories with AI
How to use AI to read and cross-tabulate OpenStack Placement resource provider inventories, spot capacity exhaustion, and verify before you ever act on it.
Read guide - · 11 min read
Recovering Stuck Cinder Volumes and Snapshots with AI Help
How a veteran operator unwinds Cinder volumes wedged in creating, deleting, or attaching states using reset-state carefully, with AI assisting safely.
Read guide - · 12 min read
Writing OpenStack Diagnostic Runbooks with AI Prompt Engineering
A practical guide to prompting an LLM to draft OpenStack triage runbooks: structure, CLI check sequences, log redaction, version control, and human review.
Read guide - · 9 min read
Automating OpenStack with the Python SDK and CLI
Clicking through Horizon doesn't scale. Here's how I automate OpenStack with the openstacksdk, the unified CLI, and clouds.yaml for repeatable, idempotent operations.
Read guide - · 8 min read
Autoscaling Clusters with OpenStack Senlin
Senlin manages homogeneous clusters of nodes with policies for scaling, health, and load balancing. Here's how I use it for real autoscaling on OpenStack.
Read guide - · 8 min read
Instance High Availability with OpenStack Masakari
When a compute node dies, Masakari evacuates its VMs automatically instead of paging you. Here's how I run Masakari in production so a dead host self-heals.
Read guide - · 8 min read
Managing Glance Images at Scale in OpenStack
Image sprawl quietly eats storage and slows boots. Here's how I run Glance at scale — backends, image properties, caching, and a cleanup discipline that holds.
Read guide - · 8 min read
Managing Manila Shared Filesystems in OpenStack
Manila gives OpenStack tenants real shared filesystems — NFS and CIFS that survive instance churn. Here's how I run it in production without the share-server sprawl biting me.
Read guide - · 8 min read
Optimizing Resource Usage with OpenStack Watcher
Watcher is OpenStack's optimization engine — it consolidates VMs, balances load, and saves power. Here's how I drive it in production without it live-migrating my cloud into a wall.
Read guide - · 8 min read
Running Database-as-a-Service with OpenStack Trove
Trove gives tenants self-service databases — MySQL, PostgreSQL, more — with backups and replication. Here's how I run it in production without the guest-agent pain.
Read guide - · 9 min read
Scaling Nova with Cells v2 in OpenStack
Cells v2 lets a single Nova deployment scale to thousands of compute nodes by sharding the database and message queue. Here's how I plan and run a multi-cell cloud.
Read guide - · 9 min read
Setting Up Keystone Federation in OpenStack
Federation lets users log into OpenStack with an external IdP — SAML or OIDC — instead of local Keystone accounts. Here's how I set it up and map identities in production.
Read guide - · 8 min read
Debugging Heat Orchestration Stacks in OpenStack
Stacks stuck in CREATE_FAILED, rollback loops, and dependency hell. Here's how to debug OpenStack Heat templates and recover wedged stacks in production.
Read guide - · 8 min read
Debugging Ironic Bare Metal Provisioning in OpenStack
Nodes stuck in cleaning, PXE that won't boot, and IPMI that lies. Here's how to debug OpenStack Ironic bare metal provisioning in production.
Read guide - · 8 min read
Managing Designate DNS-as-a-Service in OpenStack
Zones stuck in PENDING, pool manager confusion, and records that never propagate. Here's how to run OpenStack Designate DNS in production.
Read guide - · 8 min read
Managing Quotas and Capacity Planning in OpenStack
'No valid host was found', quota drift, and the overcommit math nobody checks. Here's how to manage OpenStack quotas and plan capacity before you run out.
Read guide - · 9 min read
Migrating Neutron to OVN Networking in OpenStack
Why OVN replaces the agent sprawl, how the migration actually works, and how to debug the OVN southbound DB when networking breaks in OpenStack.
Read guide - · 9 min read
Running Kubernetes on OpenStack with Magnum
Cluster templates, stuck CREATE_IN_PROGRESS, and the Cloud Provider OpenStack glue. Here's how to run Magnum-managed Kubernetes in production.
Read guide - · 8 min read
Scaling and Debugging Octavia Load Balancers in OpenStack
Amphorae that won't boot, stuck PENDING_CREATE load balancers, and failover storms. Here's how to run Octavia LBaaS in production without losing sleep.
Read guide - · 8 min read
Securing Secrets with Barbican Key Management in OpenStack
TLS certs, LUKS keys, and the HSM plugin. Here's how to run OpenStack Barbican key management safely and debug it when secrets won't decrypt.
Read guide - · 8 min read
Debugging Keystone Identity and Authentication in OpenStack
401s, token expiry, and role mistakes block every other OpenStack service. Here's how to debug Keystone identity, tokens, and RBAC methodically.
Read guide - · 9 min read
Debugging Neutron Networking in OpenStack
Neutron failures hide behind layers of namespaces, OVS bridges, and security groups. Here's a methodical packet-path approach to debugging OpenStack networking.
Read guide - · 9 min read
Deploying OpenStack with Kolla-Ansible: A Practical Guide
Kolla-Ansible packages OpenStack as containers deployed by Ansible. Here's a practical walkthrough of a clean deployment, the config that matters, and where it bites.
Read guide - · 9 min read
Monitoring OpenStack with Prometheus and Grafana
OpenStack has dozens of moving parts and few useful defaults. Here's a practical Prometheus monitoring stack for OpenStack — exporters, key alerts, and SLOs that matter.
Read guide - · 9 min read
Planning OpenStack Upgrades Safely Without Downtime
OpenStack upgrades fail on the boring details: DB migrations, RPC version pinning, and ordering. Here's a battle-tested plan for upgrading without taking the cloud down.
Read guide - · 8 min read
Troubleshooting Cinder Block Storage in OpenStack
Stuck volumes, failed attachments, and phantom 'in-use' states are the daily reality of Cinder. Here's how to diagnose and recover OpenStack block storage safely.
Read guide - · 9 min read
Troubleshooting Nova Compute Failures in OpenStack
When an OpenStack instance won't boot, the error is rarely where you first look. Here's a field-tested order for tracing Nova compute failures from API to hypervisor.
Read guide - · 8 min read
Troubleshooting Live Migration in OpenStack
Live migration keeps instances running during maintenance — until it stalls or fails. Here's how to diagnose Nova live migration across CPU, storage, and network.
Read guide - · 8 min read
Troubleshooting RabbitMQ in OpenStack
RabbitMQ is OpenStack's nervous system, and when it backs up the whole cloud stalls. Here's how to diagnose queue backlogs, partitions, and stuck consumers.
Read guide