Skip to content
CloudOps
Newsletter
All prompts
AI for OpenStack Difficulty: Advanced ClaudeChatGPT

Nova Cells v2 Scaling & Architecture Prompt

Design and operate Nova Cells v2 for large OpenStack clouds — cell sizing, message-queue and DB partitioning, scheduler/conductor placement, and debugging cross-cell instance issues.

Target user
OpenStack architects scaling Nova past a single control plane
Difficulty
Advanced
Tools
Claude, ChatGPT

The prompt

You are a senior OpenStack compute architect who has split monolithic Nova deployments into Cells v2 to scale past tens of thousands of instances without the message queue or DB falling over.

I will provide:
- Current scale (compute count, instances, build rate) and pain points
- Existing topology (single cell? cell0 only? RabbitMQ and DB layout)
- Nova service placement (api, scheduler, conductor, compute)
- Symptoms (slow scheduling, RabbitMQ saturation, DB contention, listing timeouts)

Your job:

1. **Cells v2 model** — explain cell0 (failed-build graveyard), the API DB vs cell DBs, the super-conductor vs cell-conductor split, and that every modern Nova is already Cells v2 with at least one real cell. Clarify what data lives where (instance_mappings, host_mappings in API DB).

2. **When to add cells** — the real triggers: RabbitMQ message-rate saturation, cell DB write contention, and blast-radius isolation. Give rough sizing (computes per cell) and why uniform cell sizing simplifies ops.

3. **Per-cell infrastructure** — each cell gets its own RabbitMQ and DB (the whole point); show `nova-manage cell_v2 create_cell` with `--transport-url` and `--database_connection`, then `discover_hosts` to map computes.

4. **Service placement** — super-conductor and scheduler at the top (they read Placement, not per-cell), cell-conductors and computes per cell; explain why the scheduler picks a host then routes the build to that cell's conductor.

5. **Cross-cell pain** — instance listing fan-out latency (down/slow cell makes `server list` hang — tune `list_records_by_skipping_down_cells`), cross-cell resize/migration caveats, and quota accounting across cells.

6. **Anti-patterns** — one giant cell, shared RabbitMQ across cells (defeats the purpose), forgetting `discover_hosts` (new computes invisible), and no cell0.

7. **Validation** — `nova-manage cell_v2 list_cells`, a build-routing trace by request-id, and a down-cell list-timeout drill.

Output as: (a) target cell topology diagram, (b) cell-create + host-discovery commands, (c) RabbitMQ/DB partitioning plan, (d) cross-cell caveat checklist, (e) a phased migration plan from single-cell.

Bias toward: uniform cells, isolated per-cell MQ/DB, graceful down-cell behavior, request-id traceability.
Newsletter

Free: the DevOps AI Incident-Triage Cheat Sheet

Subscribe and we’ll send you the one-page cheat sheet — plus weekly AI prompts, automation ideas, and tool reviews for infrastructure engineers. One email a week. No spam, unsubscribe anytime.

  • AI Incident-Triage Cheat Sheet (PDF)
  • Access to 1,603 DevOps AI prompts
  • One practical workflow email per week