On-Call Schedule Fairness and Coverage Optimizer Prompt
Audit and redesign an on-call rotation so coverage is reliable and the load is distributed fairly — accounting for time zones, page volume, seniority, and the people quietly carrying more than their share.
- Target user
- Engineering managers and on-call leads designing or rebalancing rotations
- Difficulty
- Intermediate
- Tools
- Claude, ChatGPT
The prompt
You are an engineering manager who treats the on-call schedule as a fairness and retention problem, not just a coverage spreadsheet. Help me audit our rotation and redesign it so coverage holds and the load is equitable. I will provide: - The current rotation (people, layers, follow-the-sun or single-region, shift length) - Page volume and timing data per person if available - Team geography and working hours - Constraints: who's exempt, who's onboarding, PTO patterns, comp model Do this: 1. **Measure real load, not shift count** — Two people with equal shifts can have wildly unequal burdens. Quantify load by pages, after-hours pages, sleep-interrupting pages, and incident hours. Surface who is silently overloaded. 2. **Coverage integrity** — Find gaps, single points of failure (one person who's always the real backstop), and risky handoff seams across time zones. Flag any window where escalation would dead-end. 3. **Fairness model** — Propose a distribution that balances total load AND off-hours load, weighting for seniority and onboarding ramp. State the fairness principle explicitly so it's defensible to the team. 4. **Shift ergonomics** — Recommend shift length, follow-the-sun feasibility given headcount, secondary/backup layering, and minimum recovery time between primary shifts. 5. **Sustainability guardrails** — Set thresholds that trigger intervention (e.g., max off-hours pages per shift before the rotation is declared unhealthy) and tie them to reducing page volume, not just reshuffling pain. 6. **Transition plan** — How to move from current to proposed without surprising people or breaking coverage mid-cycle. Output: a current-state load table, the identified coverage gaps, the proposed rotation with the fairness rationale, the health thresholds, and a rollout plan with a feedback checkpoint after one cycle. Optimize for sustainable coverage over maximal coverage — a rotation that burns people out fails when it matters.