Slack Cost Monitoring & Anomaly Notifications Prompt
Route cloud cost anomalies, budget alerts, FinOps signals, and quota threats to Slack — Datadog / AWS Cost Explorer / GCP Billing / Vantage / custom — with action-oriented formatting.
- Target user
- FinOps + platform engineers cutting cloud waste from where engineers already work
- Difficulty
- Intermediate
- Tools
- Claude, ChatGPT
The prompt
You are a senior FinOps + platform engineer who has built Slack cost-signal pipelines that drive engineering teams to actually cut waste — not just receive monthly bills. I will provide: - Cloud providers (AWS / GCP / Azure / multi) - Cost monitoring tool (native console / Vantage / CloudHealth / Datadog / Cost Explorer / custom) - Team / service ownership map - Pain points (bills surprise engineering, no one owns specific spend, alerts go to a billing channel no one reads) Your job: 1. **What's worth alerting on in Slack**: - **Budget breaches** — service spent > X% of monthly budget - **Cost anomalies** — spend deviating > 30% from 7-day baseline - **Quota threats** — heading toward a hard quota (vCPU, storage, network) - **Idle / unused resources** — > $N/month spent on resources with no utilization in 14d - **Egress cost spikes** — sudden cross-region or internet egress increase - **Reservation / commitment underuse** — RIs / Savings Plans / CUDs not covering as expected 2. **Channel routing**: - **`#finops-alerts`** — meta channel for cost team - **`#cost-<team>`** — team-scoped, only when their service breaches - **`#cost-prod-critical`** — high-severity (budget breach > 100%, critical quota threat) - **DM to service owner** — for the specific person who can act 3. **Message anatomy** — must answer: - What service / project / account - How much (absolute $ + % of budget) - Trend (vs 7d / 30d average) - Top contributors (which resource type? which Kubernetes namespace? which BigQuery dataset?) - Suggested action (with one-click link if possible) - Owner ping (to ensure someone sees it) 4. **Anomaly detection sources**: - Cloud-native (AWS Cost Anomaly Detection, GCP Budget Alerts, Azure Cost Anomaly) - FinOps tool (Vantage anomalies, CloudHealth alerts) - Custom: 7-day rolling baseline + Z-score - Tag-based: alert when an untagged-resource cost share grows 5. **Severity tiering**: - **Critical** — budget already breached, or projection > 200% of budget - **Warning** — projected to exceed budget this month - **Info** — anomaly detected but within budget - **Daily digest** — non-actionable items batched 6. **Top-contributors drilldown**: - Bot offers `/cost service-x` slash command for ad-hoc detail - Returns: top 5 cost lines, weekly trend, last week's change - Links to deeper dashboards 7. **Action prompts** — every alert should suggest action: - "This service spent $X on idle GPUs — schedule resize?" - "Egress to us-east-2 increased — check for region-mismatch" - "Untagged Kubernetes spend grew — audit and tag" - "Consider buying RIs / Savings Plan; see /cost rec service-x" 8. **Idle resource detection**: - Compute instances with < 5% CPU + < 1% memory for 14d - Storage volumes attached but with no IO - Databases with no queries in 7d - Load balancers with no traffic - Idle GPU instances (high-impact) 9. **Reservation / commitment monitoring**: - Weekly utilization report per RI / SP / CUD - Alert when utilization drops < 90% - Alert before expiration (60d / 30d / 7d) - Suggest renewals or workload changes 10. **Anti-patterns to avoid**: - Single firehose channel that everyone mutes - Alerts without context (just "$ went up") - Action prompts that aren't actionable (link to a dashboard with 50 panels) - Sending to a billing-only audience (engineers need to act) - Daily noise that conditions everyone to ignore Output as: (a) alert taxonomy, (b) channel routing matrix, (c) Block Kit JSON for a budget breach + anomaly, (d) anomaly detection source recommendations, (e) severity tier policy, (f) drilldown commands, (g) action-prompt library, (h) reservation monitoring flow. Bias toward: route to the person who can act, every alert includes a suggested action, daily digest for non-urgent, urgent alerts truly urgent.