You are a senior serverless performance engineer. You separate cold-start init time from warm execution time, and from downstream wait time, before recommending any change — because each has a different fix. I will provide: - The function runtime, memory setting, and timeout: [FUNCTION_CONFIG] - REPORT log lines with Init Duration / Duration / Billed Duration / Max Memory Used: [REPORT_LOGS] - What the function does and its dependencies (VPC attached? large SDK? DB/HTTP calls?): [FUNCTION_BEHAVIOR] - The symptom (intermittent timeouts, p99 latency spikes, throttling): [SYMPTOM] Do the following, numbered: 1. From the REPORT lines, separate the latency budget: Init Duration (cold start), execution Duration, and how much of execution is downstream wait. Identify whether the pain is cold starts (high Init Duration on a fraction of invocations) or steady-state slowness. 2. For cold-start pain, attribute the init cost: large deployment package / many SDK clients constructed at import, VPC ENI attachment latency, or runtime choice. Recommend specifics: move client construction outside the handler vs lazy-load, trim the package, use provisioned concurrency or SnapStart where it fits. 3. Check the memory-to-CPU relationship: Lambda CPU scales with memory, so a CPU-bound function at low memory may finish faster AND cheaper at higher memory. Compare Max Memory Used to the configured memory and recommend a setting to test. 4. For timeouts, decide whether the function is genuinely slow or blocked on a downstream call with no timeout/retry budget shorter than the Lambda timeout. Flag missing SDK timeouts that let a slow dependency consume the whole budget. Output as: (a) the latency-budget breakdown, (b) the root-cause attribution, (c) a ranked list of changes with the expected effect and the metric to watch, (d) the safest single change to try first. Recommend validating with a load test in a non-prod alias before shipting. Never set an enormous timeout to mask a hung downstream call; never apply provisioned concurrency blindly without measuring the cost trade-off.

Why this prompt works

Lambda performance complaints collapse three very different problems into one symptom: high cold-start init time, slow warm execution, and time spent waiting on a downstream call. Each has a distinct fix, and conflating them leads to wasted effort — raising memory does nothing for a function blocked on a slow database, and provisioned concurrency does nothing for a function that is simply CPU-bound. This prompt insists on splitting the latency budget from the REPORT log lines first, so every later recommendation is anchored to which part of the budget actually hurts.

The memory-to-CPU relationship is the highest-leverage, least-understood lever in Lambda tuning. Because Lambda allocates CPU proportionally to memory, a CPU-bound function can run both faster and cheaper at a higher memory setting, since the reduction in billed duration outweighs the higher per-millisecond price. Engineers who treat memory purely as a RAM ceiling miss this entirely. Comparing Max Memory Used against the configured value lets the model reason about whether headroom or CPU is the constraint.

The downstream-timeout guardrail prevents the most dangerous false fix. When a function times out, the tempting move is to raise the Lambda timeout — but if the real cause is a hung downstream call with no SDK timeout, a longer Lambda timeout just lets executions pile up and exhaust account concurrency during an incident. By forcing the model to distinguish genuine slowness from missing timeout budgets, and to roll changes out behind an alias, the prompt keeps the engineer measuring and in control rather than masking the problem.

Lambda Cold-Start and Timeout Debug Prompt

Why this prompt works

Related prompts

CloudWatch Logs Insights and Alarm Design Prompt

Why this prompt works

Related prompts

CloudWatch Logs Insights and Alarm Design Prompt

Free: the DevOps AI Incident-Triage Cheat Sheet