What is the difference between a request and a limit for CPU versus memory?

Question

Accepted Answer

A request is what the scheduler reserves for a container when it places the Pod on a node; a limit is the ceiling enforced at runtime. What happens at that ceiling differs by resource.

CPU is compressible. The CPU request drives scheduling and sets the container's relative share of CPU time when the node is contended. A container that reaches its CPU limit is throttled: it keeps running, only more slowly.

Memory is not compressible. The memory request drives scheduling, but a container that exceeds its memory limit cannot be slowed down to fit, so its process is OOM-killed and the container restarts according to the Pod's restart policy.

A common production pattern is to set a memory limit with careful headroom above observed peak usage, and to set CPU requests without strict CPU limits for latency-sensitive services, so that throttling does not add tail latency.

Treat health checks and resources as production controls, not just YAML fields: wrong settings cause outages, noisy restarts, bad rollouts, or wasted capacity. Requests affect scheduling and node capacity planning; readiness affects traffic; liveness affects restart behavior. Validate settings with real load, startup timing, memory profiles, and deployment rollout behavior.
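
The pattern described above could look roughly like the following manifest. This is a minimal sketch, not a recommended configuration: the Pod name, image, and every number are hypothetical and would need to come from measurements of the real workload.

```yaml
# Hypothetical Pod; the name, image, and all values are illustrative assumptions.
apiVersion: v1
kind: Pod
metadata:
  name: api-example
spec:
  containers:
    - name: app
      image: registry.example.com/api:1.0.0
      resources:
        requests:
          cpu: "500m"      # used for scheduling and for relative CPU share under contention
          memory: "512Mi"  # used for scheduling
        limits:
          memory: "768Mi"  # headroom above the request; exceeding it gets the container OOM-killed
          # No cpu limit is set, so the container can use idle CPU on the node without being throttled.
```

Because requests are set and do not equal limits, a Pod like this falls into the Burstable QoS class. If the container is killed for exceeding its memory limit, `kubectl describe pod api-example` should report `OOMKilled` as the reason for the last terminated state.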

