How does the Cluster Autoscaler decide to add or remove nodes?

Question

Accepted Answer

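Cluster Autoscaler adds nodes when Pods are unschedulable because no existing node can fit them, and it removes nodes when they are underutilized and their Pods can be safely moved to other nodes. Scale-up is triggered by Pending Pods that the scheduler cannot place, not by high node CPU usage alone: both the fit check and the utilization check are based on Pod resource requests, so a node running hot on actual CPU does not by itself cause a new node to be added. Scale-down requires every Pod on the candidate node to be movable, and it can be blocked by PodDisruptionBudgets (PDBs), local volumes, affinity rules, or system Pods. In both directions the autoscaler must respect PDBs, scheduling constraints, taints, local storage, and the minimum and maximum size of each cloud node group.

Whether a Pod is safe to move depends on what runs it. Kubernetes workload controllers encode different lifecycle guarantees: interchangeable replicas (Deployments), stable identities (StatefulSets), node-local agents (DaemonSets), or finite tasks (Jobs). Storage decisions must align with durability, access mode, zone placement, backup, restore, and failover behavior, because a Pod bound to a zonal volume can only be rescheduled within that zone. Autoscaling should therefore be designed with metrics, scheduling constraints, PDBs, and node capacity together.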
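The decision rules described above can be sketched as a small simulation. This is a minimal illustration, not the autoscaler's actual implementation: every name in it (Pod, Node, unschedulable_pods, should_scale_up, can_remove) is hypothetical, only CPU requests are modeled, the 0.5 utilization threshold is an assumed value, and a single "movable" flag stands in for all the real blockers such as PDBs, local volumes, affinity, and system Pods.

```python
from dataclasses import dataclass, field


@dataclass
class Pod:
    name: str
    cpu_request: float      # requested CPU cores, not actual usage
    movable: bool = True    # False stands in for PDB, local volume, affinity, etc.


@dataclass
class Node:
    name: str
    cpu_allocatable: float
    pods: list = field(default_factory=list)

    def requested(self) -> float:
        return sum(p.cpu_request for p in self.pods)

    def free(self) -> float:
        return self.cpu_allocatable - self.requested()

    def utilization(self) -> float:
        return self.requested() / self.cpu_allocatable


def unschedulable_pods(pending, nodes):
    """Pending Pods whose request fits on no existing node.

    Simplification: each Pod is checked on its own, so competition
    between several pending Pods for the same free capacity is ignored.
    """
    return [p for p in pending
            if not any(n.free() >= p.cpu_request for n in nodes)]


def should_scale_up(pending, nodes, group_max):
    """Scale up only for unschedulable Pods and only below the group maximum."""
    return bool(unschedulable_pods(pending, nodes)) and len(nodes) < group_max


def can_remove(node, nodes, group_min, threshold=0.5):
    """True if the node is underutilized and its Pods fit on the other nodes."""
    if len(nodes) <= group_min:
        return False                      # node group already at its minimum size
    if node.utilization() >= threshold:
        return False                      # not underutilized (by requests)
    if any(not p.movable for p in node.pods):
        return False                      # a blocker pins a Pod to this node
    # First-fit placement of the node's Pods onto the remaining nodes.
    free = {n.name: n.free() for n in nodes if n is not node}
    for pod in sorted(node.pods, key=lambda p: p.cpu_request, reverse=True):
        target = next((name for name, cap in free.items()
                       if cap >= pod.cpu_request), None)
        if target is None:
            return False                  # nowhere to move this Pod
        free[target] -= pod.cpu_request
    return True


node_a = Node("node-a", 4.0, [Pod("web-1", 2.0), Pod("web-2", 1.5)])
node_b = Node("node-b", 4.0, [Pod("batch-1", 0.5)])
nodes = [node_a, node_b]
pending = [Pod("small", 1.0), Pod("big", 4.0)]

print([p.name for p in unschedulable_pods(pending, nodes)])  # ['big']
print(should_scale_up(pending, nodes, group_max=3))          # True
print(can_remove(node_b, nodes, group_min=1))                # True
print(can_remove(node_a, nodes, group_min=1))                # False
```

In this sketch node-a is never a scale-up trigger even though it is 87.5% requested: only the Pod that fits nowhere causes a scale-up. node-b qualifies for removal because its single Pod fits into node-a's remaining 0.5 cores; marking that Pod as not movable, or raising group_min to 2, would block the removal.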

