What is an Auto Scaling Group, and what are the scaling policy types?

Question

Accepted Answer

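An Auto Scaling Group (ASG) keeps a fleet of EC2 instances at a desired capacity between a configured minimum and maximum, replaces instances that fail health checks, and adjusts capacity through scaling policies. The main policy types are target tracking (hold a metric near a target value), step scaling (adjust capacity by amounts that depend on how far a CloudWatch alarm is breached), scheduled scaling (change capacity at set times), and predictive scaling (forecast load from historical data and scale ahead of it).

I prefer metrics that reflect real demand rather than CPU alone. For web services, request count per target is often a better signal than CPU because it follows user demand more directly.

Compute design should balance availability, scaling speed, startup time, instance limits, health checks, and deployment rollback, not just raw instance size. Autoscaling and load balancing only work well when health checks reflect actual readiness and when applications externalize state, so that any instance can be added or terminated without losing data. Cost optimization should be driven by utilization data and by how much interruption, commitment, and architectural change the workload can tolerate.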
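To make the request-count approach concrete, here is a minimal Python sketch. The first function builds the parameters for a target tracking policy on the predefined `ALBRequestCountPerTarget` metric, in the shape the Auto Scaling `PutScalingPolicy` API expects. The second is a deliberately simplified proportional model of how target tracking sizes the group; it is an illustration, not AWS's exact algorithm. The function names, the policy naming scheme, and the example values are assumptions made for this sketch.

```python
import math


def target_tracking_policy_kwargs(asg_name, resource_label, requests_per_target):
    """Build keyword arguments for a target tracking policy on request count.

    resource_label identifies the ALB and target group, in the form
    app/<lb-name>/<lb-id>/targetgroup/<tg-name>/<tg-id>.
    """
    return {
        "AutoScalingGroupName": asg_name,
        # Hypothetical naming scheme, chosen only for this example.
        "PolicyName": f"{asg_name}-request-count-per-target",
        "PolicyType": "TargetTrackingScaling",
        "TargetTrackingConfiguration": {
            "PredefinedMetricSpecification": {
                "PredefinedMetricType": "ALBRequestCountPerTarget",
                "ResourceLabel": resource_label,
            },
            "TargetValue": float(requests_per_target),
        },
    }


def estimate_desired_capacity(current, metric_value, target_value, min_size, max_size):
    """Simplified model: scale capacity in proportion to metric / target,
    then clamp to the group's min and max. Ignores cooldowns, instance
    warm-up, and AWS's more conservative scale-in behaviour."""
    raw = math.ceil(current * metric_value / target_value)
    return max(min_size, min(max_size, raw))


if __name__ == "__main__":
    # Hypothetical group with 4 instances, each receiving 1500 requests
    # against a target of 1000 requests per instance.
    print(estimate_desired_capacity(4, 1500, 1000, min_size=2, max_size=10))
```

With boto3 installed and credentials configured, the dictionary could be passed to `boto3.client("autoscaling").put_scaling_policy(**kwargs)`. Building it separately keeps the policy definition easy to review and test without touching an AWS account.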
