How do you design for disaster recovery? Explain RTO, RPO, and the DR strategies.

Question

Accepted Answer

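DR design starts with two targets. RTO (Recovery Time Objective) is the maximum acceptable time to restore service after a disruption. RPO (Recovery Point Objective) is the maximum acceptable window of data loss, measured as time back from the failure. Both should be derived from business impact rather than chosen arbitrarily.

The four common strategies are backup-restore, pilot light, warm standby, and multi-site. In that order, each adds readiness, cost, and complexity in exchange for shorter recovery time and less data loss. Backup-restore keeps only backups and rebuilds the environment on demand. Pilot light keeps data replicated and core infrastructure in place, with application capacity switched off until needed. Warm standby runs a scaled-down but functional copy of the workload. Multi-site serves traffic from more than one location at the same time.

Lower RTO/RPO targets require more pre-provisioned capacity, replicated data, automation, and regularly tested runbooks. Deploying resources in multiple AZs is not sufficient on its own: the design should also rest on dependency mapping and failure-mode testing. Stateless compute, resilient data stores, health checks, rollback, and backups are the building blocks, and regular DR exercises such as game days are what prove the workload actually recovers within its targets.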
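The trade-off between targets and strategies can be sketched in a few lines of Python. This is a minimal illustration, not an AWS API: the names `Strategy`, `choose_strategy`, and `worst_case_data_loss` are hypothetical, and the minute thresholds are assumed placeholders that only reflect rough orders of magnitude (hours for backup-restore, tens of minutes for pilot light, minutes for warm standby, near zero for multi-site). Real thresholds should come from measured DR exercises.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Strategy:
    name: str
    best_rto_minutes: float  # fastest recovery this tier is ASSUMED to achieve
    best_rpo_minutes: float  # smallest data-loss window it is ASSUMED to achieve


# Ordered from cheapest to most expensive. The numbers are illustrative
# assumptions for this sketch, not limits published by AWS.
STRATEGIES = [
    Strategy("backup-restore", 240, 60),
    Strategy("pilot light", 30, 10),
    Strategy("warm standby", 5, 1),
    Strategy("multi-site", 0, 0),
]


def choose_strategy(rto_minutes: float, rpo_minutes: float) -> str:
    """Return the cheapest strategy whose assumed capability meets both targets."""
    for s in STRATEGIES:
        if s.best_rto_minutes <= rto_minutes and s.best_rpo_minutes <= rpo_minutes:
            return s.name
    # Unreachable with the table above (multi-site is 0/0), kept as a safe default.
    return STRATEGIES[-1].name


def worst_case_data_loss(backup_interval_minutes: float,
                         backup_duration_minutes: float = 0) -> float:
    """Worst-case data loss for periodic backups, in minutes.

    Simplifying assumption: a backup captures state at its start time and is
    only usable once it finishes, so a failure just before the next backup
    completes loses one interval plus one backup duration of data.
    """
    return backup_interval_minutes + backup_duration_minutes


if __name__ == "__main__":
    print(choose_strategy(rto_minutes=60, rpo_minutes=15))
    print(worst_case_data_loss(60, 10) <= 15)  # does an hourly backup meet a 15-minute RPO?
```

Under these assumed thresholds, a 60-minute RTO with a 15-minute RPO rules out backup-restore, and an hourly backup alone cannot meet a 15-minute RPO, which is why tighter targets push the design toward continuous replication and pre-provisioned capacity.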

