How would you back up and restore an etcd cluster?

Question

Accepted Answer

For etcd backup, I take a snapshot with etcdctl or the managed provider mechanism, store it securely, and regularly test restore in a non-production cluster. A backup is not trusted until a restore has been rehearsed. A consistent etcd restore usually recreates a cluster from the snapshot rather than merging arbitrary old state into a live cluster. Snapshot encryption, access control, retention, and restore runbooks are as important as the snapshot command. Kubernetes internals follow a watch-and-reconcile model over API objects stored in etcd. Extending Kubernetes safely requires schema validation, idempotent controllers, finalizers, ownership, and observable status conditions. Backup and restore procedures are part of the control-plane design, not an afterthought.

How would you back up and restore an etcd cluster?

Answer

Technical explanation

Hands-on example

More Kubernetes, Docker, Helm & Podman interview questions