What is the blast radius of a cache outage, and how do you make the app degrade gracefully?

Question

Accepted Answer

A cache outage can cause anything from minor latency to full outage. I reduce blast radius with low timeouts, circuit breakers, DB fallback limits, stale-if-safe responses, local last-known-good data, and graceful feature degradation. Cache failure should not automatically become total service failure unless the cache is an intentional primary store. Use low timeouts and circuit breakers to avoid tying up app threads. Degrade non-critical features and protect DB fallback with rate limits.

What is the blast radius of a cache outage, and how do you make the app degrade gracefully?

Answer

Technical explanation

Hands-on example

More Databases & Caching interview questions