What are retries in Istio, and what are the risks of misconfiguring them?

Istio & Service Mesh · Basic level

Answer

Retries let Envoy automatically retry failed requests under configured conditions. They can improve resilience for transient failures, but misconfigured retries can amplify load, duplicate non-idempotent operations, and create retry storms during incidents.

Technical explanation

Retries are safer for idempotent operations such as GET or other reads than for payments, order creation, or other side-effecting writes, where a retried request may be executed twice.

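Retry attempts must be bounded by timeout budgets and downstream capacity. In Istio, attempts counts retries on top of the original request, so attempts: 2 with perTryTimeout: 300ms can spend up to 3 × 300 ms = 900 ms in tries (plus backoff between them), and the route timeout should leave room for that. Every layer that retries also multiplies the worst-case load on the services below it.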

Retry policies should specify retryOn conditions, attempts, perTryTimeout, and overall route timeout.

Hands-on example

Safe-ish retry example for a read API:

retries:
  attempts: 2
  perTryTimeout: 300ms
  retryOn: gateway-error,connect-failure,refused-stream
timeout: 1s
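
For context, the fragment above sits on an HTTP route inside a VirtualService. The following is a minimal sketch, not a definitive manifest: the resource name, namespace, and the catalog host are hypothetical, the service is assumed to expose only read endpoints, and the apiVersion may differ depending on the Istio release in your cluster.

```yaml
# Sketch only: "catalog-read", "default", and the "catalog" host are
# hypothetical names, and the service is assumed to be read-only.
apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: catalog-read
  namespace: default
spec:
  hosts:
    - catalog            # short name, resolved in the VirtualService's namespace
  http:
    - route:
        - destination:
            host: catalog
      # Overall budget for the request, including all retries.
      timeout: 1s
      retries:
        # Retries on top of the original request: up to 3 tries in total.
        attempts: 2
        # 3 tries x 300ms = 900ms, which fits inside the 1s route timeout.
        perTryTimeout: 300ms
        # Retry only on gateway and connection-level failures.
        retryOn: gateway-error,connect-failure,refused-stream
```

Because this route has no match block, the retry policy applies to every request sent to the host, which is only reasonable if all of its endpoints are safe to repeat.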

Do not blindly apply this to POST /charge. For writes, prefer idempotency keys and explicit application-level retry design.
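
One way to keep mesh-level retries away from a write endpoint is to give it its own route with retries turned off. This is a sketch under assumptions: the payments host, the resource and route names, and the 2s timeout are hypothetical, and it does not replace idempotency keys in the application.

```yaml
# Sketch only: "payments", the route names, and the timeouts are
# hypothetical values chosen for illustration.
apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: payments
spec:
  hosts:
    - payments
  http:
    # Routes are evaluated in order, so the specific match comes first.
    - name: charge-no-retries
      match:
        - uri:
            exact: /charge
          method:
            exact: POST
      route:
        - destination:
            host: payments
      timeout: 2s
      retries:
        attempts: 0      # disable proxy-level retries for this write
    # Catch-all for the remaining endpoints. With no retries block, Istio's
    # default retry policy applies, so set one explicitly if that matters.
    - name: default
      route:
        - destination:
            host: payments
      timeout: 1s
```

Setting attempts: 0 explicitly matters because leaving the retries block out does not necessarily mean no retries; the mesh default policy would still apply to that route.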
