Interview Istio & Service Mesh

What is a common cause of 503 errors in Istio, and how do you diagnose it?

Istio & Service Mesh · Intermediate level

Answer

A common cause of 503 in Istio is that Envoy has no healthy upstream endpoints or no valid route to the selected subset. It can also come from mTLS mismatch, outlier ejection, gateway routing errors, or upstream connection failures.

Technical explanation

If a VirtualService routes to subset v2 but DestinationRule labels do not match any pods, Envoy can return 503.

Proxy access-log flags help narrow the class of issue: NR for no route, UF for upstream failure, UH for no healthy upstream, and RBAC for denied requests.

Always compare Kubernetes endpoints with Envoy endpoints.

Hands-on example

Diagnosis:

$ kubectl get endpoints backend -n app

$ istioctl proxy-config endpoints deploy/frontend -n app | grep backend

$ istioctl proxy-config route deploy/frontend -n app | grep backend

$ kubectl logs deploy/frontend -c istio-proxy -n app --tail=200

Fix labels, subsets, readiness, or TLS policy based on the missing piece.

Preparing for an interview?

Check how well your resume matches the role with our free resume checker— match score, ATS check, and the skills you're missing.

More Istio & Service Mesh interview questions

← All Istio & Service Mesh questions