How do you debug high tail latency introduced after enabling the mesh?

Accepted Answer

To debug high tail latency after enabling the mesh, I compare before-and-after latency at each hop: client, ingress gateway, source proxy, destination proxy, and application. At each hop I look for retries, connection-pool limits, mTLS CPU cost, DNS issues, telemetry overhead, EnvoyFilter cost, and saturation of downstream dependencies. Tail latency is more often amplified by retries, queueing, or connection limits than by average per-request proxy overhead, so a small shift in the mean can hide a large shift in p99. To separate application latency from proxy-added latency, I use access logs, traces, and metrics from both the source and destination sidecars. I also check for resource throttling on the istio-proxy container; a tight CPU limit can cause sharp p99 latency jumps.
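One way to make the "separate application latency from proxy-added latency" step concrete is to post-process sidecar access logs. The sketch below is only an illustration, not a definitive tool. It assumes JSON-encoded access logs whose records carry `duration` (Envoy's `%DURATION%`, in milliseconds), `upstream_service_time` (the `x-envoy-upstream-service-time` response header), and `response_flags`; those key names are assumptions and should be matched to whatever log format the mesh is actually configured with. The helper names `percentile` and `summarize` are hypothetical.

```python
import json
import math
from collections import Counter


def percentile(values, pct):
    """Nearest-rank percentile; returns None for an empty list."""
    if not values:
        return None
    ordered = sorted(values)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def summarize(log_lines):
    """Summarize total latency, estimated proxy-side overhead, and response flags.

    Assumed record keys (adjust to your access log format):
      duration               total request time seen by this proxy, in ms
      upstream_service_time  time reported for the upstream call, in ms
      response_flags         Envoy response flags, "-" when none are set
    """
    totals, overheads, flags = [], [], Counter()
    for line in log_lines:
        try:
            rec = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip non-JSON lines such as startup messages
        if not isinstance(rec, dict):
            continue

        flag = rec.get("response_flags")
        if flag not in (None, "", "-"):
            flags[flag] += 1

        try:
            total = float(rec["duration"])
            upstream = float(rec["upstream_service_time"])
        except (KeyError, TypeError, ValueError):
            continue  # e.g. the request never reached an upstream host

        totals.append(total)
        # Rough estimate of time spent outside the upstream call at this hop.
        overheads.append(max(0.0, total - upstream))

    return {
        "requests_timed": len(totals),
        "total_p50_ms": percentile(totals, 50),
        "total_p99_ms": percentile(totals, 99),
        "overhead_p50_ms": percentile(overheads, 50),
        "overhead_p99_ms": percentile(overheads, 99),
        "response_flags": dict(flags),
    }


if __name__ == "__main__":
    import sys

    # Reads access log lines from stdin and prints the summary as JSON.
    print(json.dumps(summarize(sys.stdin), indent=2))
```

Running the same summary on logs from the source sidecar and the destination sidecar for one time window gives a per-hop comparison. A large gap between `overhead_p50_ms` and `overhead_p99_ms` suggests queueing or throttling in the proxy rather than a slow application, while response flags such as `UO` (upstream overflow, which Envoy sets when a circuit breaker or connection-pool limit is hit) or `URX` (retry or connect-attempt limit exceeded) point at connection limits and retries.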
