What is the difference between a histogram and a summary, and the trade-offs? [Basic]

Question

Accepted Answer

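A histogram counts observations into cumulative buckets on the client and leaves the quantile calculation to the server: histogram_quantile() estimates a percentile from the bucket counters at query time. A summary calculates quantiles inside the client process and exports the results directly. For each histogram, the client exposes one time series per bucket, such as le="0.5", le="1", and le="+Inf", along with _sum and _count series. Because buckets are plain counters, they can be summed across instances before the quantile is estimated, which is why I usually prefer histograms for service latency in distributed systems. A summary's quantiles can be accurate for a single process, but they cannot be meaningfully averaged or otherwise combined across replicas. The cost of a histogram is that its quantiles are estimates whose accuracy depends on the bucket layout, so bucket choice matters: boundaries should align with user-relevant thresholds and SLO targets.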
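To make the aggregation argument concrete, here is a minimal Python sketch. It is a simplified model of how a quantile can be estimated from cumulative buckets by linear interpolation, not Prometheus's actual implementation. The two replicas, their counts, and the helper names (merge_buckets, histogram_quantile, fraction_within) are made up for illustration.

```python
import math


def merge_buckets(*replicas):
    """Sum cumulative bucket counts per upper bound (le) across replicas."""
    merged = {}
    for replica in replicas:
        for upper_bound, count in replica:
            merged[upper_bound] = merged.get(upper_bound, 0) + count
    return sorted(merged.items())


def histogram_quantile(q, buckets):
    """Estimate the q-quantile (0 < q <= 1) from cumulative (le, count) buckets.

    Simplified model: find the first bucket whose cumulative count reaches
    the target rank, then interpolate linearly inside that bucket. The lowest
    bucket is assumed to start at 0.
    """
    buckets = sorted(buckets)
    total = buckets[-1][1]  # the +Inf bucket holds the total count
    if total == 0:
        return math.nan
    rank = q * total
    lower_bound, lower_count = 0.0, 0
    for upper_bound, count in buckets:
        if count >= rank:
            if math.isinf(upper_bound):
                # No finite upper edge to interpolate towards, so fall back
                # to the highest finite bucket boundary.
                return lower_bound
            width = upper_bound - lower_bound
            return lower_bound + width * (rank - lower_count) / (count - lower_count)
        lower_bound, lower_count = upper_bound, count
    return math.nan


def fraction_within(buckets, threshold):
    """Share of observations <= threshold; threshold must be a bucket boundary."""
    counts = dict(buckets)
    return counts[threshold] / counts[math.inf]


# Hypothetical latency histograms (seconds) from two replicas of one service.
replica_a = [(0.5, 950), (1.0, 1000), (math.inf, 1000)]  # fast, busy replica
replica_b = [(0.5, 10), (1.0, 60), (math.inf, 100)]      # slow, quiet replica

# Histogram approach: sum the buckets first, then estimate one quantile.
merged = merge_buckets(replica_a, replica_b)
p95_merged = histogram_quantile(0.95, merged)

# Summary-style mistake: average quantiles that were computed per replica.
p95_averaged = (
    histogram_quantile(0.95, replica_a) + histogram_quantile(0.95, replica_b)
) / 2

print(f"p95 from merged buckets:       {p95_merged:.3f}")
print(f"average of per-replica p95s:   {p95_averaged:.3f}")
print(f"share of requests under 0.5 s: {fraction_within(merged, 0.5):.3f}")
```

With this data the merged estimate is about 0.925 s, while averaging the per-replica values gives about 0.75 s, because the average ignores that one replica served ten times as many requests. The last line only works because 0.5 is a bucket boundary, which is the practical reason to align buckets with SLO thresholds.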
