What is data sampling or filtering at ingest, and why does it matter for cost? [Intermediate]

Question

Accepted Answer

Filtering and sampling at ingest reduce the amount of data sent to the backend by dropping, transforming, or thinning out events before they are indexed. This matters because high-volume, low-value telemetry drives license, storage, and search costs. Filtering removes events that are not useful, such as successful health checks or repetitive debug logs. Sampling keeps a representative subset of events; it works well for high-volume success events but is risky for rare errors, which a sample can miss entirely. Never sample compliance, security, audit, or error data unless the business has explicitly approved it.
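The rules above can be sketched as a small decision function that runs before events are forwarded. This is a minimal illustration, not any particular vendor's pipeline: the field names (`level`, `category`, `path`, `status`, `trace_id`), the `/healthz` path, and the 10% rate are all assumptions made for the example.

```python
import hashlib

# Hypothetical field values, chosen for illustration only.
PROTECTED_CATEGORIES = {"security", "audit", "compliance"}
PROTECTED_LEVELS = {"ERROR", "CRITICAL"}
SUCCESS_SAMPLE_RATE = 0.10  # assumed rate: keep about 10% of routine events


def is_protected(event: dict) -> bool:
    """Events that must never be filtered or sampled."""
    return (
        event.get("category") in PROTECTED_CATEGORIES
        or event.get("level") in PROTECTED_LEVELS
    )


def is_low_value(event: dict) -> bool:
    """Filtering rule: successful health checks and debug logs."""
    if event.get("path") == "/healthz" and event.get("status") == 200:
        return True
    return event.get("level") == "DEBUG"


def in_sample(key: str, rate: float) -> bool:
    """Deterministic sampling: hash the key to a 64-bit integer and
    keep it if it falls below rate * 2**64."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:8], "big")
    return bucket < rate * 2**64


def should_forward(event: dict, rate: float = SUCCESS_SAMPLE_RATE) -> bool:
    """Return True if the event should be sent on to the backend."""
    if is_protected(event):
        return True   # checked first, so protected data is always kept
    if is_low_value(event):
        return False  # filtered out entirely
    return in_sample(str(event.get("trace_id", "")), rate)
```

Checking the protected rule first means an error or audit event is forwarded even if it would also match a filter. Sampling on a hash of the trace ID, rather than a random draw per event, is a design choice in this sketch: it keeps or drops all events of one trace together and makes the decision repeatable.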
