What is the difference between a symptom-based and a cause-based alert, and which is better? [Basic]

Question

Accepted Answer

A symptom-based alert fires on user-visible impact, such as high error rate or missed latency SLO. A cause-based alert fires on a suspected reason, such as CPU high or disk almost full. For paging, symptom-based alerts are usually better; cause alerts are useful for tickets and diagnostics. Symptom alerts are less noisy because they correspond to user pain and require action. Cause alerts can be valuable when a condition will definitely become user-impacting, such as disk full in 30 minutes. Good alerting separates page-worthy symptoms from dashboard or ticket-worthy causes.

What is the difference between a symptom-based and a cause-based alert, and which is better? [Basic]

Answer

Technical explanation

Hands-on example

More Observability interview questions