How do you decide which columns to index?

Question

Accepted Answer

I decide indexes from real workload evidence. Before adding anything, I find the most expensive queries in pg_stat_statements, slow query logs, Performance Insights, or traces, then look at their WHERE predicates, JOIN keys, ORDER BY and GROUP BY columns, and any uniqueness rules. I prioritize selective columns and composite indexes built for specific queries, then verify with EXPLAIN ANALYZE. EXPLAIN shows the estimated plan without running the query; EXPLAIN ANALYZE executes the query and reports actual rows and timing next to the planner's estimates. Sequential scans are not always bad; for small tables or low-selectivity filters they may be optimal.

Hands-on example
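
A minimal sketch of the "add the index, then verify the plan" loop described in the answer. It is not PostgreSQL: to stay self-contained it uses Python's built-in sqlite3 module, with SQLite's EXPLAIN QUERY PLAN standing in for EXPLAIN ANALYZE. The orders table, its columns, the index name, and the sample data are all hypothetical.

```python
import sqlite3

# Hypothetical hot query: filter on customer_id, sort by created_at.
QUERY = (
    "SELECT id, total FROM orders "
    "WHERE customer_id = ? ORDER BY created_at DESC LIMIT 10"
)


def plan_details(conn, sql, params):
    """Return the readable 'detail' column of SQLite's EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return [row[3] for row in rows]


def build_demo_db():
    """Create an in-memory table with made-up data (assumption, not real workload)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE orders ("
        "id INTEGER PRIMARY KEY, "
        "customer_id INTEGER NOT NULL, "
        "created_at TEXT NOT NULL, "
        "total REAL NOT NULL)"
    )
    rows = [
        (i % 500, f"2024-01-{(i % 28) + 1:02d}", float(i))
        for i in range(10_000)
    ]
    conn.executemany(
        "INSERT INTO orders (customer_id, created_at, total) VALUES (?, ?, ?)",
        rows,
    )
    return conn


conn = build_demo_db()

# Plan before indexing: expect a full table scan.
before = plan_details(conn, QUERY, (42,))

# Composite index matching the query: equality column first, sort column second.
conn.execute(
    "CREATE INDEX idx_orders_customer_created "
    "ON orders (customer_id, created_at)"
)

# Plan after indexing: expect a search that uses the new index.
after = plan_details(conn, QUERY, (42,))

print("before:", before)
print("after: ", after)
```

The exact plan wording varies by SQLite version, so compare the two printed plans rather than expecting specific text. On PostgreSQL the same check would be done by running EXPLAIN ANALYZE on the query before and after CREATE INDEX and comparing estimated against actual rows and timing.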
