Prometheus cardinality explosion from high-dimensional metrics — how to decide what labels to keep?
Prometheus scraping 200+ pods, each emitting metrics with labels: pod, container, namespace, endpoint, method, status_code, customer_id. Cardinality is ~500k series and growing. Memory usage on Prometheus is 12GB. Recording rules help but don't reduce storage. Which labels are actually worth keeping for alerting vs debugging? Looking for a systematic way to audit label usefulness.