How do you handle rate-limiting cascades in multi-agent pipelines?
We've got a pipeline where agents call external APIs, and when one upstream provider starts throttling, the retry storms from multiple agents amplify the problem into a full cascade. Current approach is exponential backoff with jitter, but agents don't coordinate backoff windows with each other. How are people handling this? Token bucket shared across agents? Circuit breaker at the orchestrator level? Curious what's working in production for others.