When an LLM call is rate limited, our system detects the rate-limit error and automatically retries the request, so that a transient limit does not trigger a failover.
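
As a rough illustration of this detect-and-retry behavior, the sketch below retries a call with exponential backoff when a rate-limit error is raised. It is not the system's actual implementation: `RateLimitError`, `call_with_retry`, and the parameters `max_retries`, `base_delay`, and `sleep` are hypothetical names chosen for this example, and the backoff schedule is an assumption.

```python
import random
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for a provider's rate-limit (HTTP 429) error."""


def call_with_retry(call, max_retries=3, base_delay=0.5, sleep=time.sleep):
    """Run `call`, retrying with exponential backoff on RateLimitError.

    All names and defaults here are illustrative assumptions.
    """
    for attempt in range(max_retries + 1):
        try:
            return call()
        except RateLimitError:
            if attempt == max_retries:
                # Retries exhausted: re-raise so the caller can fail over.
                raise
            # Delay doubles on each attempt; jitter spreads out
            # clients that were rate limited at the same moment.
            delay = base_delay * (2 ** attempt)
            sleep(delay + random.uniform(0, delay / 2))
```

A caller would wrap the request in a zero-argument function, for example `call_with_retry(lambda: client.complete(prompt))`, where `client.complete` is a placeholder for whatever function issues the LLM request. Only when every retry has failed does the error propagate, which is the point at which a failover would make sense.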