You can set rate limits for each model and API key. See our rate limit configuration guide for detailed instructions.
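
To show what a limit scoped to a model and an API key might look like in practice, here is a minimal sketch of a fixed-window limiter. It assumes a limit applies to each (API key, model) pair and is counted in requests per minute. The names `RateLimiter`, `LIMITS`, `key-a`, `model-small`, and `model-large` are hypothetical and are not taken from the product's configuration format; the configuration guide remains the authoritative reference.

```python
import time
from typing import Callable, Dict, Tuple

# Hypothetical limits: allowed requests per window for each (api_key, model) pair.
LIMITS: Dict[Tuple[str, str], int] = {
    ("key-a", "model-small"): 2,
    ("key-a", "model-large"): 1,
}


class RateLimiter:
    """Fixed-window request counter keyed by (api_key, model)."""

    def __init__(
        self,
        limits: Dict[Tuple[str, str], int],
        default_limit: int,
        window_seconds: float = 60.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._limits = limits
        self._default_limit = default_limit  # assumed fallback for unlisted pairs
        self._window = window_seconds
        self._clock = clock  # injectable so the limiter can be tested without sleeping
        # Maps each pair to (window start time, requests counted in that window).
        self._windows: Dict[Tuple[str, str], Tuple[float, int]] = {}

    def allow(self, api_key: str, model: str) -> bool:
        """Return True and count the request if the pair is under its limit."""
        pair = (api_key, model)
        limit = self._limits.get(pair, self._default_limit)
        now = self._clock()
        start, count = self._windows.get(pair, (now, 0))

        # Start a fresh window once the current one has expired.
        if now - start >= self._window:
            start, count = now, 0

        if count >= limit:
            self._windows[pair] = (start, count)
            return False

        self._windows[pair] = (start, count + 1)
        return True
```

A caller would check `limiter.allow(api_key, model)` before forwarding each request and reject the request when it returns `False`. A fixed window is used here only because it is the simplest scheme to read; the actual service may use a different algorithm, such as a token bucket or a sliding window.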