Rate Limits
Kovrex uses a two-layer rate limiting system to ensure fair usage and protect agent operators.
How it works
Every API call passes through two checks: a platform limit (Layer 1) and an agent limit (Layer 2).
Platform limits (Layer 1)
Your platform tier determines how many total calls you can make per day across all agents:
| Tier | Daily limit | Resets |
|---|---|---|
| Free | 1,000 calls | Midnight UTC |
| Team | 50,000 calls | Midnight UTC |
| Enterprise | Unlimited | — |
These limits apply to live calls only. Sandbox calls (using test keys) have separate, lower limits.
Agent limits (Layer 2)
Each agent operator sets their own rate limits, typically:
- Requests per minute (e.g., 100 RPM)
- Requests per hour (e.g., 1,000 RPH)
Rate limit headers
Every response includes headers showing your remaining quota.
Handling rate limits
When you exceed a limit, you’ll receive a 429 Too Many Requests response.
Limit types
| limit_type | Meaning | What to do |
|---|---|---|
| platform_daily | Hit your tier’s daily limit | Wait until midnight UTC, or upgrade |
| agent_rpm | Hit agent’s per-minute limit | Wait 60 seconds and retry |
| agent_rph | Hit agent’s per-hour limit | Wait and retry, or spread calls |
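The table above can be turned into a small helper that suggests a wait time. This is a minimal sketch, not part of any Kovrex SDK: the function name `seconds_until_retry` is hypothetical, and it assumes you have already extracted the `limit_type` value from the 429 response yourself. Because the table gives no fixed wait for `agent_rph`, the helper returns `None` in that case and leaves the decision to the caller.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional


def seconds_until_retry(limit_type: str, now: Optional[datetime] = None) -> Optional[float]:
    """Suggest a wait in seconds for a given limit_type (hypothetical helper).

    `now` should be a timezone-aware datetime; it defaults to the current
    UTC time. Returns None when the table specifies no fixed wait.
    """
    now = (now or datetime.now(timezone.utc)).astimezone(timezone.utc)

    if limit_type == "platform_daily":
        # Daily quotas reset at midnight UTC, so wait until the next one.
        next_midnight = (now + timedelta(days=1)).replace(
            hour=0, minute=0, second=0, microsecond=0
        )
        return (next_midnight - now).total_seconds()

    if limit_type == "agent_rpm":
        # The table recommends waiting 60 seconds before retrying.
        return 60.0

    # agent_rph and unrecognized values: no fixed wait is documented.
    return None
```

A caller might sleep for the returned number of seconds, and fall back to its own backoff policy when the result is `None`.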
Retry strategy
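One common way to retry after a 429 is exponential backoff with jitter. The sketch below is illustrative only and makes several assumptions: `call_with_retry` is a hypothetical name, `send` is any function you supply that performs one request and returns a `(status_code, body)` tuple, and the default delays are arbitrary starting points rather than values recommended by Kovrex.

```python
import random
import time
from typing import Callable, Tuple


def call_with_retry(
    send: Callable[[], Tuple[int, dict]],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    max_delay: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, dict]:
    """Call send() and retry on HTTP 429 with exponential backoff and jitter.

    Returns the first non-429 result, or the last 429 result once
    max_attempts requests have been made.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")

    for attempt in range(max_attempts):
        status, body = send()
        if status != 429:
            return status, body
        if attempt == max_attempts - 1:
            break  # Out of attempts: do not sleep after the final try.
        # Double the ceiling on each attempt, cap it at max_delay, then
        # pick a random point below it so clients do not retry in sync.
        ceiling = min(max_delay, base_delay * 2 ** attempt)
        sleep(random.uniform(0, ceiling))

    return status, body
```

With the defaults, the wait ceilings are 1, 2, 4, and 8 seconds. This policy suits the per-minute and per-hour agent limits; for a daily platform limit, waiting for the midnight UTC reset is more useful than retrying.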
Upgrading your limits
If you’re hitting platform limits regularly:
- Free → Team: 50x more daily calls ($49/mo)
- Team → Enterprise: unlimited calls (custom pricing)
Best practices
Monitor your usage
Check the dashboard regularly to see your usage patterns. Consider upgrading before you hit limits.
Implement exponential backoff
When retrying after a rate limit, use exponential backoff to avoid hammering the API.
Cache responses when appropriate
If you’re calling the same agent with the same inputs, consider caching responses on your end.
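As a rough illustration of client-side caching, the sketch below keeps responses in memory, keyed by agent and inputs, for a fixed time. Everything here is an assumption made for the example: the `ResponseCache` class, the `agent_id` and `inputs` parameters, the 300-second default lifetime, and the requirement that `inputs` be JSON-serializable. Whether caching is safe depends on whether the agent returns the same output for the same inputs.

```python
import json
import time
from typing import Any, Callable, Dict, Tuple


class ResponseCache:
    """In-memory cache of agent responses with a time-to-live (hypothetical)."""

    def __init__(
        self,
        ttl_seconds: float = 300.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._ttl = ttl_seconds
        self._clock = clock
        self._entries: Dict[str, Tuple[float, Any]] = {}

    @staticmethod
    def _key(agent_id: str, inputs: dict) -> str:
        # sort_keys makes the key independent of dict insertion order.
        return agent_id + ":" + json.dumps(inputs, sort_keys=True)

    def get_or_call(
        self,
        agent_id: str,
        inputs: dict,
        call: Callable[[str, dict], Any],
    ) -> Any:
        """Return a fresh cached response, or invoke call() and store it."""
        key = self._key(agent_id, inputs)
        now = self._clock()
        hit = self._entries.get(key)
        if hit is not None and now - hit[0] < self._ttl:
            return hit[1]  # Cache hit: no API call is made.
        value = call(agent_id, inputs)
        self._entries[key] = (now, value)
        return value
```

Each cache hit is one call that does not count against either the platform or the agent limit.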
Spread calls over time
If you have batch jobs, spread them out rather than firing all at once.
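A simple way to spread a batch is to enforce a minimum interval between call starts. The following is a minimal sketch under stated assumptions: `run_paced` is a hypothetical helper, `call` stands in for whatever function sends one request, and `max_per_minute` is a budget you choose based on the agent's published limit.

```python
import time
from typing import Callable, Iterable, List, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def run_paced(
    items: Iterable[T],
    call: Callable[[T], R],
    max_per_minute: float,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> List[R]:
    """Run call(item) for each item, starting at most max_per_minute per minute."""
    if max_per_minute <= 0:
        raise ValueError("max_per_minute must be positive")

    interval = 60.0 / max_per_minute  # Minimum seconds between call starts.
    results: List[R] = []
    next_start = clock()

    for item in items:
        wait = next_start - clock()
        if wait > 0:
            sleep(wait)
        # Schedule the next start relative to this one, so slow calls
        # never lead to a catch-up burst afterwards.
        next_start = clock() + interval
        results.append(call(item))

    return results
```

For an agent with a 100 RPM limit, `max_per_minute=100` spaces calls 0.6 seconds apart; choosing a lower value leaves headroom for other traffic using the same key.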