Understanding Rate Limits

Each account has a default rate limit of 300 requests per minute (RPM) per model.

Rate limits control how frequently users can make requests to our LLM API within specific time periods. These limits help ensure fair resource distribution and maintain service stability.

- Prevent Abuse: Protect against API misuse and abuse
- Fair Usage: Ensure fair resource distribution
- Stability: Maintain consistent API performance

Rate Limit Details

By default, each account can make up to 300 requests per minute (RPM) to each model. Limits are applied per account, per model.

Best Practices

1. Implement Request Throttling

Add rate limiting in your application code to stay within the 300 RPM limit:

const rateLimiter = new RateLimiter({
  requests: 300,
  period: '1m'
});
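The `RateLimiter` above is illustrative rather than a specific library. A minimal sliding-window version can be sketched as follows (class and method names here are assumptions, not part of any official SDK):

```javascript
// Minimal sliding-window rate limiter sketch. Names are illustrative,
// not an official SDK class.
class RateLimiter {
  constructor({ requests, periodMs }) {
    this.requests = requests; // max requests allowed per window
    this.periodMs = periodMs; // window length in milliseconds
    this.timestamps = [];     // send times within the current window
  }

  // Returns true if a request may be sent now, false otherwise.
  tryAcquire(now = Date.now()) {
    // Drop timestamps that have aged out of the window.
    this.timestamps = this.timestamps.filter((t) => now - t < this.periodMs);
    if (this.timestamps.length >= this.requests) return false;
    this.timestamps.push(now);
    return true;
  }
}

// 300 requests per 60-second window, matching the default limit.
const limiter = new RateLimiter({ requests: 300, periodMs: 60_000 });
```

Before each API call, check `limiter.tryAcquire()` and delay the request when it returns `false`.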
2. Add Exponential Backoff

Implement retry logic with increasing delays:

const backoff = (attempt) => Math.min(1000 * Math.pow(2, attempt), 10000);
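A retry loop built on this backoff formula might look like the sketch below. `sendRequest` is a placeholder for your own HTTP call, not a documented SDK function:

```javascript
// Delay doubles each attempt (1s, 2s, 4s, 8s), capped at 10s.
const backoff = (attempt) => Math.min(1000 * Math.pow(2, attempt), 10000);
const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

// Retries `sendRequest` while it returns HTTP 429, waiting longer each time.
async function withRetries(sendRequest, maxAttempts = 5) {
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    const response = await sendRequest();
    if (response.status !== 429) return response; // not rate-limited; done
    await sleep(backoff(attempt));
  }
  throw new Error("Rate limited: retries exhausted");
}
```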
3. Monitor Usage

Track your API usage through the dashboard.

Handling Rate Limits

When you receive an HTTP 429 (Too Many Requests) error, apply these handling strategies:

  1. Retry Later: Wait for the specified cooldown period
  2. Optimize Requests: Batch operations when possible
  3. Monitor Usage: Track your consumption patterns
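The "retry later" strategy can be sketched like this. It assumes the API sends the standard `Retry-After` header on 429 responses; the exact cooldown mechanism your API uses may differ:

```javascript
// Sketch: respect the server's cooldown on a 429, then retry once.
// Assumes a Fetch-style response object and a standard Retry-After header.
async function handleRateLimited(response, retryFn) {
  if (response.status !== 429) return response; // not rate-limited; pass through
  // Prefer the server-specified cooldown (seconds); fall back to 1s.
  const retryAfterSec = Number(response.headers.get("Retry-After")) || 1;
  await new Promise((resolve) => setTimeout(resolve, retryAfterSec * 1000));
  return retryFn();
}
```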

Increasing Rate Limits

Repeatedly exceeding rate limits may result in temporary account restrictions. Monitor your usage, and request a limit increase if you need more capacity.