Skip to main content

Rate Limit

RTZR STT OpenAPI controls usage by concurrency rather than time-based QPS. Policies differ for file mode and streaming mode. All limits apply per organization account.

File STT: Concurrency limit

File STT is constrained by a maximum concurrency limit (max number of files processed simultaneously). For example, with a concurrency limit of 10, up to 10 files can be processed at the same time. The 11th request will wait or fail depending on policy.

Response when exceeding the limit

When the limit is exceeded, the API returns HTTP 429 with:

{
"code": "A0002",
"msg": "Too many concurrent requests"
}

Streaming STT: Concurrent channel limit

Streaming STT is constrained by a maximum number of concurrent channels (sessions). For example, with a channel limit of 5, up to 5 concurrent connections can be maintained. The 6th will be rejected.

Limits by plan

Batch STT (file mode)

Plansommerswhisper
Free after signup102
Paid (Basic)205
EnterpriseContact usContact us

Streaming STT

PlanConcurrent channels
Free after signup5
Paid (Basic)20
EnterpriseContact us
tip

For Enterprise, see the Pricing page or contact us to discuss an optimal limit.

Managing limits and best practices

Exponential backoff

If a request fails due to a limit (429) or transient network error, immediate retries may also fail. Use exponential backoff:

  1. After a failure, wait briefly (e.g., 1s) then retry.
  2. On each failure, double the wait time (e.g., 2s, 4s, 8s...).
  3. Stop after reaching max retries or max wait.

Client-side queue

If you must process many files, sending them all at once easily hits the limit. Implement a request queue to keep only up to the allowed concurrency in-flight; push the next file when one completes.