Rate limits


We limit the number of API requests that can be made to Replicate:

  • You can create predictions at up to 600 requests per minute.
  • You can call all other endpoints at up to 3000 requests per minute.
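
As a rough guide, 600 requests per minute works out to 10 per second (one request every 100 ms), and 3000 per minute to 50 per second (one every 20 ms). Below is a minimal client-side pacing sketch in Python, assuming you want to space requests evenly to stay under a limit. How the limit is measured (for example, a fixed or sliding window) is not stated here, and `Pacer` is a hypothetical helper, not part of any Replicate client library.

```python
import time


class Pacer:
    """Spaces calls at least 60 / per_minute seconds apart (hypothetical helper)."""

    def __init__(self, per_minute, clock=time.monotonic, sleep=time.sleep):
        self.interval = 60.0 / per_minute  # seconds between consecutive requests
        self._clock = clock
        self._sleep = sleep
        self._next_allowed = None  # earliest time the next request may start

    def wait(self):
        """Block until the next request is allowed, then reserve the following slot."""
        now = self._clock()
        if self._next_allowed is not None and now < self._next_allowed:
            self._sleep(self._next_allowed - now)
            now = self._next_allowed
        self._next_allowed = now + self.interval
```

Call `Pacer(600).wait()` on a shared instance before each request that creates a prediction. Even spacing is a conservative choice: it gives up burst capacity in exchange for not having to know how the server counts requests.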

If you hit a limit, you will receive a response with status 429 and a body like:

{"detail":"Request was throttled. Expected available in 1 second."}
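
One way to handle a 429 is to wait for the time suggested in the `detail` message and then retry. The Python sketch below assumes the message always follows the wording of the example above, which this page does not guarantee, so it falls back to a default wait when the text does not match. `send` is a hypothetical callable that performs one request and returns a `(status_code, parsed_json_body)` pair; plug in whichever HTTP client you use.

```python
import re
import time

# Pattern based on the example body above; the exact wording is an assumption.
_WAIT_RE = re.compile(r"Expected available in (\d+(?:\.\d+)?) seconds?")


def parse_wait_seconds(detail, default=1.0):
    """Extract the suggested wait from a throttling message, or return `default`."""
    match = _WAIT_RE.search(detail or "")
    return float(match.group(1)) if match else default


def call_with_retry(send, max_attempts=5, sleep=time.sleep):
    """Call `send()` until it returns a non-429 status or attempts run out.

    `send` is a hypothetical callable returning (status_code, parsed_json_body).
    Returns the last (status_code, body) pair received.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, body = send()
        if status != 429:
            return status, body
        if attempt < max_attempts - 1:
            # Only sleep when another attempt will follow.
            detail = body.get("detail", "") if isinstance(body, dict) else ""
            sleep(parse_wait_seconds(detail))
    return status, body
```

If every attempt is throttled, the function returns the final 429 response so the caller can decide what to do next.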

If you want higher limits, contact us.