You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
OTPM rate limits are estimated based on max_tokens at the beginning of each request, and the estimate is adjusted at the end of the request to reflect the actual number of output tokens used. If you’re hitting OTPM limits earlier than expected, try reducing max_tokens to better approximate the size of your completions.
Still seeing rate limits on anthropic because of
https://docs.anthropic.com/en/api/rate-limits
We need to assume
max_tokens
of the model when we send, but can unblock once we get a response@adamoptimizer
Originally posted by @RyanMarten in #373 (comment)
The text was updated successfully, but these errors were encountered: