You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add a way for the user in some way to specify hard caps on tool calls or other possible "overthinking" during agent execution, so that runaway calls do not result in surprise costs for some models or even lower-end models that hyper-focus on tool calls.
Hey @timothycarambat, thanks for the reply and for digging into the issue! I tested the agent with Claude 3.5 Sonnet and Gemini 2.5 Flash, and both worked well, finishing tasks quickly without burning through tokens or getting stuck in loops.
It does seem like 3.7’s behavior is the main culprit. Your idea of a tool iteration cap sounds like a solid fix to stop this kind of overuse with models like 3.7. Maybe something like Gemini’s thinking budget could also work here?
Add a way for the user in some way to specify hard caps on tool calls or other possible "overthinking" during agent execution, so that runaway calls do not result in surprise costs for some models or even lower-end models that hyper-focus on tool calls.
Originally posted by @LinusAurel in #3675
The text was updated successfully, but these errors were encountered: