[FEAT]: Agent `max` limits to prevent #3679

timothycarambat · 2025-04-18T21:18:06Z

Add a way for the user to specify hard caps on tool calls or other possible "overthinking" during agent execution, so that runaway calls do not result in surprise costs with some models, or with lower-end models that hyper-focus on tool calls.

Hey @timothycarambat, thanks for the reply and for digging into the issue! I tested the agent with Claude 3.5 Sonnet and Gemini 2.5 Flash, and both worked well, finishing tasks quickly without burning through tokens or getting stuck in loops.

It does seem like 3.7’s behavior is the main culprit. Your idea of a tool iteration cap sounds like a solid fix to stop this kind of overuse with models like 3.7. Maybe something like Gemini’s thinking budget could also work here?

Originally posted by @LinusAurel in #3675
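
To make the request concrete, here is a rough sketch of how a hard cap could wrap an agent loop. It is not taken from the existing codebase: `AgentLimits`, `LimitTracker`, `run_agent`, and the shape of the `step` callback are all hypothetical names chosen for illustration, and the token cap is only a loose stand-in for the "thinking budget" idea mentioned above.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


class AgentLimitExceeded(RuntimeError):
    """Raised when a run hits a user-defined hard cap (hypothetical)."""


@dataclass
class AgentLimits:
    # Hypothetical user-facing settings; None means "no cap".
    max_tool_calls: Optional[int] = None
    max_tokens: Optional[int] = None


class LimitTracker:
    """Counts tool calls and tokens, and aborts once a cap is exceeded."""

    def __init__(self, limits: AgentLimits) -> None:
        self.limits = limits
        self.tool_calls = 0
        self.tokens = 0

    def record_tokens(self, count: int) -> None:
        self.tokens += count
        cap = self.limits.max_tokens
        if cap is not None and self.tokens > cap:
            raise AgentLimitExceeded(f"token budget of {cap} exceeded")

    def record_tool_call(self) -> None:
        cap = self.limits.max_tool_calls
        # Check before counting, so exactly `cap` tool calls are allowed.
        if cap is not None and self.tool_calls >= cap:
            raise AgentLimitExceeded(f"tool call cap of {cap} reached")
        self.tool_calls += 1


def run_agent(step: Callable[[], Tuple[str, int]], limits: AgentLimits) -> LimitTracker:
    """Drive a hypothetical agent loop.

    `step` is assumed to return (kind, tokens_used), where kind is
    "tool" when the model requests a tool call and "done" when it finishes.
    """
    tracker = LimitTracker(limits)
    while True:
        kind, tokens_used = step()
        tracker.record_tokens(tokens_used)
        if kind == "done":
            return tracker
        tracker.record_tool_call()
```

With something like `AgentLimits(max_tool_calls=3)`, a model that keeps requesting tools would be stopped with `AgentLimitExceeded` on its fourth request instead of looping indefinitely. Whether to abort outright or return a partial answer when a cap is hit is a design choice left open here.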
