Skip to content
Discussion options

You must be logged in to vote

Supported in the latest release 2.5.5

It's not efficient as it is with simple agents, because the initial retrieval process must be executed outside of the async request, but it makes finally possible to parallelize the inference.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by joecharm
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants