
Support for context in generate endpoint #15

Open · jmont-dev opened this issue Aug 12, 2024 · 2 comments
jmont-dev (Owner) commented Aug 12, 2024

Context tokens are supplied in responses from the generate endpoint. Allow passing these back in as an input to subsequent generations so that responses can span multiple generations.
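For reference, when a generation completes, Ollama's /api/generate endpoint returns a final JSON object whose `context` field holds the token IDs encoding the exchange, roughly like this (abbreviated, placeholder values):

```json
{
  "model": "llama3",
  "done": true,
  "response": "...",
  "context": [1, 15, 2064, 7]
}
```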

@jmont-dev jmont-dev added the enhancement New feature or request label Aug 12, 2024
@jmont-dev jmont-dev added this to the 1.0 Release milestone Aug 12, 2024
@jmont-dev jmont-dev self-assigned this Aug 12, 2024
JG-Adams commented Aug 14, 2024

I figured out how to do it manually.
First, capture the context once the response reports it is done:

```cpp
nlohmann::json Keepcontext;

// When the final chunk of a generation arrives, save its context tokens.
if (response.as_json()["done"] == true) {
    prompt_AI.busy = false;  // application-specific state
    if (response.as_json().contains("context")) {
        Keepcontext = response.as_json()["context"];
    }
}
```
Then include it in the next request:

```cpp
ollama::request request(ollama::message_type::generation);

// Attach the saved context so the next generation continues from it.
if (!Keepcontext.empty()) {
    request["context"] = Keepcontext;
}
```
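Put together, a self-contained version of this workaround might look like the sketch below. It only relies on the pieces shown above (`as_json()`, `ollama::request`, `ollama::message_type::generation`); the `ollama::generate` overloads and the model name are assumptions, not verified against the library.

```cpp
// Minimal end-to-end sketch of the manual-context workaround. Assumed API:
// ollama::generate(model, prompt) returning an ollama::response, and an
// overload accepting a prepared ollama::request.
#include <iostream>
#include <string>
#include "ollama.hpp"

int main() {
    const std::string model = "llama3";  // placeholder model name
    nlohmann::json keep_context;

    // First generation: no context yet; save the returned context tokens.
    ollama::response first = ollama::generate(model, "Why is the sky blue?");
    if (first.as_json().contains("context"))
        keep_context = first.as_json()["context"];

    // Second generation: replay the saved context so the model can refer back.
    ollama::request request(ollama::message_type::generation);
    request["model"] = model;
    request["prompt"] = "Summarize your previous answer in one sentence.";
    if (!keep_context.empty())
        request["context"] = keep_context;

    ollama::response second = ollama::generate(request);  // assumed overload
    std::cout << second.as_json()["response"].get<std::string>() << std::endl;
    return 0;
}
```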

jmont-dev (Owner, Author) commented

This was addressed in #23. You can now include previous responses to provide context when using the generate endpoint. See the sections added to the README on handling context and context length: https://github.com/jmont-dev/ollama-hpp?tab=readme-ov-file#handling-context.
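Based on that description, the built-in usage is presumably along these lines (a sketch only; the exact signature and the `"llama3"` model name are assumptions, so see the linked README section for the authoritative API):

```cpp
#include "ollama.hpp"

int main() {
    // First generation establishes the context.
    ollama::response context = ollama::generate("llama3", "Why is the sky blue?");

    // Passing the previous response lets the next generation build on it.
    ollama::response response = ollama::generate("llama3", "Tell me more about this.", context);
}
```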
