Commit 69db1a2

Merge pull request #72 from ghd3v/main
adding required provider; keeping default as vllm but adding sample a…
2 parents 0145ebb + de15d06 commit 69db1a2

1 file changed: +13 -1 lines changed

use-cases/adding_reasoning_to_llama_3/cot_tools_config.yaml

Lines changed: 13 additions & 1 deletion
@@ -1,11 +1,23 @@
 # Custom configuration for tool use Chain of Thought enhancement
+llm:
+  # Provider selection: "vllm" or "api-endpoint"
+  provider: "vllm"
 
 vllm:
   api_base: "http://localhost:8000/v1"
   model: "unsloth/Meta-Llama-3.3-70B-Instruct"
   max_retries: 3
   retry_delay: 1.0
 
+# API endpoint configuration
+api-endpoint:
+  api_base: "https://api.llama.com/v1" # Optional base URL for API endpoint (null for default API)
+  api_key: "llama_api_key" # API key for API endpoint or compatible service (can use env var instead)
+  model: "Llama-4-Maverick-17B-128E-Instruct-FP8" # Default model to use
+  max_retries: 3 # Number of retries for API calls
+  retry_delay: 1.0 # Initial delay between retries (seconds)
+  sleep_time: 0.5 # Small delay in seconds between batches to avoid rate limits
+
 generation:
   temperature: 0.2 # Lower temperature for more consistent reasoning
   top_p: 0.95
@@ -14,7 +26,7 @@ generation:
 # The most important part - our custom Chain of Thought prompt
 prompts:
   cot_enhancement: |
-    You are a highly intelligent AI with an IQ of 170, and your job is to enhance existing conversation examples. Remember to return the entire conversation as is, BUT
+    You are a highly intelligent AI with an IQ of 170, and your job is to enhance existing conversation examples. Remember to return the entire conversation as is,
     BUT, we will add Chain of Thought and planning to "Assistant" messages whenever they return a tool call.
     Remember, ONLY when an assistant message returns a tool call will we add thinking and reasoning traces before it to add logic. Otherwise, we don't touch the conversation history.
     Remember to return the entire message, but only enhance the assistant messages whenever a tool is called in the conversation by adding thoughts.
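
The new `llm.provider` key selects which of the two sections (`vllm` or `api-endpoint`) supplies the connection settings. The sketch below shows one way a consumer of this config might resolve that choice. It is an illustration under stated assumptions, not the tool's actual loader: the function name `resolve_provider_settings`, the environment variable name `EXAMPLE_LLM_API_KEY`, the fallback to `vllm` when `llm.provider` is absent, and the rule that the environment variable overrides the file value are all hypothetical. The config is written as a Python dict mirroring the YAML so the example runs without a YAML parser.

```python
import os
from typing import Any, Dict, Mapping, Optional

# Hypothetical name: this commit does not say which env var the real tool reads.
API_KEY_ENV_VAR = "EXAMPLE_LLM_API_KEY"

SUPPORTED_PROVIDERS = ("vllm", "api-endpoint")

# Mirrors the llm / vllm / api-endpoint sections of cot_tools_config.yaml
# as they stand after this commit.
CONFIG: Dict[str, Any] = {
    "llm": {"provider": "vllm"},
    "vllm": {
        "api_base": "http://localhost:8000/v1",
        "model": "unsloth/Meta-Llama-3.3-70B-Instruct",
        "max_retries": 3,
        "retry_delay": 1.0,
    },
    "api-endpoint": {
        "api_base": "https://api.llama.com/v1",
        "api_key": "llama_api_key",
        "model": "Llama-4-Maverick-17B-128E-Instruct-FP8",
        "max_retries": 3,
        "retry_delay": 1.0,
        "sleep_time": 0.5,
    },
}


def resolve_provider_settings(
    config: Mapping[str, Any],
    env: Optional[Mapping[str, str]] = None,
) -> Dict[str, Any]:
    """Return the settings of the section named by llm.provider."""
    env = os.environ if env is None else env

    # Assumption: fall back to "vllm" when the llm section is missing.
    provider = config.get("llm", {}).get("provider", "vllm")
    if provider not in SUPPORTED_PROVIDERS:
        raise ValueError(
            f"Unknown provider {provider!r}; expected one of {SUPPORTED_PROVIDERS}"
        )
    if provider not in config:
        raise KeyError(f"Config has no {provider!r} section")

    settings = dict(config[provider])  # copy so the caller's config is untouched
    settings["provider"] = provider

    # Assumption: a non-empty env var takes precedence over the key in the file.
    if provider == "api-endpoint":
        env_key = env.get(API_KEY_ENV_VAR)
        if env_key:
            settings["api_key"] = env_key
    return settings
```

With the committed default (`provider: "vllm"`), the function returns the local vLLM settings and no API key. Switching the value to `"api-endpoint"` returns the hosted endpoint settings, including `sleep_time`. Passing `env` explicitly keeps the function easy to test without touching the real process environment.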
