Skip to content

Commit 27cc813

Browse files
haoruanlianhao
authored andcommitted
add a cpu specific helm chart values file for AgentQnA
Signed-off-by: Ruan Hao <[email protected]>
1 parent 340fa07 commit 27cc813

File tree

2 files changed

+29
-0
lines changed

2 files changed

+29
-0
lines changed

AgentQnA/kubernetes/helm/README.md

+7
Original file line numberDiff line numberDiff line change
@@ -9,3 +9,10 @@
99
export HFTOKEN="insert-your-huggingface-token-here"
1010
helm install agentqna oci://ghcr.io/opea-project/charts/agentqna --set global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN} -f gaudi-values.yaml
1111
```
12+
13+
## Deploy on CPU
14+
15+
```
16+
export HFTOKEN="insert-your-huggingface-token-here"
17+
helm install agentqna oci://ghcr.io/opea-project/charts/agentqna --set global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN} -f cpu-values.yaml
18+
```
+22
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,22 @@
1+
# Copyright (C) 2025 Intel Corporation
2+
# SPDX-License-Identifier: Apache-2.0
3+
4+
tgi:
5+
enabled: false
6+
vllm:
7+
enabled: true
8+
LLM_MODEL_ID: "meta-llama/Meta-Llama-3-8B-Instruct"
9+
extraCmdArgs: ["--max-seq_len-to-capture", "16384", "--enable-auto-tool-choice", "--tool-call-parser", "llama3_json"]
10+
11+
supervisor:
12+
llm_endpoint_url: http://{{ .Release.Name }}-vllm
13+
llm_engine: vllm
14+
model: "meta-llama/Meta-Llama-3-8B-Instruct"
15+
ragagent:
16+
llm_endpoint_url: http://{{ .Release.Name }}-vllm
17+
llm_engine: vllm
18+
model: "meta-llama/Meta-Llama-3-8B-Instruct"
19+
sqlagent:
20+
llm_endpoint_url: http://{{ .Release.Name }}-vllm
21+
llm_engine: vllm
22+
model: "meta-llama/Meta-Llama-3-8B-Instruct"

0 commit comments

Comments
 (0)