AgentQnA - add support for remote server #1900
base: main
Conversation
Signed-off-by: alexsin368 <[email protected]>
Dependency Review: ✅ No vulnerabilities or license issues found. Scanned files: none.
for more information, see https://pre-commit.ci
Could you just create a yaml file that overrides the original compose.yaml instead of creating a new one? A separate file increases maintenance effort, and you probably only change a couple of values for vllm. Please check compose.telemetry.yaml as an example.
If every new feature creates its own compose.yaml, we will soon be overwhelmed with compose.yaml files.
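For illustration, a minimal sketch of the override pattern being suggested, following the compose.telemetry.yaml example: the base compose.yaml stays untouched and a small overlay file only redefines the remote-inference values. The file and variable names below are taken from this PR but the layering itself is an assumption, not the final implementation.

```bash
# Layer a small remote-inference overlay on top of the base file,
# the same way compose.telemetry.yaml is applied as an override.
export model=<name-of-model-card>
export LLM_ENDPOINT_URL=<http-endpoint-of-remote-server>

docker compose -f compose.yaml -f compose_remote.yaml up -d
```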
To run on Xeon with models deployed on a remote server, run with `compose_remote.yaml` instead. Additional environment variables also need to be set.

```bash
export model=<name-of-model-card>
```
What is the difference between this model variable and MODEL_ID in set_env.sh? Why do users need to set the model name in two different places?
For the agent, `model` is the env variable used; there is no LLM_MODEL_ID. EMBEDDING_MODEL_ID and RERANK_MODEL_ID are used by the retriever.
We need to set `model` here to overwrite the original value (`gpt-4o-mini-2024-07-18`) set in set_env.sh. Added a note on this.
```bash
export model=<name-of-model-card>
export LLM_ENDPOINT_URL=<http-endpoint-of-remote-server>
```
you probably need to give an example to explain how it works for Denvr.
I added some notes
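To sketch the kind of Denvr example being asked for (this is an illustration, not the exact notes added in the PR; the URL, model card, and API-key variable below are placeholders):

```bash
# Point the agent at a remotely hosted inference endpoint, e.g. Enterprise Inference on Denvr.
export LLM_ENDPOINT_URL=https://<your-remote-inference-host>   # placeholder endpoint issued by the provider
export model=<name-of-model-card-served-by-the-endpoint>       # must match a model card the endpoint serves
export OPENAI_API_KEY=<your-api-key>                           # assumed auth variable; check set_env.sh for the real name

docker compose -f compose_remote.yaml up -d
```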
Signed-off-by: alexsin368 <[email protected]>
addressed comments
…amples into agentqna-iaas Signed-off-by: alexsin368 <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: alexsin368 <[email protected]>
…amples into agentqna-iaas
The hyperlink check is failing on https://platform.openai.com/api-keys. I checked that it is a valid link; it just requires the user to log in to their OpenAI account. Can we bypass this error?
Pull Request Overview
A pull request to add support for remote server inference, including configuration updates and documentation enhancements.
- Introduces a new docker-compose file with environment variables for remote inference.
- Updates the README with corrected typos and detailed instructions for setting up both OpenAI and remote server inference.
Reviewed Changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.
| File | Description |
|---|---|
| AgentQnA/docker_compose/intel/cpu/xeon/compose_remote.yaml | New configuration file for setting environment variables for remote inference. |
| AgentQnA/README.md | Updated documentation with instructions for configuring remote inference and fixes. |
Comments suppressed due to low confidence (1)
AgentQnA/README.md:225
- [nitpick] The variable name 'model' is generic and may lead to confusion; consider renaming it to something more descriptive, such as MODEL_ID, to clearly indicate its purpose.
export model=<name-of-model-card>
Hi @vikramts @NeoZhangJianyu, the Hyperlink check CI reports an error here. Could you please help resolve it?
@alexsin368 - If the https://platform.openai.com/api-keys URL is valid, then I don't see why we cannot bypass the error. Technically, this is not an error, and we can expect the user to understand that they need to log into their OpenAI account. So I do not see a real problem here.
Description
Add support for using a remote server inference endpoint. Supports Enterprise Inference.
Clean up the README: fix typos and add instructions for setting up the OpenAI endpoint on the WebUI server.
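As a rough illustration of the OpenAI setup the README now covers (assuming the key is passed through an OPENAI_API_KEY environment variable; the exact variable name should be taken from set_env.sh and the README):

```bash
# Generate a key at https://platform.openai.com/api-keys (requires an OpenAI account),
# then export it before bringing the stack up so the WebUI server can reach the OpenAI endpoint.
export OPENAI_API_KEY=<your-openai-api-key>
```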
Related dependent PR: opea-project/GenAIComps#1644
Issues
N/A
Type of change
List the type of change like below. Please delete options that are not relevant.
Dependencies
None
Tests
Tested AgentQnA on the UI and verified that chat completions are working.
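For anyone reproducing the check, a hedged sketch of a chat-completion request against the deployed stack; the port and route are placeholders and should be read from the compose file actually used:

```bash
# Ask the agent a simple question and confirm a completion is returned.
curl -s http://localhost:9090/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"messages": [{"role": "user", "content": "What is OPEA?"}]}'
```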