bug: Model not found when enabling vLLM API key #150
Comments
Hey @JustinDuy, we are unable to reproduce your error using your query command. Could you provide details on how you started the vLLM engines?
@YuhanLiu11: I start vllm serve with the VLLM_API_KEY env variable set from a k8s secret.
@YuhanLiu11: have you taken a look at the models endpoint request inside service discovery that I posted above? I just wonder how it works when the vLLM server is secured by an API key (see https://github.com/vllm-project/vllm/blob/main/vllm/entrypoints/openai/api_server.py#L745) and the key itself is not passed through a header. Something like this would make sense: `headers = {"Authorization": f"Bearer {VLLM_API_KEY}"}; response = requests.get(url, headers=headers)`
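A minimal sketch of that suggestion, assuming the key is read from a VLLM_API_KEY environment variable (the function name and surrounding code are illustrative, not the router's actual implementation):

```python
import os

import requests


def list_models(base_url: str) -> list:
    """Illustrative only: fetch /v1/models, sending the API key as a Bearer token."""
    api_key = os.environ.get("VLLM_API_KEY")  # assumed env var name
    headers = {"Authorization": f"Bearer {api_key}"} if api_key else {}
    response = requests.get(f"{base_url}/v1/models", headers=headers, timeout=5)
    response.raise_for_status()
    return response.json().get("data", [])
```

With something along these lines in service discovery, `list_models("http://localhost:8000")` should succeed against a key-protected engine instead of being rejected.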
The core problem is that the k8s service discovery hinges on the model list API. However, there is currently no way to supply an authorization token when accessing the models through this API; it appears the authorization token has to be set manually. Notably, I've observed that the Helm chart has no setting for configuring this token. I put forward two viable solutions:
Yes, this can be a quick fix to this bug 😄, but we'll still need something like what was brought up by @ggaaooppeenngg to let the router be aware of the API key. I can take a stab once I have bandwidth.
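For illustration, one hedged way to picture "making the router aware of the API key" — the `ModelListDiscovery` class, the `--vllm-api-key` flag, and the env-var fallback below are assumptions for the sake of the sketch, not production-stack's actual interface:

```python
import argparse
import os

import requests


class ModelListDiscovery:
    """Hypothetical discovery helper that carries an optional vLLM API key."""

    def __init__(self, backend_urls, api_key=None):
        self.backend_urls = backend_urls
        self.api_key = api_key

    def _headers(self):
        # Attach the key as a Bearer token only when one is configured.
        return {"Authorization": f"Bearer {self.api_key}"} if self.api_key else {}

    def list_models(self):
        # Query every backend's /v1/models endpoint with the shared headers.
        models = {}
        for url in self.backend_urls:
            resp = requests.get(f"{url}/v1/models", headers=self._headers(), timeout=5)
            resp.raise_for_status()
            models[url] = [m["id"] for m in resp.json().get("data", [])]
        return models


if __name__ == "__main__":
    parser = argparse.ArgumentParser()
    # Hypothetical flag; falls back to VLLM_API_KEY, which in k8s could be
    # populated from the same Secret the engines use.
    parser.add_argument("--vllm-api-key", default=os.environ.get("VLLM_API_KEY"))
    parser.add_argument("--backend-urls", nargs="+", default=["http://localhost:8000"])
    args = parser.parse_args()
    discovery = ModelListDiscovery(args.backend_urls, api_key=args.vllm_api_key)
    print(discovery.list_models())
```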
Describe the bug
I am using lmstack-router as a load balancer for my vLLM server. It does not work when I serve the OpenAI-compatible vLLM server with an API key: requests fail with a 404 Unauthorized error. I believe the problem is that the '/v1/models' request in service discovery does not currently send a Bearer token that could be verified against the vLLM server. See https://github.com/vllm-project/production-stack/blob/main/src/vllm_router/service_discovery.py#L136
To Reproduce
Enable the vLLM API key by setting VLLM_API_KEY in the deployment, then call curl after port-forwarding from the k8s service:
curl -X POST http://localhost:30080/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $VLLM_API_KEY" \
  -d '{
    "model": "/model/qwen/Qwen2-VL-7B-Instruct",
    "prompt": "Once upon a time,",
    "max_tokens": 10
  }'
Expected behavior
The router's service_discovery should be able to list all models.
Additional context
No response