
[Feature]: vLLM support #3003

Open
fpaupier opened this issue Nov 27, 2024 · 0 comments
Labels: enhancement (New feature or request)

Comments

@fpaupier

The Feature

[vLLM](https://github.com/vllm-project/vllm) is an LLM serving framework that exposes an OpenAI-compatible API endpoint. How can I get Helicone working with vLLM?
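To illustrate what I'm after, something along these lines would be ideal. This is only a guess at how it could work: the gateway URL and the `Helicone-Target-Url` header are assumptions based on Helicone's generic proxy pattern, not a confirmed setup, and the model name is just whatever the local vLLM server happens to be serving:

```python
# Rough sketch, not a confirmed setup: assumes Helicone's generic gateway
# can forward requests to a self-hosted, OpenAI-compatible vLLM server
# (e.g. one started with `vllm serve meta-llama/Llama-3.1-8B-Instruct --port 8000`).
from openai import OpenAI

client = OpenAI(
    base_url="https://gateway.helicone.ai/v1",  # assumed Helicone gateway URL
    api_key="EMPTY",  # vLLM ignores the key unless it was started with --api-key
    default_headers={
        "Helicone-Auth": "Bearer <HELICONE_API_KEY>",
        # Assumed header telling the gateway where the on-prem vLLM server lives:
        "Helicone-Target-Url": "http://my-vllm-host:8000",
    },
)

response = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",  # whatever model vLLM is serving
    messages=[{"role": "user", "content": "Hello from behind Helicone!"}],
)
print(response.choices[0].message.content)
```

The point being that, since vLLM already speaks the OpenAI API, routing its traffic through Helicone should ideally require nothing more than swapping the base URL and adding headers.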

Motivation, pitch

For on-prem deployments, vLLM is a great option for secure handling of inference data and for full control over the model you use. Given that Helicone can be self-deployed with Docker Compose / K8s, it would be a great complementary service to have on our infrastructure.

Twitter / LinkedIn details

https://www.linkedin.com/in/fpaupier/

fpaupier added the enhancement label on Nov 27, 2024