
Discussion: Unifying versions for helm and router #80

Open

gaocegege opened this issue Feb 7, 2025 · 11 comments
Labels
question Further information is requested

Comments

@gaocegege
Collaborator

Currently, the stack version in the router is hard-coded at this line.

The Helm chart has its own version management. Should we maintain separate versions for different components, or just use the same version across the board?

gaocegege added the question label Feb 7, 2025
@gaocegege
Collaborator Author

gaocegege commented Feb 7, 2025

I really think we should steer clear of using the latest tag (lmcache/lmstack-router:latest) in the Helm chart. It always pulls from the Docker registry, which can take extra time, and it makes troubleshooting tricky since we won't know exactly which commit the user is running.

Related to #74
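
As a side note, pinning a tag also changes Kubernetes' default pull behavior. A minimal sketch (the tag value here is illustrative):

      containers:
      - name: router
        # With :latest (or no tag), Kubernetes defaults imagePullPolicy to
        # Always and contacts the registry on every pod start. A pinned tag
        # defaults to IfNotPresent, so a cached image is reused.
        image: lmcache/lmstack-router:v0.0.6
        imagePullPolicy: IfNotPresent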

@YuhanLiu11
Collaborator

Good question! @ApostaC what do you think?

@ApostaC
Collaborator

ApostaC commented Feb 7, 2025

@gaocegege Do you know what the industry best practice is for this?

My current feeling is that we should automate the router Docker image build and upload it to the GitHub container registry, with the version ID defined in the router (or maybe the commit ID) as the tag.

But I'm not sure how best to maintain compatibility between the router and the Helm chart.

@gaocegege
Collaborator Author

We usually keep the Helm chart and router in separate repos. The Helm chart repo gets published via GitHub Pages (there's a handy guide in the Helm docs here), so users can easily pull and use the chart, with the help of a Helm chart operator on K8s (example here) or just via the CLI:

helm repo add vllm-production-stack https://production-stack.github.io/charts

The image tags for the deployed applications will use .Chart.AppVersion by default:

      containers:
      - name: router
        image: "{{ .Values.image }}:{{ default .Chart.AppVersion .Values.imageTag }}"

In this case, the router could use the chart's appVersion, while the chart itself uses the chart version. This setup accommodates different deployment cycles, since the Helm chart may change more or less often than the router.

This approach is pretty common in the industry, but I’m not sure it’s the best fit for us since we’re also deploying things like lmcache and other components, each with their own versions. We might want to skip using AppVersion and just define imageTag for each component in the values instead. That way, we can keep versioning separate for each piece, even if it means bumping the Helm chart version more often. Just a thought.
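
For example, a sketch of per-component tags in values.yaml (the key names and versions are assumptions, not the chart's actual layout):

    routerSpec:
      repository: lmcache/lmstack-router
      tag: v0.0.6          # pinned; avoid "latest"
    lmcacheSpec:
      repository: lmcache/lmcache
      tag: v0.1.0          # versioned independently of the router

The template would then render {{ .Values.routerSpec.repository }}:{{ .Values.routerSpec.tag }} directly instead of falling back to .Chart.AppVersion.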

@gaocegege
Collaborator Author

Here’s what we could do:

  • Turn this repo into a Helm Chart Repository
    • Set up GitHub Pages to host the charts
    • Add a workflow to automate Helm chart releases (see the sketch below)
    • Pin the router’s image tags to specific versions
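
A minimal sketch of the release workflow using helm/chart-releaser-action, assuming the charts live under a helm/ directory (the path and trigger are assumptions):

    # .github/workflows/helm-release.yml (sketch)
    name: Release Charts
    on:
      push:
        branches: [main]
    jobs:
      release:
        runs-on: ubuntu-latest
        steps:
          - uses: actions/checkout@v4
            with:
              fetch-depth: 0   # chart-releaser needs the full tag history
          - name: Configure Git
            run: |
              git config user.name "$GITHUB_ACTOR"
              git config user.email "$GITHUB_ACTOR@users.noreply.github.com"
          - name: Run chart-releaser
            uses: helm/chart-releaser-action@v1
            with:
              charts_dir: helm   # assumed chart location
            env:
              CR_TOKEN: "${{ secrets.GITHUB_TOKEN }}"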

@gaocegege
Collaborator Author

I strongly feel we should introduce a Kubernetes Custom Resource Definition (CRD) to handle the complexity if we add more components like lmcache. At that point, managing configurations with just a Helm chart becomes too cumbersome, and we'll need to write code to handle the logic and orchestration properly.
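
To make that concrete, here's a hypothetical custom resource an operator could reconcile; the API group, kind, and fields below are all invented for illustration:

    apiVersion: production-stack.vllm.ai/v1alpha1   # hypothetical API group
    kind: VLLMStack                                 # hypothetical kind
    metadata:
      name: example
    spec:
      router:
        image: lmcache/lmstack-router
        tag: v0.0.6        # each component pins its own version
      lmcache:
        enabled: true
        tag: v0.1.0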

@ApostaC
Collaborator

ApostaC commented Feb 9, 2025

> We usually keep the Helm chart and router in separate repos. The Helm chart repo gets published via GitHub Pages (there's a handy guide in the Helm docs here), so users can easily pull and use the chart, with the help of a Helm chart operator on K8s (example here) or just via the CLI:
>
> helm repo add vllm-production-stack https://production-stack.github.io/charts
>
> The image tags for the deployed applications will use .Chart.AppVersion by default:
>
>       containers:
>       - name: router
>         image: "{{ .Values.image }}:{{ default .Chart.AppVersion .Values.imageTag }}"
>
> In this case, the router could use the chart's appVersion, while the chart itself uses the chart version. This setup accommodates different deployment cycles, since the Helm chart may change more or less often than the router.
>
> This approach is pretty common in the industry, but I'm not sure it's the best fit for us since we're also deploying things like lmcache and other components, each with their own versions. We might want to skip using AppVersion and just define imageTag for each component in the values instead. That way, we can keep versioning separate for each piece, even if it means bumping the Helm chart version more often. Just a thought.

Thanks for the suggestion!

The Helm chart is already hosted on GitHub Pages at https://vllm-project.github.io/production-stack/index.yaml, and a new release is created every time the version is bumped.

Since the current repo is under vllm-project, I'd prefer not to split it into multiple repos for now, as that may add extra logistical complexity.

@0xThresh
Contributor

Since the router component and model component(s) have separate services and deployments, and the router code is independent of the Helm chart, I would say that having separate versioning makes sense. For Open WebUI we have two separate charts with separate versioning, one for the actual Open WebUI deployment and another for an optional add-on called Pipelines: https://github.com/open-webui/helm-charts/tree/main/charts

We then add the Pipelines chart as an optional dependency to the main Open WebUI chart to allow people to easily deploy and manage it as part of their OWUI chart. I don't think the separate chart for the router is necessary here, just throwing that out there as an example.
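
For reference, the optional-dependency pattern described above looks roughly like this in the parent Chart.yaml (the versions and repository URL are illustrative):

    apiVersion: v2
    name: open-webui
    version: 5.0.0          # chart version, bumped independently
    dependencies:
      - name: pipelines
        version: ">=0.0.1"
        repository: https://open-webui.github.io/helm-charts   # assumed URL
        condition: pipelines.enabled   # makes the sub-chart optional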

@mikeengland

Out of interest, why is the router container stored under the lmcache project as lmcache/lm-router? This confused me at first, as the router is part of this vllm repository, and not lmcache!

Was this just done for convenience?

+1 on pinning a router version, with the option to override it via a helm variable.
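
For instance, a user could keep the chart's pinned default and override it only when needed via a values file (the routerSpec.tag key is an assumption):

    # override-values.yaml, passed via `helm install -f override-values.yaml ...`
    routerSpec:
      tag: v0.0.7   # overrides the chart's pinned default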

@ApostaC
Collaborator

ApostaC commented Feb 11, 2025

> Out of interest, why is the router container stored under the lmcache project as lmcache/lm-router? This confused me at first, as the router is part of this vllm repository, and not lmcache!
>
> Was this just done for convenience?

Yeah, it's just for convenience right now before we find a better place to host the docker image.

@gaocegege
Collaborator Author

Currently, our Docker image action re-pushes the image with the same tag if the router version in setup.py remains unchanged, which is not common practice; versioned images should be immutable.

https://github.com/vllm-project/production-stack/blob/main/.github/workflows/router-docker-release.yml#L45

I suggest tagging pull-request builds with the Git commit SHA, and only incrementing the version during the release process.
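
A sketch of per-commit tagging with docker/metadata-action; the image name and trigger are assumptions, and the registry login step is omitted:

    # .github/workflows/router-docker-pr.yml (sketch)
    name: Router image (per-commit)
    on:
      pull_request:
    jobs:
      build:
        runs-on: ubuntu-latest
        steps:
          - uses: actions/checkout@v4
          - id: meta
            uses: docker/metadata-action@v5
            with:
              images: lmcache/lmstack-router   # assumed image name
              tags: |
                type=sha   # tags with the short commit SHA, e.g. sha-1a2b3c4
          - uses: docker/build-push-action@v6
            with:
              context: .
              push: true
              tags: ${{ steps.meta.outputs.tags }}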

gaocegege changed the title from "Discussion: Unifying versions for helm and router, or not" to "Discussion: Unifying versions for helm and router" on Feb 14, 2025