Releases · vllm-project/aibrix

What's Changed

Add common project documents and skeleton folders by @Jeffwan in #4
Scaffolding aibrix project using kubebuilder by @Jeffwan in #17
Optimize project layouts by moving controllers to pkg folder by @Jeffwan in #21
Create Lora api and controller by @Jeffwan in #23
Rename LoraAdapter to ModelAdapter by @Jeffwan in #25
Add ModelAdapter API by @Jeffwan in #26
Use better way to set up controller with Manager by @Jeffwan in #27
Initial model adapter controller implementation by @Jeffwan in #32
Add mocked model container for lora adapter fast prototyping by @Jeffwan in #33
[Misc] Add the PR and issues template by @jsw-zorro in #38
[Docs] Add example to run vLLM distributed inference using Ray by @Jeffwan in #39
[Doc] Improve the model adapter mock service by @Jeffwan in #45
[Misc] Simplify the feature/bug/enhancement template. by @jsw-zorro in #48
[Misc] Make model adapter controller e2e work by @Jeffwan in #50
[Docs] A draft version of the contributing guideline document by @kr11 in #47
[Core] Improve model adapter controller by handling existing resources by @Jeffwan in #54
[Feat] Initial Implementation of PodAutoscaler Reconciler by @kr11 in #55
[Docs] Move the sample mocked application to common folder by @Jeffwan in #64
[Misc] Minor refactor the PodAutoscaler codes by @Jeffwan in #68
[Core] Add model router controller by @varungup90 in #57
Add rbac rules in model router by @varungup90 in #71
[bugs] Add autoscaler RBAC to successfully list horizontalpodautoscalers by @kr11 in #72
[Misc] Update license info; Add license check by @happyandslow in #73
add github workflow to lint & test code by @M00nF1sh in #74
[CI] Fix the golang lint issues by @Jeffwan in #77
[CI] fix the failures from make test by @Jeffwan in #80
[Misc] Add code-generator and openapi-gen as dependencies by @Jeffwan in #59
[Misc] Reconcile hpa, kpa and apa separately by @Jeffwan in #83
[feat] Add rpm/tpm extension proc plugin by @varungup90 in #79
Add kpa scale algorithm implementation by @kr11 in #87
Add host override to query specific pod by @varungup90 in #86
[Core] init aibrix runtime framework by @brosoul in #88
Support kpa/apa autoscaling workflow part I by @Jeffwan in #85
Fix Dockerfile Packaging Issues Related to Go Version and Missing Utils by @kr11 in #92
Autoscaling Workflow Enhancement - Part 2 by @kr11 in #94
Add custom CRD clientset by @varungup90 in #97
Autoscaling Workflow Enhancement - Part 3 by @kr11 in #101
[Core] Add Downloader implementation for runtime by @brosoul in #96
Add RayClusterReplicaSet and RayClusterFleet apis by @Jeffwan in #103
Apply crd:maxDescLen=0 in manifest generation by @Jeffwan in #108
Apply filter to objects owned by model adapters by @varungup90 in #111
Add custom cache and interface for model adapter scheduling by @varungup90 in #100
Refactor gateway package by @varungup90 in #112
BatchAPI storage component together with test by @xinchen384 in #104
Update the installation guidance and README.md by @Jeffwan in #115
[CI] Package AI Runtime by @brosoul in #118
Add gateway installation by @varungup90 in #122
[CI] Support container image build and push in CI by @Jeffwan in #120
[CI] Fix nightly image push error by @Jeffwan in #127
[Bug] Fix download bugs during download benchmark by @brosoul in #134
Autoscaling Workflow Enhancement - Part 4: Integrating MetricClient into Autoscaling Workflow by @kr11 in #116
Update make generate by @varungup90 in #132
Model adapter controller improvement and refactor by @Jeffwan in #135
Improve the aibrix installation scripts by @Jeffwan in #141
[CI] Support python package publish by @brosoul in #138
Fix some typo and naming issues by @Jeffwan in #150
Fix gateway bootstrap issues by @varungup90 in #154
Add kubeconfig flag for cache initialization by @varungup90 in #155
Using sphinx to generate html pages for our project static site by @xinchen384 in #153
Add finalizer and handle the model unload requests by @Jeffwan in #152