Skip to content

Releases: vllm-project/aibrix

v0.1.0-rc.1

10 Sep 23:30
a60653a
Compare
Choose a tag to compare
v0.1.0-rc.1 Pre-release
Pre-release

What's Changed

  • Add common project documents and skeleton folders by @Jeffwan in #4
  • Scaffolding aibrix project using kubebuilder by @Jeffwan in #17
  • Optimize project layouts by moving controllers to pkg folder by @Jeffwan in #21
  • Create Lora api and controller by @Jeffwan in #23
  • Rename LoraAdapter to ModelAdapter by @Jeffwan in #25
  • Add ModelAdapter API by @Jeffwan in #26
  • Use better way to set up controller with Manager by @Jeffwan in #27
  • Initial model adapter controller implementation by @Jeffwan in #32
  • Add mocked model container for lora adapter fast prototyping by @Jeffwan in #33
  • [Misc] Add the PR and issues template by @jsw-zorro in #38
  • [Docs] Add example to run vLLM distributed inference using Ray by @Jeffwan in #39
  • [Doc] Improve the model adapter mock service by @Jeffwan in #45
  • [Misc] Simplify the feature/bug/enhancement template. by @jsw-zorro in #48
  • [Misc] Make model adapter controller e2e work by @Jeffwan in #50
  • [Docs] A draft version of the contributing guideline document by @kr11 in #47
  • [Core] Improve model adapter controller by handling existing resources by @Jeffwan in #54
  • [Feat] Initial Implementation of PodAutoscaler Reconciler by @kr11 in #55
  • [Docs] Move the sample mocked application to common folder by @Jeffwan in #64
  • [Misc] Minor refactor the PodAutoscaler codes by @Jeffwan in #68
  • [Core] Add model router controller by @varungup90 in #57
  • Add rbac rules in model router by @varungup90 in #71
  • [bugs] Add autoscaler RBAC to successfully list horizontalpodautoscalers by @kr11 in #72
  • [Misc] Update license info; Add license check by @happyandslow in #73
  • add github workflow to lint & test code by @M00nF1sh in #74
  • [CI] Fix the golang lint issues by @Jeffwan in #77
  • [CI] fix the failures from make test by @Jeffwan in #80
  • [Misc] Add code-generator and openapi-gen as dependencies by @Jeffwan in #59
  • [Misc] Reconcile hpa, kpa and apa separately by @Jeffwan in #83
  • [feat] Add rpm/tpm extension proc plugin by @varungup90 in #79
  • Add kpa scale algorithm implementation by @kr11 in #87
  • Add host override to query specific pod by @varungup90 in #86
  • [Core] init aibrix runtime framework by @brosoul in #88
  • Support kpa/apa autoscaling workflow part I by @Jeffwan in #85
  • Fix Dockerfile Packaging Issues Related to Go Version and Missing Utils by @kr11 in #92
  • Autoscaling Workflow Enhancement - Part 2 by @kr11 in #94
  • Add custom CRD clientset by @varungup90 in #97
  • Autoscaling Workflow Enhancement - Part 3 by @kr11 in #101
  • [Core] Add Downloader implementation for runtime by @brosoul in #96
  • Add RayClusterReplicaSet and RayClusterFleet apis by @Jeffwan in #103
  • Apply crd:maxDescLen=0 in manifest generation by @Jeffwan in #108
  • Apply filter to objects owned by model adapters by @varungup90 in #111
  • Add custom cache and interface for model adapter scheduling by @varungup90 in #100
  • Refactor gateway package by @varungup90 in #112
  • BatchAPI storage component together with test by @xinchen384 in #104
  • Update the installation guidance and README.md by @Jeffwan in #115
  • [CI] Package AI Runtime by @brosoul in #118
  • Add gateway installation by @varungup90 in #122
  • [CI] Support container image build and push in CI by @Jeffwan in #120
  • [CI] Fix nightly image push error by @Jeffwan in #127
  • [Bug] Fix download bugs during download benchmark by @brosoul in #134
  • Autoscaling Workflow Enhancement - Part 4: Integrating MetricClient into Autoscaling Workflow by @kr11 in #116
  • Update make generate by @varungup90 in #132
  • Model adapter controller improvement and refactor by @Jeffwan in #135
  • Improve the aibrix installation scripts by @Jeffwan in #141
  • [CI] Support python package publish by @brosoul in #138
  • Fix some typo and naming issues by @Jeffwan in #150
  • Fix gateway bootstrap issues by @varungup90 in #154
  • Add kubeconfig flag for cache initialization by @varungup90 in #155
  • Using sphinx to generate html pages for our project static site by @xinchen384 in #153
  • Add finalizer and handle the model unload requests by @Jeffwan in #152

New Contributors

Full Changelog: https://github.com/aibrix/aibrix/commits/v0.1.0-rc.1