Update the installation guidance and README.md #115

Jeffwan · 2024-08-30T08:08:14Z

Pull Request Description

Start to write the installation guidance
Update the README.md feature section

Related Issues

Resolves: #[Insert issue number(s)]

Important: Before submitting, please complete the description above and review the checklist below.

Contribution Guidelines (Expand for Details)

We appreciate your contribution to aibrix! To ensure a smooth review process and maintain high code quality, please adhere to the following guidelines:

Pull Request Title Format

Your PR title should start with one of these prefixes to indicate the nature of the change:

[Bug]: Corrections to existing functionality
[CI]: Changes to build process or CI pipeline
[Docs]: Updates or additions to documentation
[API]: Modifications to aibrix's API or interface
[CLI]: Changes or additions to the Command Line Interface
[Misc]: For changes not covered above (use sparingly)

Note: For changes spanning multiple categories, use multiple prefixes in order of importance.

Submission Checklist

PR title includes appropriate prefix(es)
Changes are clearly explained in the PR description
New and existing tests pass successfully
Code adheres to project style and best practices
Documentation updated to reflect changes (if applicable)
Thorough testing completed, no regressions introduced

By submitting this PR, you confirm that you've read these guidelines and your changes align with the project's contribution standards.

Jeffwan · 2024-08-30T08:08:54Z

README.md

@@ -3,15 +3,26 @@
 Welcome to AIBrix, the foundational building blocks for constructing your own GenAI inference infrastructure. AIBrix offers a cloud-native solution tailored to meet the demands of enterprises aiming to deploy, manage, and scale LLMs efficiently.

 ## Key Features
-TODO
+
+- High density Lora management


/cc @kr11 @brosoul @varungup90 please help review this part. what's the right description? any feedbacks?

@Jeffwan Here are some drafted features about autoscaling, for your reference:

Direct Metrics Access: Optimize scaler responsiveness with direct metrics retrieval from vLLM engine servers, bypassing traditional data pathways.

Multiple Autoscaling Algorithms: Equip your infrastructure with HPA and KPA to ensure scalable and adaptable LLM deployment.

Autoscaling Case Study: Provide actionable insights by examining autoscaling performance from experiments on practical LLM applications.

For detailed features, we can list them in the feature intro section once this #2 is done

here, we just a short description. any comments here?

brosoul · 2024-08-30T08:19:25Z

README.md

+- High density Lora management
+- Intelligent and LLM specific routing strategies
+- LLM tailored pod autoscaler
+- AI runtime sidecar (metrics merge, fast model downloading, admin operations)


fast

If we mention fast here, we may need to provide a comparison of its performance with some tools. But I don't think our current implementation (reusing SDK) has any reason to be faster.
The automatic selection of the number of download threads may be a breakthrough point. @Jeffwan

Great. It's ok that we list some of the expected features here. We can adjust to more concrete words before release

* Update the README.md with features and installations * Comment the controllers needs dependencies

Jeffwan added 2 commits August 30, 2024 15:13

Update the README.md with features and installations

e3f7548

Comment the controllers needs dependencies

39f0f28

Jeffwan commented Aug 30, 2024

View reviewed changes

brosoul reviewed Aug 30, 2024

View reviewed changes

Jeffwan merged commit 7a821c7 into main Sep 4, 2024
2 checks passed

Jeffwan deleted the jiaxin/v0.1.0-rc.0-doc branch September 4, 2024 04:19

gangmuk pushed a commit that referenced this pull request Jan 25, 2025

Update the installation guidance and README.md (#115)

419c200

* Update the README.md with features and installations * Comment the controllers needs dependencies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update the installation guidance and README.md #115

Update the installation guidance and README.md #115

Jeffwan commented Aug 30, 2024 •

edited

Loading

Jeffwan Aug 30, 2024

kr11 Aug 30, 2024

Jeffwan Sep 2, 2024

Jeffwan Sep 2, 2024 •

edited

Loading

brosoul Aug 30, 2024

Jeffwan Sep 2, 2024

Update the installation guidance and README.md #115

Update the installation guidance and README.md #115

Conversation

Jeffwan commented Aug 30, 2024 • edited Loading

Pull Request Description

Related Issues

Pull Request Title Format

Submission Checklist

Jeffwan Aug 30, 2024

Choose a reason for hiding this comment

kr11 Aug 30, 2024

Choose a reason for hiding this comment

Jeffwan Sep 2, 2024

Choose a reason for hiding this comment

Jeffwan Sep 2, 2024 • edited Loading

Choose a reason for hiding this comment

brosoul Aug 30, 2024

Choose a reason for hiding this comment

Jeffwan Sep 2, 2024

Choose a reason for hiding this comment

Jeffwan commented Aug 30, 2024 •

edited

Loading

Jeffwan Sep 2, 2024 •

edited

Loading