Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Docs] Add info on when to use which PTQ/Sparsification #1157

Merged
merged 4 commits into from
Mar 6, 2025

Conversation

horheynm
Copy link
Collaborator

SUMMARY:
Current README shows which algo we support + how to run. However, to a user it is still hard to understand when to use which. Add more info on based on the users use-case and hardware the optimization to apply.

TEST PLAN:
N/A

Copy link

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.

@horheynm horheynm marked this pull request as ready for review March 6, 2025 20:55
@horheynm horheynm added the ready When a PR is ready for review label Mar 6, 2025
Copy link
Collaborator

@brian-dellabetta brian-dellabetta left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

cool!

@horheynm
Copy link
Collaborator Author

horheynm commented Mar 6, 2025

Next todos will be to add more in detail info in the /docs on (1) why use those methods; (2) what happens in llm compressor when quantizing

@dsikka dsikka enabled auto-merge (squash) March 6, 2025 22:36
@dsikka dsikka merged commit 4607036 into main Mar 6, 2025
8 checks passed
@dsikka dsikka deleted the update-readme-quant branch March 6, 2025 22:59
brian-dellabetta pushed a commit that referenced this pull request Mar 10, 2025
SUMMARY:
Current README shows which algo we support + how to run. However, to a
user it is still hard to understand when to use which. Add more info on
based on the users use-case and hardware the optimization to apply.

TEST PLAN:
N/A

Signed-off-by: Brian Dellabetta <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ready When a PR is ready for review
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants