Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix 3 bit packing regression, fixed #1278 #1280

Merged
merged 10 commits into from
Feb 15, 2025
Merged

fix 3 bit packing regression, fixed #1278 #1280

merged 10 commits into from
Feb 15, 2025

Conversation

CSY-ModelCloud
Copy link
Member

No description provided.

@CSY-ModelCloud
Copy link
Member Author

@Qubitium

@Qubitium Qubitium marked this pull request as draft February 15, 2025 06:54
@Qubitium
Copy link
Collaborator

Need to fix PackableQuantLinear and merge in pack_dtype diffs.

@CSY-ModelCloud CSY-ModelCloud changed the title fix torch's pack() function was removed fix 3 bit packing regression, fixed #1278 Feb 15, 2025
@CSY-ModelCloud CSY-ModelCloud linked an issue Feb 15, 2025 that may be closed by this pull request
@CSY-ModelCloud CSY-ModelCloud marked this pull request as ready for review February 15, 2025 12:53
@Qubitium Qubitium merged commit 52d2c42 into main Feb 15, 2025
11 checks passed
@Qubitium Qubitium deleted the CSY/fix-compile branch February 15, 2025 14:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] 3bit quant and/or inference regression vs AutoGPTQ
2 participants