fix 3 bit packing regression， fixed #1278 #1280

CSY-ModelCloud · 2025-02-15T05:51:10Z

No description provided.

CSY-ModelCloud · 2025-02-15T06:24:28Z

@Qubitium

Qubitium · 2025-02-15T06:55:40Z

Need to fix PackableQuantLinear and merge in pack_dtype diffs.

CSY-ModelCloud added 2 commits February 15, 2025 13:35

fix torch's pack()

ca724ac

allow setting bits & quant backend

c253dde

add 3 bit test

024e167

CSY-ModelCloud force-pushed the CSY/fix-compile branch from da8209d to 024e167 Compare February 15, 2025 06:27

Qubitium marked this pull request as draft February 15, 2025 06:54

fix 3 bit packing in base

7aeefd5

CSY-ModelCloud changed the title ~~fix torch's pack() function was removed~~ fix 3 bit packing regression， fixed #1278 Feb 15, 2025

CSY-ModelCloud linked an issue Feb 15, 2025 that may be closed by this pull request

[BUG] 3bit quant and/or inference regression vs AutoGPTQ #1278

Open

CSY-ModelCloud marked this pull request as ready for review February 15, 2025 12:53

CSY-ModelCloud added 6 commits February 15, 2025 20:54

revert data clone changes

a6a644e

remove 3bits test in q4 cuda

bf4f571

fix error was printed but ignored

8c51cac

add delta windows size

ae18243

update scores

3a34885

fix score

cf34934

Qubitium merged commit 52d2c42 into main Feb 15, 2025
11 checks passed

Qubitium deleted the CSY/fix-compile branch February 15, 2025 14:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix 3 bit packing regression， fixed #1278 #1280

fix 3 bit packing regression， fixed #1278 #1280

CSY-ModelCloud commented Feb 15, 2025

CSY-ModelCloud commented Feb 15, 2025

Qubitium commented Feb 15, 2025

fix 3 bit packing regression， fixed #1278 #1280

fix 3 bit packing regression， fixed #1278 #1280

Conversation

CSY-ModelCloud commented Feb 15, 2025

CSY-ModelCloud commented Feb 15, 2025

Qubitium commented Feb 15, 2025