quantization process takes too long #35

@yyfcc17

During the PTQ process, only 1 GPU is used, even though I set 4 GPUs in the config file (I have 4 GPUs installed).

It takes almost 52 hours to quantize my 8B FLUX model (Step 3 in your README) on an L40S GPU. Is this normal?

Can the PTQ process be accelerated by quantizing blocks in parallel on different GPUs?

Thanks.

Labels: enhancement, question, svdquant
