Skip to content

Commit

Permalink
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
fix dequant 4bit weight
Browse files Browse the repository at this point in the history
jiqing-feng committed Jul 31, 2024
1 parent d385aea commit fbcdf62
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion bitsandbytes/backends/cpu_xpu_common.py
Original file line number Diff line number Diff line change
@@ -484,7 +484,7 @@ def dequantize_4bit_impl(
out_reshaped[n - rem :] = out_dq[n - rem :] * absmax[-1]

# take transpose here because weight is transposed (again) for computation
return out.t()
return out


# Do not need torch.compile here as we are calling torch/ipex kernel

0 comments on commit fbcdf62

Please sign in to comment.