Hey @lkwq007, we still haven't ported any of the quantization utilities. Quantization APIs might be different in NNX to leverage the ability to do model surgery. I'd imagine having a proper |
I was wondering if there are any examples demonstrating how to use aqt with flax.nnx, specifically for quantizing a pretrained model into a quantized version. Using `nnx.bridge.ToLinen` to wrap the model and then doing the quantization seems to be a feasible approach, but is there a better way to achieve this? Thanks.
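For context, here is a minimal sketch of the kind of weight-only int8 quantization one might apply to each pretrained dense kernel. It uses plain `jax.numpy` only and is not the AQT or Flax NNX API; `quantize_int8` and `quantized_dense` are hypothetical helper names, and the symmetric per-output-channel scheme is an assumption chosen for illustration.

```python
import jax
import jax.numpy as jnp


def quantize_int8(kernel):
    """Symmetric per-output-channel int8 quantization of an (in, out) kernel.

    Hypothetical helper for illustration; not part of AQT or Flax.
    """
    # One scale per output column so each channel uses the full int8 range.
    max_abs = jnp.max(jnp.abs(kernel), axis=0, keepdims=True)
    # Guard against all-zero columns to avoid dividing by zero.
    scale = jnp.where(max_abs > 0, max_abs / 127.0, 1.0)
    q = jnp.clip(jnp.round(kernel / scale), -127, 127).astype(jnp.int8)
    return q, scale


def quantized_dense(x, q_kernel, scale, bias=None):
    """Dense layer that dequantizes the int8 kernel on the fly."""
    y = x @ (q_kernel.astype(x.dtype) * scale.astype(x.dtype))
    return y if bias is None else y + bias


# Stand-in for a pretrained kernel and a batch of inputs.
key_w, key_x = jax.random.split(jax.random.PRNGKey(0))
kernel = jax.random.normal(key_w, (16, 8))
x = jax.random.normal(key_x, (4, 16))

q_kernel, scale = quantize_int8(kernel)
y_ref = x @ kernel                          # full-precision output
y_q = quantized_dense(x, q_kernel, scale)   # output with the int8 kernel
max_err = jnp.max(jnp.abs(y_ref - y_q))     # quantization error to inspect
```

In a model-surgery workflow, the idea would be to swap each dense layer's kernel for the `(q_kernel, scale)` pair and route its forward pass through something like `quantized_dense`; how that maps onto NNX modules or AQT is not covered by this sketch.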