Replies: 1 comment
-
@mluerig did you find anything about this issue ? I'm having it right now and wonder the same, "not sure whether this causes any degradation in performance or results" |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm getting these warning during inference with an 8bit quantized InternVL chat model, but I'm not sure whether this causes any degradation in performance or results - can anyone help me find out what might be going on? I'm on Linux with CUDA 12.4 (pasted
conda list
output below)Beta Was this translation helpful? Give feedback.
All reactions