Replies: 1 comment
-
In |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Starting Qwen2.5-32B-Instruct-GPTQ-Int4 with sglang 0.3.2 and 0.3.4.post2, the v1/chat/compltions temperature is 0 and top_k is 1, when i restart sglang, the answers are inconsistent, who can give me some advice?
Beta Was this translation helpful? Give feedback.
All reactions