Commit 046db94

Author: Weichao Luo
fix max_new_token.
1 parent 7daf768 commit 046db94

1 file changed, +1 -1 lines changed

lightllm/server/httpserver_for_pd_master/manager.py

Lines changed: 1 addition & 1 deletion
@@ -272,7 +272,7 @@ async def fetch_stream_nixl(
     "node_id": p_start_args["pd_node_id"],
     "ip": p_start_args["host"],
     "rpyc_port": p_start_args["pd_remote_prefill_port"],
-    "max_new_tokens": sampling_params.max_new_tokens - 1,
+    "max_new_tokens": sampling_params.max_new_tokens,
     "pd_master_node_id": self.args.pd_node_id,
 }
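
To make the effect of the one-line change concrete, here is a minimal sketch of the payload before and after the commit. Only the dictionary keys come from the diff. The `SamplingParams` dataclass, the `build_remote_prefill_info` helper, its `subtract_one` flag, and the sample host and port values are hypothetical stand-ins for illustration, not lightllm's actual code.

```python
from dataclasses import dataclass


@dataclass
class SamplingParams:
    # Hypothetical stand-in for the real sampling-params object.
    max_new_tokens: int


def build_remote_prefill_info(p_start_args: dict,
                              sampling_params: SamplingParams,
                              pd_master_node_id: int,
                              subtract_one: bool = False) -> dict:
    """Build the dict shown in the diff (hypothetical helper).

    subtract_one=True mimics the pre-commit value (max_new_tokens - 1);
    the default mimics the post-commit value (max_new_tokens unchanged).
    """
    budget = sampling_params.max_new_tokens
    if subtract_one:
        budget -= 1
    return {
        "node_id": p_start_args["pd_node_id"],
        "ip": p_start_args["host"],
        "rpyc_port": p_start_args["pd_remote_prefill_port"],
        "max_new_tokens": budget,
        "pd_master_node_id": pd_master_node_id,
    }


# Illustrative values only; none of these come from the commit.
p_start_args = {"pd_node_id": 1, "host": "127.0.0.1", "pd_remote_prefill_port": 18861}
params = SamplingParams(max_new_tokens=16)

before = build_remote_prefill_info(p_start_args, params, pd_master_node_id=0, subtract_one=True)
after = build_remote_prefill_info(p_start_args, params, pd_master_node_id=0)
print(before["max_new_tokens"], after["max_new_tokens"])
```

For a request asking for 16 new tokens, the pre-commit expression forwards 15 and the post-commit expression forwards 16. The commit message does not say why the decrement was there originally, so the sketch makes no claim about that.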
