Replies: 3 comments
- When calling stream_infer, just set both sequence_start and sequence_end to True.
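A minimal sketch of the stateless call described in this reply, assuming the older TurboMind Python API in which stream_infer accepts session_id, sequence_start and sequence_end; the model path, prompt, tokenizer choice and the remaining argument names are placeholders and may differ between lmdeploy versions:

```python
# Sketch: open and close the session in a single stream_infer call by setting
# sequence_start=True and sequence_end=True, so no session state survives the
# request. Paths, the prompt and most argument names are placeholders.
from lmdeploy import turbomind as tm
from transformers import AutoTokenizer

model_path = '/path/to/turbomind/model'          # placeholder path
tokenizer = AutoTokenizer.from_pretrained(model_path)

tm_model = tm.TurboMind(model_path)              # constructor args vary by version
generator = tm_model.create_instance()

input_ids = tokenizer.encode('Hello, who are you?')

for outputs in generator.stream_infer(session_id=0,
                                      input_ids=[input_ids],
                                      request_output_len=256,
                                      sequence_start=True,
                                      sequence_end=True):
    # the structure of `outputs` differs across lmdeploy releases; in older
    # ones it is a list of (output_token_ids, token_count) tuples
    print(outputs)
```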
- If sequence_end is set to True, multi-turn conversation is effectively turned off. How can I disable the kv cache while keeping multi-turn conversation enabled? When loading a huggingface model, the kv cache can be toggled via the use_cache option. Does lmdeploy have a similar configuration item? @lvhan028
- sequence_end does not mean that multi-turn conversation is disabled. It means the caller (the user) is responsible for concatenating the historical prompts with the current prompt.
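In other words, with sequence_start=True and sequence_end=True on every request, multi-turn chat still works, but the history lives on the caller's side. A hypothetical sketch of that caller-side concatenation (the chat template, helper names and messages below are illustrative, not lmdeploy API):

```python
# Caller-side history management when each request is stateless
# (sequence_start=True, sequence_end=True). The template below is a
# simplified placeholder; real models require their own prompt format.
history = []  # list of (user_msg, assistant_reply) pairs kept by the caller


def build_prompt(history, user_msg):
    """Concatenate all previous turns with the current message into one prompt."""
    parts = []
    for user, assistant in history:
        parts.append(f'User: {user}\nAssistant: {assistant}\n')
    parts.append(f'User: {user_msg}\nAssistant:')
    return ''.join(parts)


# Each turn re-sends the whole conversation, so the engine recomputes
# attention over the full history instead of reusing a cached session.
user_msg = 'What is the capital of France?'
prompt = build_prompt(history, user_msg)
# input_ids = tokenizer.encode(prompt)
# ... generator.stream_infer(..., sequence_start=True, sequence_end=True)
# history.append((user_msg, decoded_reply))  # update for the next turn
```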
- As in the title.