I run Triton with TensorRT-LLM. When I send a long text to the LLM, Triton returns a long array of zeros named output_log_probs for every token. If my text is longer than a certain length, the request does not work correctly.
Can you add an option to config.pbtxt that prevents output_log_probs from being sent?