From 203a0c1adbaf07025f25774c05ccbb4de89a546a Mon Sep 17 00:00:00 2001 From: Aravinda Kumar <76619616+surprisedPikachu007@users.noreply.github.com> Date: Wed, 25 Sep 2024 17:59:35 +0530 Subject: [PATCH] Update llama.md replaced `--kv_cache_type paged` with `--paged_kv_cache enable` --- docs/llama.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/llama.md b/docs/llama.md index 44dabe89..d1ae08ed 100644 --- a/docs/llama.md +++ b/docs/llama.md @@ -46,7 +46,7 @@ trtllm-build --checkpoint_dir ${UNIFIED_CKPT_PATH} \ --context_fmha enable \ --gemm_plugin float16 \ --output_dir ${ENGINE_PATH} \ - --kv_cache_type paged \ + --paged_kv_cache enable \ --max_batch_size 64 ```