Commit 5b5eb96

Update TensorRT-LLM backend (#352)
* Update TensorRT-LLM backend
1 parent 2c8c6ae commit 5b5eb96

5 files changed: 3 additions, 4 deletions

README.md

Lines changed: 1 addition & 0 deletions
@@ -95,6 +95,7 @@ cd server
 --filesystem=gcs --filesystem=s3 --filesystem=azure_storage \
 --endpoint=http --endpoint=grpc --endpoint=sagemaker --endpoint=vertex-ai \
 --backend=ensemble --enable-gpu --endpoint=http --endpoint=grpc \
+--no-container-pull \
 --image=base,${TRTLLM_BASE_IMAGE} \
 --backend=tensorrtllm:${TENSORRTLLM_BACKEND_REPO_TAG} \
 --backend=python:${PYTHON_BACKEND_REPO_TAG}

all_models/gemma/ensemble/1/.tmp

Lines changed: 0 additions & 1 deletion
@@ -1 +0,0 @@
-
Lines changed: 0 additions & 1 deletion
@@ -1 +0,0 @@
-

tensorrt_llm

Submodule tensorrt_llm updated 232 files

tools/version.txt

Lines changed: 1 addition & 1 deletion
@@ -1 +1 @@
-6dad7bf5c32e5067ac49cc898e434a24759cfd58
+721a579afde43dd2e2037153da244baac6eedd29
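
The hunk above swaps one 40-character hexadecimal string for another, which looks like a pinned Git commit hash. A reviewer might want to sanity-check such a pin before merging. The following is a minimal sketch under that assumption; the helper name `is_full_commit_hash` is hypothetical and is not part of this repository.

```python
import re

# Assumption: tools/version.txt holds one full-length Git commit hash
# (40 lowercase hex digits). The diff does not state this explicitly.
FULL_SHA1 = re.compile(r"[0-9a-f]{40}")


def is_full_commit_hash(text: str) -> bool:
    """Return True if text, ignoring surrounding whitespace, is 40 lowercase hex digits."""
    return FULL_SHA1.fullmatch(text.strip()) is not None


# The two values shown in the hunk above.
old_pin = "6dad7bf5c32e5067ac49cc898e434a24759cfd58"
new_pin = "721a579afde43dd2e2037153da244baac6eedd29"

pin_is_valid = is_full_commit_hash(new_pin)  # the new value is well-formed
pin_changed = old_pin != new_pin             # the commit actually moves the pin
```

A check like this only validates the shape of the value. It does not confirm that the hash exists in any particular repository.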
