Commit 6e6e34e

Update TensorRT-LLM backend (triton-inference-server#272)
* Update TensorRT-LLM backend
1 parent f51f50c commit 6e6e34e

35 files changed: +117 −347 lines

README.md

Lines changed: 8 additions & 6 deletions
@@ -1,5 +1,5 @@
 <!--
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions
@@ -41,20 +41,22 @@ available in the main [server](https://github.com/triton-inference-server/server
 repo. If you don't find your answer there you can ask questions on the
 [issues page](https://github.com/triton-inference-server/tensorrtllm_backend/issues).

-## Building the TensorRT-LLM Backend
+## Accessing the TensorRT-LLM Backend

 There are several ways to access the TensorRT-LLM Backend.

-**Before Triton 23.10 release, please use [Option 3 to build TensorRT-LLM backend via Docker](#option-3-build-via-docker)**
+**Before Triton 23.10 release, please use [Option 3 to build TensorRT-LLM backend via Docker](#option-3-build-via-docker).**

-### Option 1. Run the Docker Container
+### Run the Pre-built Docker Container

 Starting with Triton 23.10 release, Triton includes a container with the TensorRT-LLM
 Backend and Python Backend. This container should have everything to run a
 TensorRT-LLM model. You can find this container on the
 [Triton NGC page](https://catalog.ngc.nvidia.com/orgs/nvidia/containers/tritonserver).

-### Option 2. Build via the build.py Script in Server Repo
+### Build the Docker Container
+
+#### Option 1. Build via the `build.py` Script in Server Repo

 Starting with Triton 23.10 release, you can follow steps described in the
 [Building With Docker](https://github.com/triton-inference-server/server/blob/main/docs/customization_guide/build.md#building-with-docker)
@@ -90,7 +92,7 @@ the TensorRT-LLM backend and Python backend repositories that will be used
 to build the container. You can also remove the features or endpoints that you
 don't need by removing the corresponding flags.

-### Option 3. Build via Docker
+#### Option 2. Build via Docker

 The version of Triton Server used in this build option can be found in the
 [Dockerfile](./dockerfile/Dockerfile.trt_llm_backend).

all_models/inflight_batcher_llm/ensemble/config.pbtxt

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions

all_models/inflight_batcher_llm/postprocessing/1/model.py

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions

all_models/inflight_batcher_llm/postprocessing/config.pbtxt

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions

all_models/inflight_batcher_llm/preprocessing/1/model.py

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions

all_models/inflight_batcher_llm/preprocessing/config.pbtxt

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions

all_models/inflight_batcher_llm/tensorrt_llm/config.pbtxt

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions

all_models/inflight_batcher_llm/tensorrt_llm_bls/1/model.py

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions

all_models/inflight_batcher_llm/tensorrt_llm_bls/config.pbtxt

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions

ci/L0_backend_trtllm/base_metrics_verification_tests.py

Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,5 @@
 #!/usr/bin/python
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions

ci/L0_backend_trtllm/custom_metrics_verification_tests.py

Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,5 @@
 #!/usr/bin/python
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions

ci/L0_backend_trtllm/generate_engines.sh

Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,5 @@
 #!/bin/bash
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions

ci/L0_backend_trtllm/test.sh

Lines changed: 5 additions & 1 deletion
@@ -1,5 +1,5 @@
 #!/bin/bash
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions
@@ -39,6 +39,10 @@ CUSTOM_METRICS_VERIFICATION_TEST=custom_metrics_verification_tests.py
 CUSTOM_METRICS_VERIFICATION_LOG="custom_metrics_verification.log"
 SERVER_PID=0

+# Force environment to use python version 3
+apt update -q=2 \
+    && apt install -y python-is-python3
+
 # Helpers ===============================
 function replace_config_tags {
     tag_to_replace="${1}"
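The `python-is-python3` package added to the CI script above makes the bare `python` command resolve to Python 3. As a minimal sketch of the same check done from Python (the helper name `reports_python3` is illustrative, not part of the test script):

```python
import subprocess

def reports_python3(exe: str) -> bool:
    """Return True if the given interpreter prints a 3.x version string."""
    result = subprocess.run([exe, "--version"], capture_output=True, text=True)
    # Python 3 prints the version to stdout; some older builds used stderr.
    version = (result.stdout or result.stderr).strip()
    return version.startswith("Python 3")
```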

ci/README.md

Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,5 @@
 <!--
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions
Lines changed: 47 additions & 0 deletions
@@ -0,0 +1,47 @@
+ARG BASE_IMAGE=nvcr.io/nvidia/tritonserver:23.11-py3-min
+
+FROM ${BASE_IMAGE} as base
+
+RUN apt-get update -q=2 && apt-get install -y --no-install-recommends python3-pip
+# Remove previous TRT installation
+# We didn't remove libnvinfer* here because tritonserver depends on the pre-installed libraries.
+RUN apt-get remove -y tensorrt*
+RUN pip3 uninstall -y tensorrt
+
+ARG TRT_VER
+
+ENV TRT_VERSION=$TRT_VER \
+    TRT_VER=$TRT_VER \
+    CUDA_VER=$CUDA_VERSION \
+    CUDNN_VER=$CUDNN_VERSION \
+    NCCL_VER=$NCCL_VERSION \
+    CUBLAS_VER=$CUBLAS_VERSION
+
+LABEL TRT_VERSION $TRT_VER
+
+RUN echo TRT_VERSION=$TRT_VER \
+    TRT_VER=$TRT_VER \
+    CUDA_VER=$CUDA_VERSION \
+    CUDNN_VER=$CUDNN_VERSION \
+    NCCL_VER=$NCCL_VERSION \
+    CUBLAS_VER=$CUBLAS_VERSION
+# Download & install internal TRT release
+RUN [ "$(uname -m)" != "x86_64" ] && arch="sbsa" || arch="x86_64" \
+    && curl -o /tmp/cuda-keyring.deb https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/$arch/cuda-keyring_1.0-1_all.deb \
+    && apt install /tmp/cuda-keyring.deb \
+    && rm /tmp/cuda-keyring.deb \
+    && apt-get update -q=2
+
+ARG RELEASE_URL_TRT_x86
+ARG RELEASE_URL_TRT_ARM
+
+RUN [ "$(uname -m)" != "x86_64" ] && RELEASE_URL_TRT=${RELEASE_URL_TRT_ARM} || RELEASE_URL_TRT=${RELEASE_URL_TRT_x86} \
+    && curl -fSL -o /tmp/tensorrt.tar.gz ${RELEASE_URL_TRT} \
+    && tar xzvf /tmp/tensorrt.tar.gz -C /usr/local \
+    && rm /tmp/tensorrt.tar.gz \
+    && find /usr/local -maxdepth 1 -name Tens* -type d -exec ln -s {} /usr/local/tensorrt \;
+
+RUN pip3 install /usr/local/tensorrt/python/tensorrt-*-cp$( python3 -c "import sys; print(str(sys.version_info.major) + str(sys.version_info.minor))" )*
+
+ENV LD_LIBRARY_PATH=/usr/local/tensorrt/lib:${LD_LIBRARY_PATH}
+ENV TRT_ROOT=/usr/local/tensorrt
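The new Dockerfile makes two host-dependent selections: it treats any non-x86_64 machine as an ARM server (`sbsa`) platform, and it installs the TensorRT wheel whose CPython tag (e.g. `cp310`) matches the running interpreter. A minimal Python sketch of both selection rules (helper names are illustrative):

```python
import platform

def trt_arch(machine: str = "") -> str:
    """Mirror the Dockerfile's `uname -m` test: anything other than
    x86_64 is treated as an ARM server-base (sbsa) platform."""
    machine = machine or platform.machine()
    return "x86_64" if machine == "x86_64" else "sbsa"

def wheel_cp_tag(major: int, minor: int) -> str:
    """Build the CPython tag used to glob tensorrt-*-cp<MAJ><MIN>* wheels."""
    return f"cp{major}{minor}"
```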

inflight_batcher_llm/CMakeLists.txt

Lines changed: 28 additions & 61 deletions
@@ -1,4 +1,4 @@
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions are met: *
@@ -28,20 +28,7 @@ set(TRITON_BUILD
     OFF
     CACHE STRING "Using Triton build process")

-if(TRITON_BUILD)
-  set_ifndef(TRTLLM_DIR ${CMAKE_CURRENT_SOURCE_DIR}/tensorrt_llm)
-  # Install build time dependencies. This section is executed during cmake
-  # configure time.
-  execute_process(
-    COMMAND bash -x ./tools/environment_setup.sh
-    WORKING_DIRECTORY ${CMAKE_CURRENT_SOURCE_DIR}
-    RESULT_VARIABLE CMD_RESULT)
-  if(NOT CMD_RESULT EQUAL "0")
-    message(FATAL_ERROR "Failed to install build time dependencies")
-  endif()
-else()
-  set_ifndef(TRTLLM_DIR ${CMAKE_CURRENT_SOURCE_DIR}/../tensorrt_llm)
-endif()
+set_ifndef(TRTLLM_DIR ${CMAKE_CURRENT_SOURCE_DIR}/../tensorrt_llm)

 include(${TRTLLM_DIR}/cpp/cmake/modules/find_library_create_target.cmake)

@@ -64,16 +51,6 @@ if(TRITON_ENABLE_METRICS AND NOT TRITON_ENABLE_STATS)
     FATAL_ERROR "TRITON_ENABLE_METRICS=ON requires TRITON_ENABLE_STATS=ON")
 endif()

-# The TRTLLM_BUILD_CONTAINER is used to compile the TRT-LLM libraries that are
-# needed for the TRT-LLM backend. The TRTLLM_BUILD_CONTAINER is launched
-# separately, and the artifacts will be copied back to the backend installation
-# directory.
-if(TRITON_BUILD)
-  set(TRTLLM_BUILD_CONTAINER
-      ""
-      CACHE STRING "Base image for building TRT-LLM")
-endif()
-
 set(TRITON_COMMON_REPO_TAG
     "main"
     CACHE STRING "Tag for triton-inference-server/common repo")
@@ -116,31 +93,6 @@ FetchContent_Declare(
   GIT_SHALLOW ON)
 FetchContent_MakeAvailable(repo-common repo-core repo-backend)

-# Compile TRT-LLM
-if(TRITON_BUILD)
-  set(TRITON_TRTLLM_DOCKER_NAME "tritonserver-trtllm")
-  add_custom_command(
-    OUTPUT tensorrt_llm_build
-    COMMENT "Building TensorRT-LLM"
-    COMMAND
-      cd ${CMAKE_CURRENT_SOURCE_DIR} && python3 tools/gen_trtllm_dockerfile.py
-      --trtllm-build-config="${CMAKE_BUILD_TYPE}"
-      --trtllm-base-image="${TRTLLM_BUILD_CONTAINER}" --output=Dockerfile.trtllm
-    COMMAND
-      cd ${CMAKE_CURRENT_SOURCE_DIR} && docker build
-      --cache-from=${TRITON_TRTLLM_DOCKER_NAME}
-      --cache-from=${TRITON_TRTLLM_DOCKER_NAME}_cache0
-      --cache-from=${TRITON_TRTLLM_DOCKER_NAME}_cache1 -t
-      ${TRITON_TRTLLM_DOCKER_NAME} -f ./Dockerfile.trtllm .
-    COMMAND docker rm trtllm_build || echo 'error ignored...' || true
-    COMMAND docker create --name trtllm_build ${TRITON_TRTLLM_DOCKER_NAME}
-    COMMAND cd ${CMAKE_CURRENT_SOURCE_DIR} && rm -fr tensorrt_llm
-    COMMAND cd ${CMAKE_CURRENT_SOURCE_DIR} && docker cp
-      trtllm_build:/app/tensorrt_llm tensorrt_llm
-    COMMAND docker cp trtllm_build:/opt/trtllm_lib trtllm_build
-    COMMAND docker rm trtllm_build)
-endif()
-
 #
 # The backend must be built into a shared library. Use an ldscript to hide all
 # symbols except for the TRITONBACKEND API.
@@ -153,11 +105,6 @@ set(SRCS src/libtensorrtllm.cc src/work_item.cc src/work_items_queue.cc

 add_library(triton-tensorrt-llm-backend SHARED ${SRCS})

-if(TRITON_BUILD)
-  add_custom_target(trtllm_target DEPENDS tensorrt_llm_build)
-  add_dependencies(triton-tensorrt-llm-backend trtllm_target)
-endif()
-
 add_library(TritonTensorRTLLMBackend::triton-tensorrt-llm-backend ALIAS
             triton-tensorrt-llm-backend)

@@ -352,10 +299,25 @@ if(TRITON_ENABLE_METRICS)
 endif()

 if(TRITON_BUILD)
-  add_dependencies(tensorrt_llm trtllm_target)
-  add_dependencies(tensorrt_llm_batch_manager trtllm_target)
-  add_dependencies(nvinfer_plugin_tensorrt_llm trtllm_target)
-endif()
+
+  if(CMAKE_HOST_SYSTEM_PROCESSOR STREQUAL "x86_64")
+    execute_process(
+      WORKING_DIRECTORY ${TRTLLM_DIR}
+      COMMAND bash -x docker/common/install_pytorch.sh pypi COMMAND_ECHO STDOUT
+      COMMAND_ERROR_IS_FATAL ANY)
+  else()
+    execute_process(
+      WORKING_DIRECTORY ${TRTLLM_DIR}
+      COMMAND bash -x docker/common/install_pytorch.sh src_non_cxx11_abi
+      COMMAND_ECHO STDOUT COMMAND_ERROR_IS_FATAL ANY)
+  endif() # CMAKE_HOST_SYSTEM_PROCESSOR
+
+  execute_process(
+    WORKING_DIRECTORY ${TRTLLM_DIR}
+    COMMAND python3 scripts/build_wheel.py --trt_root /usr/local/tensorrt
+    COMMAND_ECHO STDOUT COMMAND_ERROR_IS_FATAL ANY)
+
+endif() # TRITON_BUILD

 target_link_libraries(
   triton-tensorrt-llm-backend
@@ -407,9 +369,14 @@ install(
   RUNTIME DESTINATION ${CMAKE_INSTALL_PREFIX}/backends/tensorrtllm)

 if(TRITON_BUILD)
-  install(DIRECTORY ${CMAKE_CURRENT_BINARY_DIR}/trtllm_build/
+  file(
+    GLOB
+    LIBINFER_PLUGIN_TENSORRT_LLM
+    "${TRTLLM_DIR}/cpp/build/tensorrt_llm/plugins/libnvinfer_plugin_tensorrt_llm.so*"
+    FOLLOW_SYMLINKS)
+  install(FILES ${LIBINFER_PLUGIN_TENSORRT_LLM}
           DESTINATION ${CMAKE_INSTALL_PREFIX}/backends/tensorrtllm)
-endif()
+endif() # TRITON_BUILD

 install(
   EXPORT triton-tensorrt-llm-backend-targets
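The new `file(GLOB ... FOLLOW_SYMLINKS)` install step collects the plugin shared library together with any versioned aliases (`.so`, `.so.N`, ...). A minimal Python sketch of the same globbing logic (the function name is illustrative):

```python
from pathlib import Path

def find_plugin_libs(trtllm_dir: str) -> list:
    """Gather libnvinfer_plugin_tensorrt_llm.so and its versioned aliases,
    matching the pattern the CMake install step globs for."""
    pattern = "cpp/build/tensorrt_llm/plugins/libnvinfer_plugin_tensorrt_llm.so*"
    return sorted(str(p) for p in Path(trtllm_dir).glob(pattern))
```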

inflight_batcher_llm/client/inflight_batcher_llm_client.py

Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,5 @@
 #!/usr/bin/env python
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions

inflight_batcher_llm/cmake/TritonTensorRTLLMBackendConfig.cmake.in

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions

inflight_batcher_llm/src/custom_metrics_reporter/custom_metrics_reporter.cc

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-// Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+// Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 //
 // Redistribution and use in source and binary forms, with or without
 // modification, are permitted provided that the following conditions

inflight_batcher_llm/src/custom_metrics_reporter/custom_metrics_reporter.h

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-// Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+// Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 //
 // Redistribution and use in source and binary forms, with or without
 // modification, are permitted provided that the following conditions

inflight_batcher_llm/src/libtensorrtllm.cc

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-// Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+// Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 //
 // Redistribution and use in source and binary forms, with or without
 // modification, are permitted provided that the following conditions

inflight_batcher_llm/src/model_instance_state.cc

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-// Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+// Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 //
 // Redistribution and use in source and binary forms, with or without
 // modification, are permitted provided that the following conditions

inflight_batcher_llm/src/model_instance_state.h

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-// Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+// Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 //
 // Redistribution and use in source and binary forms, with or without
 // modification, are permitted provided that the following conditions

inflight_batcher_llm/src/model_state.cc

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-// Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+// Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 //
 // Redistribution and use in source and binary forms, with or without
 // modification, are permitted provided that the following conditions

inflight_batcher_llm/src/model_state.h

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-// Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+// Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 //
 // Redistribution and use in source and binary forms, with or without
 // modification, are permitted provided that the following conditions

inflight_batcher_llm/src/utils.cc

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-// Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+// Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 //
 // Redistribution and use in source and binary forms, with or without
 // modification, are permitted provided that the following conditions

inflight_batcher_llm/src/utils.h

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-// Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+// Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 //
 // Redistribution and use in source and binary forms, with or without
 // modification, are permitted provided that the following conditions

inflight_batcher_llm/src/work_item.cc

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-// Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+// Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 //
 // Redistribution and use in source and binary forms, with or without
 // modification, are permitted provided that the following conditions

inflight_batcher_llm/src/work_item.h

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-// Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+// Copyright 2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 //
 // Redistribution and use in source and binary forms, with or without
 // modification, are permitted provided that the following conditions
