v0.20.0 #5351
nv-guomingz
announced in
Announcements
v0.20.0
#5351
Replies: 2 comments 6 replies
-
|
Hi All, Can we deploy llms in production using tensorrt without additional licenses? tensorrt-llm has tensorrt as a dependency. Kindly clarify |
Beta Was this translation helpful? Give feedback.
5 replies
-
|
TnesorRT-LLM can be used for production deployment without additional license constraints and it has already been used by lots of production customers. June |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
TensorRT-LLM Release 0.20.0
Key Features and Enhancements
examples/models/core/qwen/README.md.examples/models/contrib/hyperclovax/README.mdexamples/scaffolding/contrib/Dynasor/README.mdInfrastructure Changes
API Changes
Fixed Issues
Known Issues
What's Changed
enable_overlap_schedulerby @kaiyux in fix: wrong argument nameenable_overlap_scheduler#4433Llama-3_3-Nemotron-Super-49B-v1integration-perf-tests (TRT flow, trtllm-bench) by @venkywonka in test(perf): Add someLlama-3_3-Nemotron-Super-49B-v1integration-perf-tests (TRT flow, trtllm-bench) #4128Phi-4-mini-instructperf tests (test(perf): Add remainingPhi-4-mini-instructperf tests #4443) by @venkywonka in [cherry-pick] test(perf): Add remainingPhi-4-mini-instructperf tests (#4443) #4589Llama-3_1-Nemotron-Ultra-253B-v1perf tests (cpp) #4446) by @venkywonka in [cherry-pick] test(perf): Add Llama-3_1-Nemotron-Ultra-253B-v1 perf tests (cpp) (#4446) #4590Llama-3_3-Nemotron-Super-49B-v1integration-perf-tests (cpp) #4499) by @venkywonka in [cherry-pick] test(perf): Pt.2 Add Llama-3_3-Nemotron-Super-49B-v1 integration-perf-tests (cpp) (#4499) #4588New Contributors
Full Changelog: v0.20.0rc3...v0.20.0
This discussion was created from the release v0.20.0.
Beta Was this translation helpful? Give feedback.
All reactions