Pull requests: NVIDIA/TensorRT-LLM
Pull requests list
[None][doc] Paragraph adjustment and fix statistic
#8568
opened Oct 22, 2025 by
yunruis
1 task done
[https://nvbugs/5437384][test] CHERRY-PICK: fix trtllm-llmapi-launch multi tests
#8567
opened Oct 22, 2025 by
Superjomn
1 task done
[https://nvbugs/5488576][fix] Propagate disable_finalize_fusion config flag in WIDEEP MoE backend (cherry-pick #8141)
#8566
opened Oct 22, 2025 by
kaiyux
1 task
[https://nvbugspro.nvidia.com/bug/5564465][test] ensure deepseek_v3_lite isl + osl < max_seq_len
#8565
opened Oct 22, 2025 by
ruodil
1 task done
[None][feat] Enable rms norm fusion for Nemotron MOE
#8563
opened Oct 22, 2025 by
suyoggupta
[TRTLLM-8817][chore] Set default value of KvCacheConfig.free_gpu_memory_fraction explicitly
#8561
opened Oct 22, 2025 by
QiJune
1 task done
[None][feat] Dev scaffolding bench load_generator
#8559
opened Oct 22, 2025 by
dcaox
1 task done
[TRTLLM-8812][chore] Limit the scope of pybind based CacheTransceiverConfig
#8558
opened Oct 22, 2025 by
QiJune
1 task done
[https://nvbugs/5568961][fix] Fix a merge conflict (cherrypick from PR 8365)
#8553
opened Oct 21, 2025 by
chang-l
1 task done
[https://nvbugs/5549081][fix] Fix device id assignment for some visio…
#8552
opened Oct 21, 2025 by
chang-l
1 task done
[#8245][feat] Autodeploy: Guided Decoding Support
#8551
opened Oct 21, 2025 by
govind-ramnarayan
1 task done
[None][fix] fixed cached model path in test
#8549
opened Oct 21, 2025 by
MrGeva
1 task done
[TRTLLM-8201][feat] TP sharding of Mamba layers
AutoDeploy
<NV> AutoDeploy Backend
#8548
opened Oct 21, 2025 by
greg-kwasniewski1
Draft
1 task
[None][infra] enable lfs for generateLockFile pipeline
#8547
opened Oct 21, 2025 by
yuanjingx87
1 task
[https://nvbugs/5576192][fix] Unwaive the test for test_weight_only_quant_gemm.
#8546
opened Oct 21, 2025 by
zheyuf
1 task done
[None][refactor] Include Python attributions in wheel packaging
#8545
opened Oct 21, 2025 by
venkywonka
1 task done
[https://nvbugs/5569719][fix] Gptoss sm120 cherrypick to release 1.1
#8544
opened Oct 21, 2025 by
farazkh80
1 task done
[None][fix] generate nanobind stubs for submodules
#8539
opened Oct 21, 2025 by
ixlmar
1 task done
[https://nvbugs/5564465][fix] Overwrite only if default_max_tokens is legal
#8538
opened Oct 21, 2025 by
LinPoly
1 task done
[None][fix] Allow multi-threaded copy for GDRCopy wrapper
#8535
opened Oct 21, 2025 by
dongxuy04
1 task done
[None][chore] add precommit hook to remove redundant tab and white space
#8534
opened Oct 21, 2025 by
xinhe-nv
1 task done
[https://nvbugs/5575920][fix] Fix cublas/cublasLt handle creation memory not sufficient error
#8533
opened Oct 21, 2025 by
dominicshanshan
1 task done
[None][doc] add visualization of perf metrics in time breakdown tool doc
#8530
opened Oct 21, 2025 by
zhengd-nv
1 task done