Pull requests: NVIDIA/TensorRT-LLM
[TRTLLM-8832][feat] fully async _select_generated_logits with tests
#8628
opened Oct 23, 2025 by
ixlmar
1 task done
[https://nvbugs/5508536][fix] Reintroduce: Move stop_criteria to sample_async (#7041)
#8627
opened Oct 23, 2025 by
netanel-haber
Draft
[None][chore] exclude InductorSubproc from thread leak check
#8624
opened Oct 23, 2025 by
leslie-fang25
1 task done
[None][feat] Refactor scaffolding streaming feature and fix openai wo…
#8622
opened Oct 23, 2025 by
WeiHaocheng
1 task
[TRTLLM-8658][infra] upgrade to DLFW 25.10 and pytorch 2.9.0 / triton 3.5.0
#8621
opened Oct 23, 2025 by
ZhanruiSunCh
1 task
[None][feat] Enable nvfp4 cuda core for sm120
#8620
opened Oct 23, 2025 by
Njuapp
1 task done
[None][test] Clean cache for certain easily hang cases
#8619
opened Oct 23, 2025 by
crazydemo
1 task done
[https://nvbugs/5558117][fix] Allow per-layer quant config from hf_quant_config.json
#8617
opened Oct 23, 2025 by
rosenrodt
1 task done
[https://nvbugs/5597647][fix] Fix MNNVL Allreduce accuracy issue on Hopper
#8612
opened Oct 23, 2025 by
timlee0212
1 task done
[https://nvbugs/5587456][fix] Remove multimodal test cases using TRT backend
#8611
opened Oct 23, 2025 by
jieli-matrix
1 task done
[https://nvbugs/5541145][fix] Remove DeepSeekR1 test case from H20 to prevent OOM
#8610
opened Oct 23, 2025 by
jieli-matrix
1 task done
[None][infra] Minor Update on Perf Sanity Testdb Files
#8607
opened Oct 23, 2025 by
chenfeiz0326
1 task done
[None][test]: Add longbench v2 for long context evaluation
#8604
opened Oct 23, 2025 by
baize97
1 task
[None][fix] Fix e2e tests for phi4mm and NVILA
#8603
opened Oct 23, 2025 by
xinhe-nv
1 task done
[TRTLLM-8431][doc] update public doc and example
#8602
opened Oct 23, 2025 by
reasonsolo
1 task done
[TRTLLM-8836][chore] Create ModelEngine from LlmArgs
#8600
opened Oct 23, 2025 by
QiJune
1 task done
[TRTLLM-8511][feat] AutoDeploy: optimize fused_mlp_moe_kernel tiles
#8597
opened Oct 22, 2025 by
nzmora-nvidia
1 task done