Skip to content

Pull requests: alibaba/rtp-llm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Feature/cp block ownership
#777 opened Mar 12, 2026 by LLLLKKKK Loading…
【WIP】only for test Qwen3 next 0304
#776 opened Mar 12, 2026 by hxy0118 Loading…
address review comments in remoteconnector
#774 opened Mar 12, 2026 by lucky-zzz Loading…
feat: update deps
#773 opened Mar 11, 2026 by Bruce-Lee-LY Loading…
fix - mv trtllm to first
#772 opened Mar 11, 2026 by zerozw Loading…
[Wip] Feat/remove gpt model
#771 opened Mar 11, 2026 by JackTan25 Loading…
Feat/support gqa cp reuse cache
#770 opened Mar 11, 2026 by MMadhatter Loading…
fix: device_reuse_len metric double report
#768 opened Mar 11, 2026 by SJTUGavinLiu Loading…
tiered kv cache
#765 opened Mar 10, 2026 by netaddi Loading…
fix: glm5 load weights
#763 opened Mar 10, 2026 by Bruce-Lee-LY Loading…
fix: fix cuda graph debug mode in torch2.8.0
#762 opened Mar 10, 2026 by JackTan25 Loading…
feat:refactor graph to support rocm backend
#761 opened Mar 10, 2026 by muse-coder Loading…
feat: optimize cuda graph can run logic
#760 opened Mar 10, 2026 by JackTan25 Loading…
fix - fix pywrappedmodel attn object not hold
#759 opened Mar 9, 2026 by zerozw Loading…
Headwise ut
#758 opened Mar 9, 2026 by qqbbiu Loading…
Feature/support qwen35 merge
#755 opened Mar 6, 2026 by alibaba-miji Loading…
feat: remove old sp engine
#753 opened Mar 6, 2026 by JackTan25 Loading…
feat: separate py model from gpt model
#752 opened Mar 6, 2026 by JackTan25 Loading…
feat: refactor reuse cache on rocm python mode
#751 opened Mar 6, 2026 by muse-coder Loading…
feat: support w4a8
#750 opened Mar 6, 2026 by Bruce-Lee-LY Loading…
Qwen moe pure tp support
#748 opened Mar 5, 2026 by hxy0118 Loading…
feat: support sp prefill cuda graph
#745 opened Mar 5, 2026 by JackTan25 Loading…
feat: support headwise attention
#742 opened Mar 5, 2026 by Echo-2334 Loading…
Develop/kvcache refactor 3
#739 opened Mar 3, 2026 by xinfei-shi Loading…
ProTip! Exclude everything labeled bug with -label:bug.