Skip to content

Pull requests: deepseek-ai/DeepEP

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

docs(readme): fix typos, clarify wording
#444 opened Oct 9, 2025 by Dhie-boop Loading…
Fix racing condition in large batch size
#440 opened Oct 2, 2025 by fzyzcjy Loading…
opt ll dispatch layered algo
#425 opened Sep 24, 2025 by alpha-baby Loading…
Support per tensor transfer
#416 opened Sep 18, 2025 by ayrnb Loading…
Add imbalance factor in test_low_latency
#393 opened Sep 4, 2025 by JianboDong Loading…
Feature/sm free normal kernel
#347 opened Jul 31, 2025 by ZhiyiHu1999 Loading…
4 tasks
Support nvfp4 low latency mode dispatch
#341 opened Jul 30, 2025 by shifangx Loading…
Support prefill with 2 GPUs
#331 opened Jul 27, 2025 by fzyzcjy Loading…
enhance warp copy efficiency in cached_notify()
#315 opened Jul 18, 2025 by ZhiyiHu1999 Loading…
support low latency dispatch tma
#293 opened Jul 10, 2025 by ayrnb Loading…
Tiny support custom nvcc flags
#280 opened Jul 5, 2025 by fzyzcjy Loading…
Allow using few SMs for low-latency mode
#277 opened Jul 3, 2025 by fzyzcjy Loading…
Computation communication overlap
#249 opened Jun 24, 2025 by fzyzcjy Draft
Support other NVLink scenarios
#218 opened Jun 17, 2025 by fzyzcjy Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.