Pull requests: sgl-project/sglang

Pull requests list

Speed up update_weights_from_tensor
#2695 opened Jan 1, 2025 by fzyzcjy
3 tasks
Hierarchical Caching for SGLang (label: enhancement)
#2693 opened Jan 1, 2025 by xiezhq-hermann
3 tasks
Support twoshot kernel
#2688 opened Dec 31, 2024 by yizhang2077
3 tasks
[feat] Add math eval to CI nightly run
#2663 opened Dec 30, 2024 by XiaotongJiang
Support InternVL2 Series
#2629 opened Dec 28, 2024 by amosyou Draft
3 of 7 tasks
Refactor Scheduler to improve code organization
#2593 opened Dec 26, 2024 by libratiger
3 tasks done
[Docs] add quantization docs (label: dependencies)
#2572 opened Dec 25, 2024 by JamesSand
3 tasks done
Refactor SchedulePolicy to improve code organization
#2571 opened Dec 25, 2024 by libratiger
3 tasks done
feat: support 2 kernels for mixed chunked prefill
#2546 opened Dec 22, 2024 by chosen-ox
2 tasks
Enable Nvidia's ModelOpt fp8 quantized models (labels: high priority, quant)
#2535 opened Dec 21, 2024 by Edwardf0t1
1 of 3 tasks
[Cache Offload] Remove device sync overhead
#2533 opened Dec 20, 2024 by Edenzzzz
3 tasks
adapt custom allreduce for tensorrt llm (label: high priority)
#2511 opened Dec 18, 2024 by yizhang2077
3 tasks
Add InfiniteBench for long context benchmarking (label: high priority)
#2421 opened Dec 9, 2024 by iankur
2 of 3 tasks
[Feature] Add sampler custom logits processor
#2396 opened Dec 8, 2024 by hongpeng-guo
2 of 3 tasks