Refactor Scheduler to improve code organization #2593

libratiger · 2024-12-26T10:29:35Z

Motivation

When I try to deep into the Zero-Overhead Batch Scheduler , I find is hard to get clear on the scheduling, and is hard to impl a new scheduling policy, so I try to refactor SchedulePolicy,and make it easy to add new policy for me and others.
McCabe indicates that the code complexity has exceeded 15

Modifications

Move sorting logic into separate static methods for better maintainability

Testing:

python3 -m sglang.launch_server --model Qwen/Qwen2.5-0.5B-Instruct
python3 -m sglang.bench_serving --backend sglang --dataset-name random --num-prompts 500 --random-input 4096 --random-output 2048

Checklist

Format your code according to the Contributor Guide.
Add unit tests as outlined in the Contributor Guide.
Update documentation as needed, including docstrings or example tutorials.

libratiger · 2024-12-26T10:31:17Z

related with #2571

cc @merrymercy

libratiger · 2024-12-30T06:16:44Z

cc @merrymercy @hnyls2002 if you have time for this PR

I would like to optimize for the task in #2273

Further reduce the scheduling overhead of mixed chunked prefill by simplifying the mix_with_running. The current code first constructs a prefill batch and a decode batch and them merge them. A better method can directly construct a whole mixed batch.

merrymercy · 2024-12-30T13:53:48Z

@libratiger Are you in the slack channel? If you are interested in optimizing the mixed chunked prefill, we can chat in more details.

Currently, we hold this PR because there are several big high-priority pending PRs (speculative decoding, multi-node TP + DP), we probably want to merge them before making big refactor.

libratiger added 3 commits December 26, 2024 16:14

simplify the get_new_batch_prefill in Scheduler

c5c3942

remove duplicate

1d5db0c

refactor the process_batch_result_prefill

d7e56a3

libratiger requested review from merrymercy, Ying1123 and hnyls2002 as code owners December 26, 2024 10:29

Merge branch 'main' into refactorscheduler

67acc85

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor Scheduler to improve code organization #2593

Refactor Scheduler to improve code organization #2593

libratiger commented Dec 26, 2024

libratiger commented Dec 26, 2024

libratiger commented Dec 30, 2024

merrymercy commented Dec 30, 2024 •

edited

Loading

Refactor Scheduler to improve code organization #2593

Are you sure you want to change the base?

Refactor Scheduler to improve code organization #2593

Conversation

libratiger commented Dec 26, 2024

Motivation

Modifications

Checklist

libratiger commented Dec 26, 2024

libratiger commented Dec 30, 2024

merrymercy commented Dec 30, 2024 • edited Loading

merrymercy commented Dec 30, 2024 •

edited

Loading