Add generator-style run_batch function #2513

xingyaoww · 2024-12-18T20:01:56Z

This PR adds a new generator_style parameter to run_batch that allows yielding results as they become available, while maintaining the performance benefits of batch processing. This is particularly useful when you want to process results as soon as they are ready, for example to save them to disk.

Changes

Added a new generator_style parameter to run_program_batch in interpreter.py that defaults to False
Modified the implementation to support both the original behavior and the new generator behavior
Added the generator_style parameter to the run_batch method in ir.py to expose it to users

Usage Examples

# Original batch mode (default):
states = my_program.run_batch(args)
for state in states:
   save_result(state)

# New generator mode:
for state in my_program.run_batch(args, generator_style=True):
   save_result(state)

# With progress bar:
for state in tqdm.tqdm(my_program.run_batch(args, generator_style=True), total=len(args)):
   save_result(state)

The generator mode yields results as soon as they are available, which is useful for:

Saving results to disk immediately to avoid memory pressure
Processing results in real-time while others are still being generated
Getting a more accurate progress indication

Implementation Details

The implementation maintains the same efficient batching and threading mechanisms as before
When generator_style=True, results are yielded through Python generators as they complete
Progress bar support is maintained in both modes
The change is fully backward compatible - existing code will continue to work without modification

Fixes #303

PR co-authored by OpenHands: https://www.all-hands.dev/share?share_id=b1757eabec18e7204a615b889819333dca4cb4388a7da4fe5b8b074f3595a582

This change adds a new generator_style parameter to run_batch that allows yielding results as they become available, while maintaining the performance benefits of batch processing. This is particularly useful when you want to process results as soon as they are ready, for example to save them to disk. When generator_style=True, run_batch yields tuples of (arguments, result) as they become available, instead of returning a list at the end. Fixes sgl-project#303

As suggested in the issue, we don't need to return the arguments with each result. The user can maintain their own mapping if needed.

python/sglang/lang/interpreter.py

merrymercy · 2024-12-26T17:54:39Z

Please fix the CI test cases https://github.com/sgl-project/sglang/actions/runs/12506724584/job/34892097249?pr=2513

…r-run-batch

merrymercy · 2024-12-28T22:10:33Z

The CI stil fails

xingyaoww · 2025-01-02T03:26:16Z

~~@merrymercy Sorry for the delay! Finally got it pass now and can confirm this works locally for me as well~~

EDIT: ok.. when call next on the generator, it didn't really yield an item until all the jobs are done..

merrymercy · 2025-01-02T10:06:42Z

Let me know when it is fully ready.

xingyaoww · 2025-01-02T21:03:07Z

@merrymercy Now i confirm it actually works in my case -- feel free to take a look when you have time

merrymercy

You chunking logic changes the behavior of the old code. Please revert it. When you add this functionality, please do not change any existing behavior and only add additional code to support the new functionality. Make sure when generator_style == False, it runs exactly the old code.

…rator style support

openhands-agent added 2 commits December 18, 2024 18:14

Simplify generator-style run_batch to only yield results

a8b28a2

As suggested in the issue, we don't need to return the arguments with each result. The user can maintain their own mapping if needed.

xingyaoww commented Dec 18, 2024

View reviewed changes

python/sglang/lang/interpreter.py Outdated Show resolved Hide resolved

python/sglang/lang/interpreter.py Show resolved Hide resolved

xingyaoww added 2 commits December 19, 2024 04:05

Update python/sglang/lang/interpreter.py

779479d

Update python/sglang/lang/interpreter.py

8aa9f30

xingyaoww marked this pull request as ready for review December 20, 2024 17:07

xingyaoww requested review from merrymercy, Ying1123, hnyls2002 and ByronHsu as code owners December 20, 2024 17:07

Merge branch 'main' into generator-run-batch

9e4f4ee

merrymercy reviewed Dec 22, 2024

View reviewed changes

python/sglang/lang/interpreter.py Outdated Show resolved Hide resolved

merrymercy added the await-response label Dec 26, 2024

openhands-agent added 2 commits December 26, 2024 17:34

Maintain input order in generator_style=True mode and improve docstrings

f726142

Remove docstrings to fix linting errors

2d21542

Fix test cases to handle both list and generator results

d860e97

merrymercy self-assigned this Dec 26, 2024

openhands-agent and others added 7 commits December 26, 2024 18:15

Fix formatting

58fb561

Fix generator check in test cases

c1f44d3

remove unused future_to_arguments

a20ed9d

fix indentation

eb27ec7

Merge branch 'main' into generator-run-batch

f509eee

revert test change

c2a0feb

Merge commit 'f509eee799a8bd06c1d81058fd1a85eb4eb89146' into generato…

c7d6573

…r-run-batch

xingyaoww and others added 3 commits January 1, 2025 21:08

Merge branch 'main' into generator-run-batch

d41e23f

simplify generator

03399f6

linter fix

3b8a151

xingyaoww added 5 commits January 2, 2025 18:27

add test for generator style True

13401ef

fix the issue where it start yield late

6e2750f

fix yield for a large number of tasks

44ec998

Merge branch 'main' into generator-run-batch

42c5ba7

fix linter

acaf7ba

This comment was marked as duplicate.

Sign in to view

merrymercy requested changes Jan 2, 2025

View reviewed changes

xingyaoww force-pushed the generator-run-batch branch 2 times, most recently from 4b96829 to acaf7ba Compare January 3, 2025 05:04

openhands-agent and others added 4 commits January 3, 2025 05:15

Refactor run_program_batch to preserve original behavior and add gene…

194909d

…rator style support

fix linter

d908645

Merge branch 'main' into generator-run-batch

a06e4df

fix linter

2159cdb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add generator-style run_batch function #2513

Add generator-style run_batch function #2513

xingyaoww commented Dec 18, 2024 •

edited

Loading

merrymercy commented Dec 26, 2024 •

edited

Loading

merrymercy commented Dec 28, 2024

xingyaoww commented Jan 2, 2025 •

edited

Loading

merrymercy commented Jan 2, 2025

xingyaoww commented Jan 2, 2025

This comment was marked as duplicate.

merrymercy left a comment

Add generator-style run_batch function #2513

Are you sure you want to change the base?

Add generator-style run_batch function #2513

Conversation

xingyaoww commented Dec 18, 2024 • edited Loading

Changes

Usage Examples

Implementation Details

merrymercy commented Dec 26, 2024 • edited Loading

merrymercy commented Dec 28, 2024

xingyaoww commented Jan 2, 2025 • edited Loading

merrymercy commented Jan 2, 2025

xingyaoww commented Jan 2, 2025

This comment was marked as duplicate.

merrymercy left a comment

Choose a reason for hiding this comment

xingyaoww commented Dec 18, 2024 •

edited

Loading

merrymercy commented Dec 26, 2024 •

edited

Loading

xingyaoww commented Jan 2, 2025 •

edited

Loading