
Conversation

exupero
Contributor

@exupero exupero commented Sep 25, 2025

What changes are proposed in this pull request?

#6288 allows asynchronous background tasks to accumulate without limit, on the assumption that running predictions on a batch takes longer than saving that batch to the DB. In cases where saving to the DB takes longer, the number of async tasks can grow unboundedly and could cause the process to run out of memory.

This PR wraps the batch iterator in a new utility function that limits the number of outstanding background tasks. After each iteration of the source iterator, if there are more background tasks than the specified limit, the oldest tasks are waited on until the backlog shrinks to the limit, at which point the next item is taken from the iterator.
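The shape of such a wrapper might look like the sketch below. This is illustrative only, not the actual FiftyOne implementation; the name `limit_backlog` and the defaults are stand-ins:

```python
import concurrent.futures


def limit_backlog(iterator, limit=10, max_workers=1):
    """Yield ``(submit, item)`` pairs, blocking whenever more than
    ``limit`` background tasks are outstanding."""
    futures = []
    with concurrent.futures.ThreadPoolExecutor(max_workers=max_workers) as executor:

        def submit(fn, *args, **kwargs):
            future = executor.submit(fn, *args, **kwargs)
            futures.append(future)
            return future

        for item in iterator:
            yield submit, item

            # Backpressure: wait on the oldest tasks until the backlog
            # is back down to the limit
            while len(futures) > limit:
                futures.pop(0).result()

        # Drain any remaining tasks before finishing
        for future in futures:
            future.result()
```

Because the wait always targets the oldest future, at most `limit` tasks (plus their queued arguments) are alive at any time, which is what bounds memory.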

This approach is open to discussion. If writing to the DB is the bottleneck and creates backpressure on the prediction pipeline, then as soon as enough async tasks accumulate, throughput degenerates to the same as never using async tasks at all. In that case, we might prefer simplicity and revert #6288 to go back to a single thread.

How is this patch tested? If it is not, please explain why.

Ran the following script on a machine with a GPU, with some added logging that recorded how many tasks had accumulated. Logs showed the expected limits were observed.

I also tried using memory-profiler, but the profiles showed the same memory characteristics as the original code. I suspect memory usage is dominated by the downloaded samples, not by the data needed for the async tasks, so the effect of limiting the queue doesn't show:
[Figure_1: memory profile plot]

@jacobsela has some more sophisticated testing he might be able to apply.

Release Notes

Is this a user-facing change that should be mentioned in the release notes?

  • No. You can skip the rest of this section.
  • Yes. Give a description of this change to be included in the release
    notes for FiftyOne users.

(Details in 1-2 sentences. You can just refer to another PR with a description
if this PR is part of a larger change.)

What areas of FiftyOne does this PR affect?

  • App: FiftyOne application changes
  • Build: Build and test infrastructure changes
  • Core: Core fiftyone Python library changes
  • Documentation: FiftyOne documentation changes
  • Other

@exupero exupero requested a review from jacobsela September 25, 2025 21:45
Contributor

coderabbitai bot commented Sep 25, 2025

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.


Comment @coderabbitai help to get the list of available commands and usage tips.

@exupero
Contributor Author

exupero commented Sep 25, 2025

@jacobsela are you able to run your analysis on this branch to see if it makes any difference?

```diff
 yield submit

-for future in _futures:
+for future in futures:
```
Contributor Author


These lines are an opportunistic cleanup in the original code.

With this branch this function is no longer used in fiftyone, so it could be removed, but the code has been released so there could be third-party uses of it. Suggestions on whether to keep it or get rid of it?

Comment on lines 500 to 501:

```python
limit=10,
max_workers=1,
```
Contributor Author


Some defaults. Ideally these would be configurable by the user.

Contributor


What is the reason for a limit of 10?

Contributor Author


Arbitrary. Didn't have any reason to choose something different. @jacobsela might be able to weigh in with an informed opinion.

```python
        return future

    for item in iterator:
        yield submit, item
```
Contributor


is this supposed to be submit(item)?

Contributor Author


No. This is yielding the submit function and the current item to the consumer loop, so the consumer can call submit where it wants, with whatever function it wants to run asynchronously.
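To illustrate the pattern, here is a hypothetical consumer loop. The wrapper below is a minimal stand-in for the PR's utility (repeated here so the example is self-contained), and `predict`/`save` are made-up placeholders for the prediction and DB-write steps, not FiftyOne APIs:

```python
import concurrent.futures


def bounded_iter(iterator, limit=10, max_workers=1):
    # Minimal stand-in for the PR's wrapper: yields (submit, item) pairs
    # and blocks when more than `limit` tasks are outstanding
    futures = []
    with concurrent.futures.ThreadPoolExecutor(max_workers=max_workers) as executor:

        def submit(fn, *args):
            futures.append(executor.submit(fn, *args))
            return futures[-1]

        for item in iterator:
            yield submit, item
            while len(futures) > limit:
                futures.pop(0).result()
        for future in futures:
            future.result()


saved = []


def predict(batch):  # stand-in for the foreground GPU prediction step
    return [x * 10 for x in batch]


def save(labels):  # stand-in for the slow background DB write
    saved.append(labels)


# The consumer decides what runs async: it calls submit(save, labels),
# not submit(item)
for submit, batch in bounded_iter([[1, 2], [3, 4]], limit=2, max_workers=1):
    labels = predict(batch)  # blocking, in the consumer's thread
    submit(save, labels)     # queued on the background executor
```

So `submit(item)` would be wrong: the item travels alongside `submit` precisely so the consumer can transform it first and then choose which function to hand off.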

@exupero
Contributor Author

exupero commented Oct 2, 2025

@jacobsela any word on whether this branch makes any difference in your benchmarks?

@exupero exupero force-pushed the bugfix/ess/limit-async-queue branch from 6e348e9 to b1596c7 on October 6, 2025 at 19:52
@exupero
Contributor Author

exupero commented Oct 7, 2025

Profiled memory on this branch with @jacobsela's testing script and found that limiting the number of background tasks doesn't seem to help. Even when using a limit of 0, which should be behaviorally equivalent to develop before #6288 (i.e. each batch's prediction is blocked on the previous batch being saved), memory usage climbs almost as high as without a limit. Increasing the number of workers speeds up execution but doesn't use that much less memory.
[visualization(14): memory usage comparison plot]
(Note: the light blue run on 2000 samples, "queue limit 2, 2 workers", was aborted before finishing.)

@exupero
Copy link
Contributor Author

exupero commented Oct 7, 2025

Considering the results found for #6389, the increasing memory usage appears to be the result of an interaction between async_executor and SaveContext (perhaps the SaveContext instance doesn't write batches on its usual schedule, and so doesn't flush data as soon?).

@exupero exupero closed this Oct 7, 2025