Refactor Velox Writer to Use New Flush Policy #242

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

macvincent wants to merge 3 commits into facebookincubator:main from macvincent:export-D81545433

Contributor

macvincent commented Sep 4, 2025

Summary:
This should be a no-op. We make two changes in this dif:

We accumulate the previous raw size of the encoded stripe data in the writer context
We also return whether or not chunking was applied after a writeChunk call.

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack:

Support per stream chunking instead of always chunking all eligible streams.
Support breaking down large stream into multiple smaller chunks.

Rollback Plan:

Differential Revision: D81545433

meta-cla bot added the CLA Signed label

Contributor

facebook-github-bot commented Sep 4, 2025

This pull request was exported from Phabricator. Differential Revision: D81545433

facebook-github-bot added the fb-exported label

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

cbf3885

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch from ce53d64 to cbf3885 Compare

September 4, 2025 16:35

Contributor

facebook-github-bot commented Sep 4, 2025

This pull request was exported from Phabricator. Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

786f9b3

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch from cbf3885 to 786f9b3 Compare

September 4, 2025 16:57

Contributor

facebook-github-bot commented Sep 4, 2025

This pull request was exported from Phabricator. Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

d357775

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch from 786f9b3 to d357775 Compare

September 5, 2025 00:56

Contributor

facebook-github-bot commented Sep 5, 2025

This pull request was exported from Phabricator. Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch from d357775 to c731111 Compare

September 5, 2025 01:49

Contributor

facebook-github-bot commented Sep 5, 2025

This pull request was exported from Phabricator. Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

de1b048

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch from c731111 to de1b048 Compare

September 10, 2025 05:15

Contributor

facebook-github-bot commented Sep 10, 2025

This pull request was exported from Phabricator. Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

468375d

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch from de1b048 to 468375d Compare

September 10, 2025 09:32

Contributor

facebook-github-bot commented Sep 10, 2025

This pull request was exported from Phabricator. Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

df50d87

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch from 468375d to df50d87 Compare

September 10, 2025 18:37

Contributor

facebook-github-bot commented Sep 10, 2025

This pull request was exported from Phabricator. Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

6b742dd

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch from df50d87 to 6b742dd Compare

September 10, 2025 18:38

Contributor

facebook-github-bot commented Sep 10, 2025

This pull request was exported from Phabricator. Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

2fbe79a

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch from 6b742dd to c72af15 Compare

September 11, 2025 01:39

Contributor

facebook-github-bot commented Sep 11, 2025

This pull request was exported from Phabricator. Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

e153909

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch from 57cc414 to 4277796 Compare

October 9, 2025 16:04

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

92978c7

…ator#242)

Summary:
Pull Request resolved: facebookincubator#242

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch from 4277796 to 404a3f1 Compare

October 9, 2025 17:50

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

404a3f1

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

16c575d

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

c8df031

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

c110a04

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

e431a82

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

950c61f

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

f32d15c

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

d7f6020

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch from 404a3f1 to d7f6020 Compare

October 13, 2025 23:27

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

912ac92

…ator#242)

Summary:
Pull Request resolved: facebookincubator#242

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

99429f1

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch from d7f6020 to 99429f1 Compare

October 14, 2025 00:57

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

d22a404

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch from 99429f1 to d22a404 Compare

October 14, 2025 00:58

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

08230ea

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

07c5e5e

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch 2 times, most recently from fe66a9f to c96c6da Compare

October 14, 2025 07:13

macvincent added 3 commits

October 14, 2025 13:28


          Clean Up Nimble Flush Policy Code (facebookincubator#235)

6696c5d

Summary:

As preparation for our [Nimble chunked encoding](https://fburl.com/gdoc/zjck7lo6) work, we decided to clean up the previous contract to remove unused methods and attributes. Should be a no-op since these methods and attributes were not used. We also clarified the naming of some attributes.

Reviewed By: sdruzkin, helfman

Differential Revision: D81514657


          feat(Nimble): New Flush Policy Implementation With Chunking (facebook…

d662f96

…incubator#240)

Summary:
X-link: facebookexternal/presto-facebook#3412

X-link: facebookincubator/velox#14846


This is an implementation of the new chunking policy described in this [doc](https://fburl.com/gdoc/gkdwwju1). It has two phases:

**Phase 1 - Memory Pressure Management (shouldChunk)**
The policy monitors total in-memory data size:
*  When memory usage exceeds the maximum threshold, initiates chunking to reduce memory footprint while continuing data ingestion
*  When previous chunking attempts succeeded and memory remains above the minimum threshold, continues chunking to further reduce memory usage

**Phase 2 - Storage Size Optimization (shouldFlush)**
 Implements compression-aware stripe size prediction:
*   When chunking fails to reduce memory usage effectively and memory stays above the minimum threshold, forces a full stripe flush to guarantee memory relief
*   Calculates the anticipated final compressed stripe size by applying the estimated compression ratio to unencoded data
*   Triggers stripe flush when the predicted compressed size reaches the target stripe size threshold

`shouldChunk` is also now a separate method required by all flush policies. We updated all previous tests and code references

NOTE: The Velox repo change here is just test copied into an experimental directory that references the flush policy.

Differential Revision: D81516697


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

46c3027

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent force-pushed the export-D81545433 branch from c96c6da to 46c3027 Compare

October 14, 2025 20:30

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

ee8c934

…ator#242)

Summary:
Pull Request resolved: facebookincubator#242

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

f81d628

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

macvincent added a commit to macvincent/nimble that referenced this pull request


          Refactor Velox Writer to Use New Flush Policy Contract (facebookincub…

5b14b2b

…ator#242)

Summary:

This should be a no-op since no chunking flush policy is currently being used in Prod. but we make three changes in this dif:
1. `writeChunk` now returns a boolean to indicate whether any stream was successfully chunked
2. The previous raw size of the encoded stripe data in the writer context is now stored in the Writer context
3. We update and pass down the memory stats needed by the new flush policy contract

TODO: We will be introducing two more VeloxWriter changes in the next diffs in this stack to:
1. Support per stream chunking instead of always chunking all eligible streams
2. Support breaking down large stream into multiple smaller chunks

Differential Revision: D81545433

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed fb-exported meta-exported