Skip to content

Conversation

@eli-r-ph
Copy link
Contributor

Problem

In order to track all the files we need to reconstitute a complete RocksDB store for reimport while minimizing uploads to remote storage across many incremental checkpoints, we'll need to include file checksums and use them in the decision to upload files present in each incremental checkpoint. The rules are different for SST vs. non-SST store file types.

Changes

  • Implement file checksum tracking in metadata file format
  • Implement new planner decisioning logic relying on checksums
  • Implement new unit tests and update existing ones to account for new behavior

How did you test this code?

Locally and in CI

👉 Stay up-to-date with PostHog coding conventions for a smoother review.

Changelog: (features only) Is this feature complete?

No update required

@eli-r-ph eli-r-ph requested a review from a team October 18, 2025 00:42
@eli-r-ph eli-r-ph self-assigned this Oct 18, 2025
Copy link
Contributor

@greptile-apps greptile-apps bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

8 files reviewed, no comments

Edit Code Review Agent Settings | Greptile

@eli-r-ph eli-r-ph force-pushed the eli.r/kdedup-cp-plan-checksums branch from 11dc521 to 16581bb Compare October 20, 2025 04:21
Copy link
Contributor

@jose-sequeira jose-sequeira left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall looks good to me, tiny number of nitpicks. Test should be fixed before merged as they have a small logic error. Approving preemptively though

@eli-r-ph eli-r-ph force-pushed the eli.r/kdedup-cp-plan-checksums branch from aecfb8e to 0508c01 Compare October 22, 2025 01:34
@eli-r-ph eli-r-ph merged commit d4ac007 into master Oct 22, 2025
135 of 206 checks passed
@eli-r-ph eli-r-ph deleted the eli.r/kdedup-cp-plan-checksums branch October 22, 2025 15:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants