[flink]Newly discovered partition read from earliest rather than scan.startup.mode by loserwang1024 · Pull Request #3548 · apache/fluss

loserwang1024 · 2026-06-30T07:03:00Z

Purpose

Linked issue: close #3543

Brief change log

Tests

API and Format

Documentation

….startup.mode

loserwang1024 · 2026-06-30T08:21:17Z

@beryllw @leonardBang @wuchong @swuferhong , CC

leonardBang · 2026-06-30T15:06:02Z

+
+        initialDiscoveryFinished = true;
+        for (SourceSplitBase split : splits) {
+            unassignedSplits.put(split.getTableBucket(), split);


This stores unassigned splits by TableBucket, but TableBucket is not unique for all split types. LakeSnapshotSplit uses splitIndex to distinguish multiple lake splits in the same bucket, so checkpointing only unassignedSplits.values() can drop all but one split (or fail on restore with duplicate keys). Could we persist these by splitId or as a list instead of de-duplicating by TableBucket?

leonardBang · 2026-06-30T15:36:25Z

+                assignedPartitions,
+                remainingHybridLakeFlussSplits,
+                leaseId,
+                false,


This default is used when restoring old V2 enumerator state, but it leaves initialDiscoveryFinished=false. After an upgrade, partitions created after that old checkpoint can be classified as initial partitions and use the user startup mode (for example latest), which reintroduces the data-loss case this patch is trying to avoid. Could we default old V2 state to true here, or otherwise make the migration preserve post-initial discovery semantics?

loserwang1024 force-pushed the FLIP-288 branch from 11bda29 to 98b45ed Compare June 30, 2026 07:04

[flink]Newly discovered partition read from earliest rather than scan…

199a3e4

….startup.mode

loserwang1024 force-pushed the FLIP-288 branch from 98b45ed to 199a3e4 Compare June 30, 2026 07:08

leonardBang reviewed Jun 30, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[flink]Newly discovered partition read from earliest rather than scan.startup.mode#3548

[flink]Newly discovered partition read from earliest rather than scan.startup.mode#3548
loserwang1024 wants to merge 1 commit into
apache:mainfrom
loserwang1024:FLIP-288

loserwang1024 commented Jun 30, 2026

Uh oh!

loserwang1024 commented Jun 30, 2026

Uh oh!

leonardBang Jun 30, 2026

Uh oh!

leonardBang Jun 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

loserwang1024 commented Jun 30, 2026

Purpose

Brief change log

Tests

API and Format

Documentation

Uh oh!

loserwang1024 commented Jun 30, 2026

Uh oh!

leonardBang Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

leonardBang Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants