fix: Use separate snapshots for sequencer and validator workloads #116

tobidae-cb · 2025-10-27T17:27:37Z

Description

There's currently a bug where both the sequencing and validation runs use the same snapshot which makes the metrics gathered by the validation run look ultra fast. We want to split out the snapshot used for validation into separate runs to ensure we have the right data

Testing

cb-heimdall · 2025-10-27T17:27:41Z

🟡 Heimdall Review Status

Requirement Status More Info

Reviews

🟡 0/1

Denominator calculation

Show calculation

1 if user is bot 0

1 if user is external 0

2 if repo is sensitive 0

From .codeflow.yml 1

Additional review requirements

Show calculation

Max	0

0

From CODEOWNERS 0

Global minimum 0

Max 1

1

1 if commit is unverified 0

Sum 1

…date yarn lock

meyer9

LG overall! Maybe we can move the logic to a new struct so that it's all easily traceable? I think the setupDataDirs/setupInternalDirectories distinction is a bit unclear, so probably makes sense to just combine them in the new file/struct.

meyer9 · 2025-10-28T00:36:16Z

runner/service.go

+					return nil, nil, errors.Wrap(err, "failed to create sequencer test directory")
+				}
+
+				err = os.Rename(initialSnapshotPath, sequencerDataDirOverride)


I think this will only allow running the test once which may be confusing for reuse_existing. Maybe we should rename that and add an option that allows copying instead of moving here? Or use force_clean to decide?

What's happening here is that after copying the initial snapshot for the validator run, we want to just "move" the original snapshot data to the sequencer data directory. That way we don't have to do 2 copies (for sequencer, validator) and simply do 1 copy with the final action being a move. Saves time with this approach.

The directory setup doesn't have any bearing on the tests that run. It just does the setup the first time. That being said, I could manage it outside of the runTest module for clarity

just to clarify, after moving from the initial snapshot directory with reuse_existing, we wouldn't be able to rerun the test again because now the initial snapshot directory doesn't exist right? Or I might be misunderstanding.

Definitely agree there are speed benefits, so we should support moving rather than copying, but I don't think we should require it.

After moving the initial snapshot directory, all future tests including the current test will use the new directory. You're right that the initial snapshot dir no longer exists but that's fine because the move happens in the first run and then reuses the updated directory for the immediate test and future tests.

If after all the tests are done you decide to run adhoc tests, in this case yeah the initial directory won't exist unless you manually copied it back but this is setup for automated use

meyer9 · 2025-10-28T00:42:43Z

runner/service.go

+		isSnapshot = true // dataDirOverride is only set when using snapshots
+		s.log.Info("Using pre-configured datadir", "path", dataDirPath, "role", role)
+	} else {
+		isSnapshot = snapshot != nil && snapshot.Command != ""


I wonder if we could clean this up by moving the snapshot setup logic to it's own struct/file. The logic right now is separated between setupInternalDirectories and setupDataDirs. Previously, setupDataDirs just called setupInternalDirectories for the sequencer and the validator, but now it looks like they're doing similar things and we could just combine the logic. For example, they both call CopyFromInitialSnapshot.

meyer9 · 2025-10-28T00:43:28Z

runner/service.go


+	// tracks persistent test directories for reuse_existing snapshots
+	// key: nodeType, value: map["sequencer"|"validator"] -> directory path
+	persistentTestDirs map[string]map[string]string


we could move this to the manager struct also (or move to the snapshot manager that already exists)

fix: Use separate snapshots for sequencer and validator workloads, up…

be8056f

…date yarn lock

tobidae-cb force-pushed the fix-snapshot-bug branch from fb6504b to be8056f Compare October 27, 2025 21:22

Add comma

be41d20

meyer9 reviewed Oct 28, 2025

View reviewed changes

tobidae-cb added 4 commits October 28, 2025 13:03

Move datadir management to separate module

eff0e15

Remove unused function

d871d09

Disable caching

6dcccce

Ensure snapshot exists for both roles

e566596

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: Use separate snapshots for sequencer and validator workloads #116

fix: Use separate snapshots for sequencer and validator workloads #116

tobidae-cb commented Oct 27, 2025

Uh oh!

cb-heimdall commented Oct 27, 2025 •

edited

Loading

Uh oh!

meyer9 left a comment

Uh oh!

meyer9 Oct 28, 2025

Uh oh!

tobidae-cb Oct 28, 2025

Uh oh!

meyer9 Oct 28, 2025

Uh oh!

tobidae-cb Oct 28, 2025

Uh oh!

meyer9 Oct 28, 2025

Uh oh!

meyer9 Oct 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

fix: Use separate snapshots for sequencer and validator workloads #116

Are you sure you want to change the base?

fix: Use separate snapshots for sequencer and validator workloads #116

Conversation

tobidae-cb commented Oct 27, 2025

Description

Testing

Uh oh!

cb-heimdall commented Oct 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🟡 Heimdall Review Status

Uh oh!

meyer9 left a comment

Choose a reason for hiding this comment

Uh oh!

meyer9 Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

tobidae-cb Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

meyer9 Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

tobidae-cb Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

meyer9 Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

meyer9 Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

cb-heimdall commented Oct 27, 2025 •

edited

Loading