Fix max rollback and lockstep handling #88

caspark · 2024-12-14T13:05:08Z

NB: This builds on PR #82 so we better merge that one first to make this diff easier to review. (is Github working on stacked diff support yet?)

This fixes several issues that (I am fairly certain) came from merging #79:

When at the limit of the prediction window, rolling back would crash. This is fairly easy to trigger by adding a lot of latency in a network simulator. The cause was an off by one error in SavedStates' calculation of how many states need to be saved.
Clearer logic controlling when it's okay to request advancing the frame. I am not 100% certain but I suspect that the Fix/Feature: Lockstep #79 logic wasn't handling NULL_FRAME correctly? In any case I think this new structure makes it a lot easier to reason about.
Lockstep mode specific stuff:
- Document lockstep mode as first class thing.
- Fix disconnect handling in lockstep mode; previously the game would crash if a player disconnects (trying to rollback to the current frame, or perhaps the frame before the current? I can't quite remember). The fix is to skip all the "should we roll back" logic entirely if lockstep mode is on.
- Make lockstep mode not issue save requests ever, since that's pointless - and besides, the fix for the previous bullet makes this almost necessary.
  - Because of this, we also need to turn off sparse saving mode in lockstep mode, since sparse saving mode somewhat confusingly causes us to not mark frames as confirmed, and not having frames confirmed means we won't advance.
  - Update example to more easily catch the case where lockstep mode is on but somehow we got issued a save or rollback request, by panicking in that case.

I have tested manually with 2 and 3 player example games, with lockstep mode and 8 frames of prediction and with sparse saving on and off. In a previous version of these fixes (those on the main branch of my personal fork) I also did a lot of testing with high latency and such to verify the "rollback when at limit of prediction window" fix, but in the name of full disclosure, I haven't done that with this branch specifically (I have a smallish test harness that makes it easy to do that testing with my game and my ggrs fork).

Anyway, based on that testing, everything seems to work.. but this is fiddly stuff so 🤞. I am at least pretty confident that it's more reliable than what's in main currently :)

caspark · 2024-12-14T13:09:53Z

src/sessions/p2p_session.rs

+            // in lockstep mode, saving will never happen, but we use the last saved frame to mark
+            // control marking frames confirmed, so we need to turn off sparse saving to ensure that
+            // frames are marked as confirmed - otherwise we will never advance the game state.
+            false


Ideally we'd log a warning (or at least at debug level) here to let the user know their sparse saving request is being ignored, but we don't have a logging framework right now.

Edit: raised #89 to discuss adding some logging.

gschup · 2024-12-14T16:18:51Z

this looks good, but the commit history needs to be cleaned up. thanks for all the good work on ggrs :)

As a result of gschup#79, the synclayer and the logic in p2psession's advance_frame() were off by one in their agreement on the number of world states that needed to be stored. This fixes that so that they agree.

By making sure that in lockstep mode we never issue save or load requests, we avoid various edge cases, such as trying to rollback to the frame where the disconnect happened when we don't actually have any historical game states stored. However, we do need to make sure that sparse saving is off, otherwise the game won't advance, because with sparse saving we'll end up never marking a frame as confirmed. This also cleans up some convoluted logic for deciding whether to advance frames, which was also possibly incorrectly handling the case where no frames were yet confirmed. Lastly, we document lockstep mode as a first class thing.

caspark · 2024-12-14T23:50:03Z

Oh, sorry, I had actually squashed some commits locally to remove the WIP and dead end commits, but I forgot to actually push that! Have rebased onto main again now.

caspark commented Dec 14, 2024

View reviewed changes

caspark mentioned this pull request Dec 14, 2024

Use with no predictions #60

Closed

caspark added 3 commits December 15, 2024 07:47

fix: sync layer stores enough world states to roll back fully

dc25e8d

As a result of gschup#79, the synclayer and the logic in p2psession's advance_frame() were off by one in their agreement on the number of world states that needed to be stored. This fixes that so that they agree.

Block saves and rollbacks in example game in lockstep mode

dd8b30d

caspark force-pushed the fix-max-rollback-and-disconnects-and-lockstep branch from e837824 to dd8b30d Compare December 14, 2024 23:49

gschup merged commit c2814a1 into gschup:main Dec 15, 2024
2 checks passed

caspark mentioned this pull request Dec 15, 2024

Bevy 0.15 and latest ggrs main gschup/bevy_ggrs#114

Merged

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix max rollback and lockstep handling #88

Fix max rollback and lockstep handling #88

caspark commented Dec 14, 2024

caspark Dec 14, 2024 •

edited

Loading

gschup commented Dec 14, 2024

caspark commented Dec 14, 2024

Fix max rollback and lockstep handling #88

Fix max rollback and lockstep handling #88

Conversation

caspark commented Dec 14, 2024

caspark Dec 14, 2024 • edited Loading

Choose a reason for hiding this comment

gschup commented Dec 14, 2024

caspark commented Dec 14, 2024

caspark Dec 14, 2024 •

edited

Loading