Optimizations for write_cfg_data
#4569
Conversation
Before I forget, I think it would be a good idea to have the set of pending nodes made explicit and maintained as they are modified. This should introduce a major speed-up, since …
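For illustration only, a rough sketch of this kind of bookkeeping might look like the following; the names are hypothetical and do not correspond to the actual pyk/kontrol API. The idea is to update the pending set whenever nodes are added, expanded, or closed, rather than recomputing it by scanning every node.

```python
# Hypothetical sketch, not real pyk code: maintain the pending-node set as
# explicit state, updated on each modification instead of rebuilt by scanning
# all nodes on every query.
class PendingTracker:
    def __init__(self) -> None:
        self._pending: set[int] = set()

    def on_node_added(self, node_id: int) -> None:
        self._pending.add(node_id)

    def on_node_expanded(self, node_id: int, successor_ids: list[int]) -> None:
        # An expanded node is no longer pending; its new successors are.
        self._pending.discard(node_id)
        self._pending.update(successor_ids)

    def on_node_closed(self, node_id: int) -> None:
        # Covers stuck, vacuous, or otherwise terminal nodes.
        self._pending.discard(node_id)

    @property
    def pending(self) -> frozenset[int]:
        # Returns a snapshot of the pending IDs; no scan over all nodes needed.
        return frozenset(self._pending)
```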
This looks like a good improvement, but I think we might need to assess it on engagement code. I am testing it with intermittent writing to disk (every …
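As an illustration only (the actual interval is cut off above), intermittent writing could look like the sketch below; the 50-step interval is an arbitrary placeholder, not a value from this thread.

```python
# Hypothetical sketch of intermittent writing: flush proof data to disk only
# every WRITE_INTERVAL steps instead of after every step.
from typing import Callable

WRITE_INTERVAL = 50  # arbitrary placeholder value

def maybe_write(step: int, write_fn: Callable[[], None]) -> None:
    # Write on every WRITE_INTERVAL-th step (including step 0).
    if step % WRITE_INTERVAL == 0:
        write_fn()
```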
@PetarMax Thanks, that is a good suggestion for how we can solve the performance issues with the …
Edit: I think I misunderstood. The performance gain from this PR on the engagement code I tested was also greater than what I wrote here about the test I ran from kontrol's test suite, although I don't know the exact reason why. Maybe it's just because the individual node JSONs are larger for the Lido proofs. But I will still test and see whether these two techniques stack.
This looks good to me. @ehildenb, @tothtamas28?
Makes progress in fixing the slowdown as proofs get large.

I'll call the "sync time" the time between receiving a step result and starting to wait for the next step result on the master thread.

As proofs accumulate more and more nodes, calling `to_dict()` on every node gets progressively more expensive, and this was taking up a substantial portion of the sync time once proofs reached 500+ nodes.

Instead of generating the entire KCFG dict with `KCFG.to_dict()` before passing it to `KCFGStore.write_cfg_data()`, which discards all of that work anyway for the final product and replaces it with just a list of the node IDs, it is significantly faster to pass the list of node IDs directly and let the store use the KCFG object, which already has a dictionary storing nodes by ID, to find the vacuous and stuck nodes and to get the newly created nodes. This way we can call `to_dict()` on each node only when it is first created.
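A minimal sketch of the idea, assuming a simplified CFG and store; `SimpleCFG`, `SimpleStore`, and the two `write_cfg_data_*` methods are illustrative stand-ins, not the real `KCFG`/`KCFGStore` signatures in pyk:

```python
from __future__ import annotations

import json
from pathlib import Path
from typing import Any


class SimpleCFG:
    """Stand-in for a KCFG: nodes live in a dict keyed by node ID."""

    def __init__(self) -> None:
        self.nodes: dict[int, dict[str, Any]] = {}
        self.created: set[int] = set()  # IDs of nodes added since the last write

    def add_node(self, node_id: int, data: dict[str, Any]) -> None:
        self.nodes[node_id] = data
        self.created.add(node_id)

    def to_dict(self) -> dict[str, Any]:
        # Old behaviour: serialize every node on every write, O(total nodes).
        return {'nodes': [{'id': i, **d} for i, d in self.nodes.items()]}


class SimpleStore:
    def __init__(self, root: Path) -> None:
        self.root = root
        self.root.mkdir(parents=True, exist_ok=True)

    def write_cfg_data_old(self, cfg_dict: dict[str, Any]) -> None:
        # Old interface: receives the full dict, then discards the node bodies
        # and keeps only the list of node IDs in the top-level file.
        node_ids = [n['id'] for n in cfg_dict['nodes']]
        (self.root / 'cfg.json').write_text(json.dumps({'nodes': node_ids}))

    def write_cfg_data_new(self, cfg: SimpleCFG, node_ids: list[int]) -> None:
        # New interface: receives the CFG object and the node IDs directly,
        # and serializes only the nodes created since the last write.
        for node_id in sorted(cfg.created):
            data = {'id': node_id, **cfg.nodes[node_id]}
            (self.root / f'node_{node_id}.json').write_text(json.dumps(data))
        cfg.created.clear()
        # The top-level file only ever needed the list of node IDs.
        (self.root / 'cfg.json').write_text(json.dumps({'nodes': node_ids}))
```

In this sketch the per-write cost drops from serializing every node to serializing only the nodes created since the last write, plus a small top-level file listing the node IDs.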
To give an idea of a benchmark, I used `LoopsTest.test_sum_1000()` (linear and long-running) with `--max-depth 1` and `--max-iterations 1000`. Before this change it reaches 1000 iterations in 59:06, and after this change it does so in 40:31. Before the change, the sync time as this proof approached 1000 nodes ranged between about 3.4 and 4.2 seconds; after the change it ranged from about 1.39 to 1.54 seconds.

The big remaining chunk of sync time when the proof gets large seems to be in `get_steps()`.