Weird DDP RNG/seed behavior #13391
Unanswered
amit-miller
asked this question in DDP / multi-GPU / multi-node
Replies: 1 comment 1 reply
-
Yes, as you've noticed, it is expected: the DDP strategy resets the seed during its setup.
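For context, here is a rough sketch of what that reset amounts to (a paraphrase of the mechanism as an assumption, not the actual Lightning source): `seed_everything()` caches the seed in the `PL_GLOBAL_SEED` environment variable, and the strategy re-seeds every rank from that cached value during setup, which discards any RNG steps taken between `seed_everything()` and `fit()`.

```python
# Rough sketch of the seed reset the DDP strategy performs during setup
# (a paraphrase of the mechanism, not the actual Lightning source).
import os

import pytorch_lightning as pl


def reset_seed_sketch() -> None:
    # pl.seed_everything() stores the seed in PL_GLOBAL_SEED; re-seeding from
    # that cached value undoes any RNG calls made between seed_everything()
    # and fit().
    cached = os.environ.get("PL_GLOBAL_SEED")
    if cached is not None:
        pl.seed_everything(int(cached))
```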
-
Hi.
I'm trying to validate some DDP code. The aim is to show numeric identity when running DDP with more than one worker, compared to standard single-process mode ("Standard"). Batch size and other quantities have been adjusted to make sure that all else is equal.
Clearly, given the sampling involved, the state of the RNG is critical: if the RNG state differs between DDP and Standard modes, the results will differ.
Consider the following pseudo code:
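Roughly the following (a minimal sketch; `DemoModel`, `RandomDataset`, and the exact `Trainer` arguments are placeholders rather than the original code):

```python
import torch
import pytorch_lightning as pl
from torch.utils.data import DataLoader, Dataset


class RandomDataset(Dataset):
    """Deterministic dummy data so the dataset itself does not consume the RNG."""

    def __init__(self, n: int = 64):
        self.data = torch.arange(n * 32, dtype=torch.float32).view(n, 32)

    def __len__(self):
        return len(self.data)

    def __getitem__(self, idx):
        return self.data[idx]


class DemoModel(pl.LightningModule):
    def __init__(self):
        super().__init__()
        self.layer = torch.nn.Linear(32, 2)

    def shared_step(self, batch):
        # Print an RNG-dependent value to compare Standard vs. DDP runs.
        print("rng sample:", torch.rand(1).item())
        return self.layer(batch).sum()

    def training_step(self, batch, batch_idx):
        return self.shared_step(batch)

    def configure_optimizers(self):
        return torch.optim.SGD(self.parameters(), lr=0.1)


if __name__ == "__main__":
    pl.seed_everything(1)
    torch.rand(1)  # [*] step the RNG once before fit()
    # [**] adding pl.seed_everything(1) here makes both modes behave the same
    trainer = pl.Trainer(
        max_epochs=1,
        accelerator="cpu",
        devices=2,        # more than one worker for the DDP run
        strategy="ddp",   # omit strategy/devices for Standard mode
    )
    trainer.fit(DemoModel(), DataLoader(RandomDataset(), batch_size=8))
```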
Now, in Standard mode, stepping the RNG once, i.e. including the line marked with [*], changes the value printed in shared_step, as expected.
However, in DDP mode, including or omitting the line marked with [*] does NOT change the value printed by shared_step (this is, of course, reproduced deterministically on all workers).
This is rather surprising.
For my versions (PyTorch 1.12 / Lightning 1.6.3), adding another call to
pl.seed_everything(1)
at the line marked with [**] makes both Standard and DDP modes behave the same. This suggests that perhaps the fit() call in DDP mode is somehow applying a seed reset with a cached value.
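A quick check that supports this (assuming `seed_everything()` caches the seed in the `PL_GLOBAL_SEED` environment variable, as it does in Lightning 1.6.x):

```python
import os

import pytorch_lightning as pl

pl.seed_everything(1)
# The seed is cached here and re-applied by the DDP strategy during fit():
print(os.environ.get("PL_GLOBAL_SEED"))  # -> "1"
```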