OOM with batch size 1 with ViT-bigG on 40GB GPU #296
Comments
Weird. I once tested ViT-g-14 on an RTX 3090 (10G) and it worked; you could refer to that. Maybe you could try multiple machines.
Sorry, I mean
Sorry for the misunderstanding.
I think we've got two 'easy' options right now: DeepSpeed ZeRO (the PR for this, #264, might be worth testing) or PyTorch native FSDP. I was talking with someone close to TPUs & PyTorch XLA recently, and they were strongly recommending giving FSDP a try for large-scale runs (there's both an XLA-specific variant and the normal PyTorch one). Going full tensor parallelism is more work, and I feel things are about to change with upcoming native PyTorch features (compilation with annotations for parallelism) such that needing to do it Megatron-style will be a thing of the past.
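For reference, a minimal sketch of what wrapping the open_clip model in PyTorch native FSDP could look like. This is not the project's implementation; the wrap unit (`ResidualAttentionBlock` from `open_clip.transformer`), the model name, and `use_orig_params` (requires a recent PyTorch) are assumptions here.

```python
# Hypothetical FSDP sketch; block class and model name are assumptions, not from the issue.
import functools

import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP
from torch.distributed.fsdp.wrap import transformer_auto_wrap_policy

import open_clip
from open_clip.transformer import ResidualAttentionBlock  # assumed wrap unit

dist.init_process_group("nccl")
torch.cuda.set_device(dist.get_rank() % torch.cuda.device_count())

model, _, _ = open_clip.create_model_and_transforms("ViT-bigG-14")

# Shard at the transformer-block level so each rank holds only a slice of the
# parameters and gradients, instead of a full replica of the ~2.5B-param model.
wrap_policy = functools.partial(
    transformer_auto_wrap_policy,
    transformer_layer_cls={ResidualAttentionBlock},
)

fsdp_model = FSDP(
    model.cuda(),
    auto_wrap_policy=wrap_policy,
    use_orig_params=True,  # assumed; convenient when the optimizer expects named params
)
```

Something like this would be launched with `torchrun --nproc_per_node=<num_gpus> train.py`; the interesting knob is the wrap policy, since sharding granularity determines how much of the model each rank must materialize at once.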
Seems like progress is being made with FSDP, and we also think the OOM was due to model size plus activations.
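Since activations are part of the problem, gradient (activation) checkpointing is the usual complement to sharding: recompute block activations during backward instead of storing them. A minimal, self-contained sketch of the idea is below; the dimensions and block structure are illustrative only, not the actual ViT-bigG configuration, and if I recall correctly open_clip's training script exposes a similar switch (check for a `--grad-checkpointing` flag).

```python
# Illustrative activation-checkpointing sketch; dims/depth are placeholders.
import torch
from torch.utils.checkpoint import checkpoint


class Block(torch.nn.Module):
    def __init__(self, dim: int):
        super().__init__()
        self.mlp = torch.nn.Sequential(
            torch.nn.Linear(dim, 4 * dim),
            torch.nn.GELU(),
            torch.nn.Linear(4 * dim, dim),
        )

    def forward(self, x):
        return x + self.mlp(x)


class Tower(torch.nn.Module):
    def __init__(self, dim: int = 1024, depth: int = 48, grad_checkpointing: bool = True):
        super().__init__()
        self.blocks = torch.nn.ModuleList(Block(dim) for _ in range(depth))
        self.grad_checkpointing = grad_checkpointing

    def forward(self, x):
        for blk in self.blocks:
            if self.grad_checkpointing and self.training:
                # Drop intermediate activations and recompute them in backward,
                # trading extra compute for a large cut in activation memory.
                x = checkpoint(blk, x, use_reentrant=False)
            else:
                x = blk(x)
        return x
```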
Similar to #261, I'm getting OOM with batch size 1 on a 40GB GPU with ViT-G.
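A minimal single-GPU repro along these lines would exercise one forward/backward step and report peak memory; the model name, image size, and the dummy loss below are assumptions for illustration, not taken from the issue.

```python
# Hypothetical repro sketch: one train step at batch size 1, then peak memory.
import torch
import open_clip

model, _, _ = open_clip.create_model_and_transforms("ViT-bigG-14")
model = model.cuda().train()

images = torch.randn(1, 3, 224, 224, device="cuda")          # batch size 1
texts = open_clip.tokenize(["a photo of a cat"]).cuda()

image_features, text_features, logit_scale = model(images, texts)
loss = -(image_features * text_features).sum()                # dummy loss, just to trigger backward
loss.backward()

print(f"peak allocated: {torch.cuda.max_memory_allocated() / 2**30:.1f} GiB")
```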