FSDP #310

Open
rom1504 opened this issue Dec 20, 2022 · 4 comments

@rom1504
Collaborator

rom1504 commented Dec 20, 2022

https://pytorch.org/docs/stable/fsdp.html

This should allow us to train bigger models.

It would be quite useful to look into.
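For a rough sense of why sharding helps with bigger models, here is a back-of-the-envelope sketch. It assumes mixed-precision Adam at 16 bytes per parameter (2 for fp16 weights, 2 for fp16 gradients, 12 for fp32 optimizer state, the accounting used in the ZeRO paper) and ignores activations and the temporary buffers needed when parameters are gathered. The parameter count and GPU count are hypothetical, picked only for illustration.

```python
def per_gpu_state_gib(num_params: int, num_gpus: int, sharded: bool) -> float:
    """Approximate per-GPU memory (GiB) for model states under mixed-precision Adam."""
    # Assumption: 2 bytes fp16 weights + 2 bytes fp16 grads + 12 bytes fp32 optimizer state.
    bytes_per_param = 2 + 2 + 12
    total_bytes = num_params * bytes_per_param
    if sharded:
        # Full sharding splits weights, grads and optimizer state evenly across GPUs.
        total_bytes = total_bytes / num_gpus
    return total_bytes / 2**30


# Hypothetical setup, not taken from this repository.
NUM_PARAMS = 1_000_000_000
NUM_GPUS = 8

replicated = per_gpu_state_gib(NUM_PARAMS, NUM_GPUS, sharded=False)
sharded = per_gpu_state_gib(NUM_PARAMS, NUM_GPUS, sharded=True)

print(f"replicated on every GPU: {replicated:.2f} GiB per GPU")
print(f"fully sharded:           {sharded:.2f} GiB per GPU")
```

Under these assumptions the per-GPU cost of model states shrinks in proportion to the number of GPUs, which is the headroom that would let a larger model fit. Real usage will be higher once activations and communication buffers are counted.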

@Quan-Sun
Contributor

Hi @rom1504, DeepSpeed may be another option for bigger models. It's also easy and effective to use (the PR for this is #264). It can be used with older versions of PyTorch, such as PyTorch <= 1.11.

@mehdidc
Contributor

mehdidc commented Dec 23, 2022

Hi, I am doing some tests with FSDP and have a basic version running. I will try to do a full run and see if we can reproduce some of the results with ViT-B/32.

@rom1504
Collaborator Author

rom1504 commented Jan 7, 2023
