Add support for Apple's DCLM-Baseline-7B model #32000

Open
2 tasks done
minipasila opened this issue Jul 16, 2024 · 3 comments

Comments

@minipasila

Model description

This is a new model released by Apple. It is built on a new framework called "open_lm", so it does not currently work with Hugging Face Transformers.

Open source status

  • The model implementation is available
  • The model weights are available

Provide useful links for the implementation

https://huggingface.co/apple/DCLM-Baseline-7B Link to the model weights.
https://github.com/mlfoundations/open_lm Link to the model framework.
https://github.com/mlfoundations/dclm Link to the DCLM repository.
https://arxiv.org/abs/2406.11794 Link to the paper.

@amyeroberts
Collaborator

cc @ArthurZucker

@shauray8
Contributor

Hey @amyeroberts, I can have a look at this if it's one of the projected additions to the library.

@stephanpeitz

This seems to be a decent workaround: mlfoundations/open_lm#291
