-
Notifications
You must be signed in to change notification settings - Fork 91
Open
Description
Question from @SherinBojappa
In Table 13 of the OLMoE paper (https://arxiv.org/pdf/2409.02060), you report DCLM eval metrics for the checkpoints at 1.2M steps, 1.22M steps, and the annealed model. I was wondering whether you might be willing to share (or point me to) the corresponding evaluation metrics for earlier intermediate checkpoints of OLMoE-1B-7B.
We did not run the DCLM evals for other checkpoints I think but you should be able to run them easily by following these instructions: https://github.com/allenai/OLMoE?tab=readme-ov-file#after-pretraining & using the many pretraining ckpts at https://huggingface.co/allenai/OLMoE-1B-7B-0924/tree/main
Metadata
Metadata
Assignees
Labels
No labels