-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
请问一下训练时间大概是多少? #5
Comments
看你的硬件和训练的token数量。 |
好的,了解了,感谢。 |
请问,下面这个训练时间是针对多大规模的模型,96M还是440M参数的模型。 |
92M的模型,不是440M和96M的,看你的训练资源, |
96M的参数规模,是不是2B左右的Tokens就可以了? |
@dage0127 尽可能多吧,我训练了40多B,才有这效果,还是有点差 |
另外,请教一下,这些数据是一次性加载进去训练的吗? |
@dage0127 使用MAP方式加载,不用一次把所有数据加载到内存。 |
非常感谢,我试试! |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
No description provided.
The text was updated successfully, but these errors were encountered: