Pretraining dataset generation #8

Open
dage0127 opened this issue Jul 24, 2024 · 2 comments

Comments

@dage0127

I couldn't find the script for generating the pretraining dataset. Was it left out of the code?

@wdndev
Owner

wdndev commented Jul 24, 2024

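The scripts for the open-source pretraining corpora are here: https://github.com/wdndev/tiny-llm-zh/tree/main/utils. Because some of the data used is not open source, the full generation script has not been published for now.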

@dage0127
Author

Thanks.
