Skip to content

This is a small scale dataset for training and fine-tuning large models.

Notifications You must be signed in to change notification settings

Try-nothing/Small-scale-dataset-for-LLM-learning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 

Repository files navigation

Small-scale-dataset-for-LLM-learning

This is a small scale dataset for training and fine-tuning large models. https://pan.baidu.com/s/1E-D2baatTytqRo-38qbCqA?pwd=fypd 提取码: fypd

This dataset comprises Chinese and English novels harvested using web crawlers. It contains a total of 2,000 text files within the dataset, amounting to 1 gigabyte of data.

Due to the author's lack of expertise in uploading the dataset to Google Drive, please accept my apologies for any inconvenience.

If someone kindly uploads it to Google Drive, please do let me know.

Note that this dataset is intended solely for academic research and should not be used for commercial purposes.

About

This is a small scale dataset for training and fine-tuning large models.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published