Using a Lance text dataset for pre-training or fine-tuning a Large Language Model is straightforward and memory-efficient. We'll use the wikitext_100K.lance dataset that we created in the Creating text dataset for LLM pre-training example to train a basic GPT-2 model from scratch using 🤗 transformers on a 1x A100 GPU. The WikiText dataset is a collection of over 100 million tokens extracted from the set of verified Good and Featured articles on Wikipedia.
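Below is a minimal sketch of how such a training setup might look: a PyTorch `Dataset` that reads fixed-size blocks of token ids directly from the Lance file on demand, feeding a randomly initialized GPT-2 model. The column name `input_ids`, the block size, and the `LanceTokenDataset` class are assumptions for illustration, not the exact code of the example.

```python
# Sketch only: assumes the Lance dataset stores one token id per row in a
# column named "input_ids" (as produced in the dataset-creation example).
import lance
import torch
from torch.utils.data import Dataset
from transformers import GPT2Config, GPT2LMHeadModel

class LanceTokenDataset(Dataset):
    """Reads contiguous blocks of token ids from a Lance dataset on demand."""

    def __init__(self, path: str, block_size: int = 1024):
        self.ds = lance.dataset(path)
        self.block_size = block_size
        # Total number of stored tokens; each training sample is one block.
        self.num_tokens = self.ds.count_rows()

    def __len__(self):
        return self.num_tokens // self.block_size

    def __getitem__(self, idx):
        start = idx * self.block_size
        indices = list(range(start, start + self.block_size))
        # `take` fetches only the requested rows, so the full dataset is
        # never loaded into memory at once.
        table = self.ds.take(indices, columns=["input_ids"])
        tokens = torch.tensor(table["input_ids"].to_numpy(), dtype=torch.long)
        # For causal LM training the labels are the input ids themselves.
        return {"input_ids": tokens, "labels": tokens.clone()}

# Train GPT-2 from scratch: random weights, default (124M-parameter) config.
config = GPT2Config()
model = GPT2LMHeadModel(config)
train_dataset = LanceTokenDataset("wikitext_100K.lance")
```

From here the `model` and `train_dataset` can be handed to a standard training loop or to the 🤗 `Trainer`; because each batch is read lazily from disk, peak memory stays bounded by the batch size rather than the dataset size.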