LLaMA

This repository is intended as a minimal, hackable and readable example to load LLaMA (arXiv) models and run inference. In order to download the checkpoints and tokenizer, fill this google form

Setup

conda install -r requirements.txt

Then, in this repository

pip install -e .

To run:

torchrun --nproc_per_node $MP example.py --ckpt_dir $TARGET_FOLDER/model_size --tokenizer_path $TARGET_FOLDER/tokenizer.model

Different models require different MP values:

Model	MP
7B	1
13B	2
33B	4
65B	8

Model Card

See MODEL_CARD.md

License

See the LICENSE file.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

LLaMA

Setup

Model Card

License

Files

README.md

Latest commit

History

README.md

File metadata and controls

LLaMA

Setup

Model Card

License