From d42a979aa1ff6930475d05b868d0620c2daa3d12 Mon Sep 17 00:00:00 2001
From: YeonwooSung
Date: Sun, 16 Jun 2024 22:20:13 +0900
Subject: [PATCH] docs: Document llama

---
 Transformers/llama/README.md | 16 ++++++++++++++++
 1 file changed, 16 insertions(+)
 create mode 100644 Transformers/llama/README.md

diff --git a/Transformers/llama/README.md b/Transformers/llama/README.md
new file mode 100644
index 0000000..ba3ec07
--- /dev/null
+++ b/Transformers/llama/README.md
@@ -0,0 +1,16 @@
+# Llama
+
+[LLaMA](https://arxiv.org/abs/2302.13971) is a foundation language model from Meta.
+Its model weights are openly released.
+
+## Llama 3
+
+- 8B and 70B models are available, and a 400B-class model is still in training (as of 24.06.16)
+- The LLaMA 3 tokenizer uses a vocabulary of 128K tokens, whereas the LLaMA 2 tokenizer used a 32K-token vocabulary
+- The context window is 8192 tokens, versus 4096 tokens for LLaMA 2 and 2048 tokens for LLaMA 1
+- Uses grouped-query attention (GQA), which is more inference-efficient than standard multi-head attention
+
+## References
+
+- [LLaMA: Open and Efficient Foundation Language Models](https://arxiv.org/abs/2302.13971)
+- [Introducing Meta Llama 3: The most capable openly available LLM to date](https://ai.meta.com/blog/meta-llama-3/)