feat: Add deepseek
YeonwooSung committed Feb 3, 2025
1 parent 95060bf commit fc4fb9a
Showing 2 changed files with 34 additions and 0 deletions.
34 changes: 34 additions & 0 deletions LLMs/DeepSeek/README.md
@@ -0,0 +1,34 @@
# DeepSeek

![deepseek_1pager](./imgs/deepseek_1pager.jpg)

## DeepSeekMath

[paper](https://arxiv.org/abs/2402.03300)

- Introduce Group Relative Policy Optimization (GRPO), a variant of Proximal Policy Optimization (PPO) that improves mathematical reasoning while using less memory than PPO, because it estimates the baseline from a group of sampled outputs instead of training a separate critic model
- Harness the significant potential of publicly available web data through a meticulously engineered data selection pipeline

Trained on this carefully selected data and then tuned with GRPO, the 7B DeepSeekMath model is reported to outperform much larger models in mathematical reasoning, such as the 540B-parameter Minerva on the MATH benchmark.
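
The "group relative" part of GRPO can be illustrated with a minimal sketch: several answers are sampled for one prompt, and each answer's reward is normalized against the mean and standard deviation of its own group. The function name `group_relative_advantages`, the `eps` term, the use of the population standard deviation, and the reward values are assumptions made for illustration, not code from the paper.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against the mean and std of its own group.

    `eps` is an illustrative guard against division by zero when every
    sampled answer receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption for this sketch
    return [(r - mu) / (sigma + eps) for r in rewards]


# Four sampled answers to one prompt; 1.0 = correct, 0.0 = incorrect
# (made-up rewards, for illustration only).
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
print(advantages)
```

Answers that score above the group average get a positive advantage and the others a negative one, so no learned value function is needed to provide the baseline.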

## DeepSeek v3

TBD

## DeepSeek R1

[paper](https://arxiv.org/abs/2501.12948)

DeepSeek R1 is trained with Group Relative Policy Optimization (GRPO), a reinforcement learning technique that samples several candidate answers to the same prompt and scores each one relative to the rest of the group. This makes reasoning training more efficient, since no separate value model is required.
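
As a rough sketch of how the policy update uses those relative scores, the function below computes a PPO-style clipped surrogate over one group of sampled answers. It works at the sequence level and leaves out the KL penalty toward a reference policy that the GRPO objective also includes. The name `grpo_surrogate`, the default `clip_eps=0.2`, and the inputs are hypothetical choices for illustration.

```python
import math


def grpo_surrogate(logp_new, logp_old, advantages, clip_eps=0.2):
    """Mean clipped surrogate over a group of sampled answers (to be maximized).

    logp_new / logp_old: log-probability of each answer under the current
    and the sampling policy. The KL penalty is omitted in this sketch.
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(logp_new, logp_old, advantages):
        ratio = math.exp(lp_new - lp_old)  # pi_new(o|q) / pi_old(o|q)
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the clipped and unclipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


# With identical policies the ratio is 1, so the value is the mean advantage.
print(grpo_surrogate([-1.0, -2.0], [-1.0, -2.0], [1.0, -1.0]))
```

The clipping keeps a single update from moving the policy too far on answers whose probability has already changed a lot since sampling.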

Some facts about DeepSeek’s R1 model are as follows:

1. DeepSeek-R1 uses a Mixture-of-Experts (MoE) architecture with 671 billion total parameters, of which only 37 billion are activated per token.
2. Because the router activates only a small subset of experts for each token, compute per token scales with the activated parameters rather than the total.
3. The base model it builds on (DeepSeek-V3-Base) was pre-trained on 14.8 trillion tokens, reportedly spanning 52 languages.
4. DeepSeek-R1 was reportedly trained on about 2,000 Nvidia GPUs. By comparison, GPT-4 is estimated to have needed approximately 25,000 Nvidia GPUs over 90-100 days.
5. The model is reported to be 85-90% more cost-effective than competing models.
6. It excels in mathematics, coding, and reasoning tasks.
7. The model weights have been released openly under the MIT license.
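
To make the selective activation in the first two items concrete: 37B of 671B parameters is roughly 5.5% of the model per token. The sketch below shows generic top-k expert routing, in which only the selected experts are evaluated. It is not DeepSeek's actual router; `top_k_route`, `moe_forward`, the toy experts, and the router logits are all hypothetical.

```python
import math


def top_k_route(router_logits, k):
    """Pick the k highest-scoring experts and softmax-normalize their weights."""
    order = sorted(range(len(router_logits)),
                   key=lambda i: router_logits[i], reverse=True)
    top = order[:k]
    m = max(router_logits[i] for i in top)  # subtract max for stability
    exps = {i: math.exp(router_logits[i] - m) for i in top}
    z = sum(exps.values())
    return {i: e / z for i, e in exps.items()}


def moe_forward(x, experts, router_logits, k=2):
    """Evaluate only the routed experts and mix their outputs."""
    weights = top_k_route(router_logits, k)
    return sum(w * experts[i](x) for i, w in weights.items())


# Toy experts that just scale their input (illustrative only).
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]
router_logits = [0.1, 2.0, -1.0, 2.0]
y = moe_forward(1.0, experts, router_logits, k=2)
print(y)
```

Here two of the four experts run for the input, so the cost of the forward pass tracks the number of selected experts rather than the total number of experts.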


Binary file added LLMs/DeepSeek/imgs/deepseek_1pager.jpg
