Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[New Model]: DeepSeek V3 / R1 #72

Open
Yikun opened this issue Feb 17, 2025 · 1 comment
Open

[New Model]: DeepSeek V3 / R1 #72

Yikun opened this issue Feb 17, 2025 · 1 comment
Assignees

Comments

@Yikun
Copy link
Collaborator

Yikun commented Feb 17, 2025

This issue tracks initial support for the Deepseek V3 model with vllm-ascend:

https://huggingface.co/deepseek-ai/DeepSeek-R1
https://huggingface.co/deepseek-ai/DeepSeek-V3

cc @wangxiyuan feel free to update any investigations

@Yikun
Copy link
Collaborator Author

Yikun commented Feb 18, 2025

For v0.7.1-dev: #68 #88

update (2025.02.19): #88 merged to v0.7.1-dev, DeepSeek test passed (via DeepSeek-V2-Lite), V3 arch same as V2 should also work, will backport to main soon.

Here is the note for DeepSeek-V2-Lite deploy: #112

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants