Issues: vllm-project/vllm-ascend

Add a tutorial for Qwen 2.5-VL
documentation, help wanted
#75 opened Feb 17, 2025 by Yikun

Problem running v0.7.1
documentation
#56 opened Feb 13, 2025 by hz0ne

Add issue template
documentation
#48 opened Feb 12, 2025 by Yikun

Abnormal First Token Output on 910B GPU during Inference
bug
#46 opened Feb 11, 2025 by Jozenn

Does this project support the deployment of deepseek-v3 and deepseek-r1 on Ascend?
new model
#39 opened Feb 11, 2025 by Kangzf1996

Question about the difference in inference results between NPU and GPU
#31 opened Feb 11, 2025 by AIR-hl

Add doc for benchmark and profiling on Ascend NPU
documentation
#26 opened Feb 10, 2025 by Yikun

Glad to see that vLLM has officially added support for the Ascend hardware backend!
#1 opened Jan 29, 2025 by liaoyanqing666