Skip to content

[BUG]是否支持 deepseek v2 (moe)系列的 一系列转换和训练 #166

@yiyepiaoling0715

Description

@yiyepiaoling0715

Describe the bug
A clear and concise description of what the bug is.

To Reproduce
Steps to reproduce the behavior:

Expected behavior
A clear and concise description of what you expected to happen.

Screenshots
If applicable, add screenshots to help explain your problem.

Additional context
Add any other context about the problem here.
文档里边仅看到llama系列的支持,想问下 deepsek v2 这种 moe系列的,是否存在问题

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions