Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEAT] Update MLA to use triton attention and include MI300X MoE config file #479

Merged

Conversation

tjtanaa
Copy link

@tjtanaa tjtanaa commented Mar 13, 2025

Update MLA to use triton attention and include MI300X MoE config file

Please direct your PRs to the upstream vllm (https://github.com/vllm-project/vllm.git)

Accepting PRs into the ROCm fork (https://github.com/ROCm/vllm) will require a clear previously communicated exception

@hongxiayang hongxiayang merged commit 69c1cd0 into ROCm:llama_fp8_12062024 Mar 13, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants