Skip to content

Conversation

LucasWilkinson
Copy link
Collaborator

No description provided.

beginlner and others added 6 commits April 28, 2025 18:53
* Add more GPU architctures support

* Merge fmha and mla runner

* add varlen & non varlen support, and add incontiguous tensor support

* update readme

* add varlen api

---------

Co-authored-by: dianzhangc <[email protected]>
@hypdeb
Copy link

hypdeb commented Aug 20, 2025

Hey @LucasWilkinson, any insight on how we can resolve the failing sign-off checks? I have some work based on the latest version of FlashMLA that I would like to try out in vLLM, and so it would be convenient for me if this could be merged 😅 .

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

8 participants