-
Notifications
You must be signed in to change notification settings - Fork 561
update trtllm cutlass moe #2020
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weβll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
+3,272
β1,828
Merged
Changes from 32 commits
Commits
Show all changes
33 commits
Select commit
Hold shift + click to select a range
f109a2b
Fix aot failures
yongwww 3cfba8e
>launcher.inl
aleozlx 9047135
>generate_kernels.py
aleozlx d9d7723
>generate_kernels.py
aleozlx 3166245
>launcher.inl
aleozlx 4dce85b
>moe_gemm_kernels.h
aleozlx d47c865
cutlass_fused_moe_kernels.cuh is troublesome...
aleozlx 307fe30
fix compilation errors in cutlass_fused_moe_kernels.cuh
aleozlx 76a9220
>gather_tensor.hpp
aleozlx 96c0ed4
fix compilation errors
aleozlx af4036d
fix compilation error for sm120
yongwww a49d1fd
Add #if defined(ENABLE_FP4) guards
yongwww 13d8664
fix: use FLASHINFER_ENABLE_FP8_E8M0 guard for __nv_fp8_e8m0
yongwww 4f94bf0
fix build
yongwww eddb10b
fix aot errors
yongwww ddb1345
Merge remote-tracking branch 'origin/main' into feature/cutlass_moe_uβ¦
yongwww 2563556
fix stale sm100 configs
aleozlx bfe2852
Merge branch 'main' of https://github.com/flashinfer-ai/flashinfer inβ¦
aleozlx da54367
debug..
aleozlx a81fbd1
merge
nv-yunzheq 0bbab20
remove debug stdout
nv-yunzheq 0ad03f9
update incorrect comment
nv-yunzheq 39691b5
update default layout to use swizzled
nv-yunzheq 4a8c8cf
update format
nv-yunzheq f42af7c
update autotunner
nv-yunzheq 42de94e
update precomiit
nv-yunzheq b09efb0
update sm121 failure
nv-yunzheq 22b97b0
fix H100 unit test error
nv-yunzheq a30f033
address comments
nv-yunzheq 31b0df0
fix compilation error
nv-yunzheq 91a85ad
update compilation error
nv-yunzheq e15a96c
update to address comment
nv-yunzheq 33aec35
fix compilation error
nv-yunzheq File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Some comments aren't visible on the classic Files Changed page.
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
871 changes: 431 additions & 440 deletions
871
csrc/fused_moe/cutlass_backend/cutlass_fused_moe_kernels.cuh
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.