Conversation

@DorianZi
Collaborator

No description provided.

@DorianZi DorianZi requested a review from LLLLKKKK October 10, 2025 02:09
@CLAassistant

CLAassistant commented Oct 10, 2025

CLA assistant check
All committers have signed the CLA.

@DorianZi DorianZi changed the title optimizations for dense models on ROCM/AMD feat: optimizations for dense models on ROCM/AMD Oct 10, 2025
@amd-yilizhao amd-yilizhao force-pushed the develop/qwen3-rocm-main_more_opt branch from e4c77f2 to 688ee2d Compare October 13, 2025 05:59
@LLLLKKKK
Collaborator

Smoke tests need to be added.

@DorianZi DorianZi force-pushed the develop/qwen3-rocm-main_more_opt branch 4 times, most recently from 4544c0e to eea02b6 Compare October 15, 2025 07:58
@DorianZi
Collaborator Author

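Smoke tests need to be added.
Done

  1. Added tests for swizzle and fp8 attention to open_merge/204.
  2. The other optimizations (norm, attention, rotary embedding, etc.) are already enabled by default, so the existing smoke tests cover them.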

@DorianZi DorianZi force-pushed the develop/qwen3-rocm-main_more_opt branch 2 times, most recently from 90118ef to 6d3697a Compare October 16, 2025 05:46
@liaocz liaocz force-pushed the develop/qwen3-rocm-main_more_opt branch 3 times, most recently from f2d3c6c to 78652a1 Compare October 16, 2025 07:52
@DorianZi DorianZi force-pushed the develop/qwen3-rocm-main_more_opt branch from 78652a1 to 0390b98 Compare October 16, 2025 11:39
@LLLLKKKK LLLLKKKK enabled auto-merge (rebase) October 20, 2025 04:41
@hangy-amd hangy-amd force-pushed the develop/qwen3-rocm-main_more_opt branch from 2520022 to 6083468 Compare October 22, 2025 04:08
@LLLLKKKK LLLLKKKK merged commit 08ad962 into main Oct 23, 2025
7 of 10 checks passed