[Test execution error]: no kernel image is available for execution on the device #39

Open
RoadTLife opened this issue Feb 25, 2025 · 3 comments

Comments

RoadTLife commented Feb 25, 2025

Command run:
python tests/test_flash_mla.py
Error message:

b=128, s_q=1, mean_sk=4096, h_q=16, h_kv=1, d=576, dv=512, causal=True, varlen=False
CUDA error (csrc/flash_fwd_mla_metadata.cu:76): no kernel image is available for execution on the device

Environment:

  • CUDA version: 12.4
  • GPU: A100
@defei-coder

Only sm90 (Hopper+) is supported.
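
One way to check this before running the tests is to compare the GPU's compute capability with sm_90. The sketch below is only an illustration and is not part of this repository: `is_sm90`, `explain`, and `current_capability` are hypothetical helper names, and `current_capability` assumes PyTorch is installed (it returns `None` otherwise).

```python
def is_sm90(capability):
    """Return True if a (major, minor) compute capability is sm_90 (Hopper)."""
    return tuple(capability) == (9, 0)


def explain(capability):
    """Build a short message for a (major, minor) compute capability."""
    major, minor = capability
    if is_sm90(capability):
        return f"sm_{major}{minor}: matches the sm_90 build target"
    return f"sm_{major}{minor}: not sm_90, expect 'no kernel image is available'"


def current_capability():
    """Query the first visible GPU through PyTorch (assumed to be installed).

    Returns a (major, minor) tuple, or None if PyTorch or a CUDA device
    is not available.
    """
    try:
        import torch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    return torch.cuda.get_device_capability()
```

An A100 has compute capability 8.0, so `explain((8, 0))` takes the "not sm_90" branch, which is consistent with the error reported above. An H100 has compute capability 9.0.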

@RoadTLife
Author

What is it about this library that restricts it to H-series GPUs?

@defei-coder

This repo uses features that only sm90 supports, such as WGMMA, and it is built only for the sm_90 arch.
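
The "no kernel image" error follows from how CUDA matches compiled kernel images to a device. The sketch below is a simplified model of that matching, not the driver's actual logic. It assumes the usual binary compatibility rules: a cubin built for `sm_XY` runs on devices with the same major version and an equal or higher minor version, and an architecture-specific target such as `sm_90a` runs only on that exact architecture. PTX JIT compilation is ignored. `parse_arch` and `has_kernel_image` are hypothetical names, and the exact build flags this repository uses are not shown in this thread, so the target lists below are illustrative.

```python
def parse_arch(arch):
    """Split a target such as 'sm_90' or 'sm_90a' into (major, minor, arch_specific)."""
    if not arch.startswith("sm_"):
        raise ValueError(f"unexpected target: {arch}")
    digits = arch[len("sm_"):]
    arch_specific = digits.endswith("a")
    if arch_specific:
        digits = digits[:-1]
    # The last digit is the minor version, the rest is the major version.
    return int(digits[:-1]), int(digits[-1]), arch_specific


def has_kernel_image(built_archs, device):
    """Return True if any built target can execute on a (major, minor) device."""
    dev_major, dev_minor = device
    for arch in built_archs:
        major, minor, arch_specific = parse_arch(arch)
        if arch_specific:
            # Architecture-specific targets need an exact match.
            if (major, minor) == (dev_major, dev_minor):
                return True
        elif major == dev_major and minor <= dev_minor:
            return True
    return False
```

Under this model, `has_kernel_image(["sm_90"], (8, 0))` is false for an A100, while `has_kernel_image(["sm_90"], (9, 0))` is true for a Hopper GPU. Rebuilding for `sm_80` would not help here, because WGMMA instructions do not exist on that architecture.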


2 participants