-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Issue]: Build errors for gfx90a (MI250) architecture #179
Comments
@OscarSavNS thanks for this report. we are currently in rapid development period, more focusing on |
@carlushuang Thanks for the quick answer! For now I've commented out the Aiter import in our branch of SGLang, as it's currently (commit 98c73d7) used in 2 places that don't seem to apply for MI250s: fp8 kernels, and an unquantized fused moe kernel (which I think also has MI300 specific operations). |
@OscarSavNS Thank you for the guide on how to build on MI250. Did you notice any speedups compared to using |
@WissamAntoun I haven't tried it! I've been puled away to other stuff for a bit, but am currently focusing on having a SGLang version that works for MI250 and MI300. I'll try that out for MI300 though and report back, thanks the recommendation! |
Problem Description
In building SGLang for the
gfx90a
(MI250s) architecture, it fails due to Aiter, even if we target thegfx90a
architecture. It seems due to inclusion of fp8 kernels in the build. Is there a flag I should be passing to disable all fp8, or some other set of arguments to allow the build to go forward for MI250s?Operating System
Ubuntu 22.04.5 LTS (Jammy Jellyfish)
CPU
AMD EPYC 7713 64-Core Processor
GPU
AMD Instinct MI250X/MI250 - amdgcn-amd-amdhsa--gfx90a:sramecc+:xnack-
ROCm Version
ROCm 6.3.3
ROCm Component
No response
Steps to Reproduce
In the aiter (commit) build, in l.60 of the
Dockerfile.rocm
, replacewith
The Dockerfile fails to build with command:
This is seemingly due to fp8 kernels being included which are not supported on
gfx90a
. Is there a flag I should be passing to disable allfp8
, or to have it run ongfx90
? It seems largely hardcoded in in some places (although admittedly the following example is for DeepSeek CSRC kernels):aiter/aiter/jit/optCompilerConfig.json
Line 342 in e12d350
(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support
I have multiple GPUs on this node, but here is one of them:
Additional Information
Full SGLang
Dockerfile.rocm
:The text was updated successfully, but these errors were encountered: