Describe the bug
The example prints 'This example must be compiled with sm100a and CUDA Toolkit 12.8 or later' even after configuring with the cmake command 'cmake $PATH -DCUTLASS_NVCC_ARCHS="100a" -DCUTLASS_ENABLE_GDC_FOR_SM100=1'.
I manually printed the following macros from the example:
--- Environment Check Status ---
CUTLASS_ARCH_MMA_SM100A_ENABLED: NOT DEFINED
__CUDACC_VER_MAJOR__: 13
__CUDACC_VER_MINOR__: 0
Detected CUDA Toolkit Version: 13.0
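For reference, a check like the one above could be sketched as follows. This is a minimal illustration, not the code from the example: the helper name `environment_report` is hypothetical, and it only reports what the preprocessor sees at the point where it is compiled. Note that it runs in host code, and nvcc defines `__CUDA_ARCH__` only during device compilation passes; if CUTLASS derives `CUTLASS_ARCH_MMA_SM100A_ENABLED` from such device-only macros (an assumption I have not verified), a host-side print would show it as undefined even when device code sees it.

```cpp
#include <sstream>
#include <string>

// Hypothetical helper: builds a report of the macros discussed in this issue.
// When compiled without nvcc and without the CUTLASS headers, none of the
// macros are defined, so each entry reads NOT DEFINED.
std::string environment_report() {
  std::ostringstream out;
  out << "--- Environment Check Status ---\n";

#if defined(CUTLASS_ARCH_MMA_SM100A_ENABLED)
  out << "CUTLASS_ARCH_MMA_SM100A_ENABLED: DEFINED\n";
#else
  out << "CUTLASS_ARCH_MMA_SM100A_ENABLED: NOT DEFINED\n";
#endif

  // __CUDACC_VER_MAJOR__ / __CUDACC_VER_MINOR__ are set by nvcc itself.
#if defined(__CUDACC_VER_MAJOR__) && defined(__CUDACC_VER_MINOR__)
  out << "__CUDACC_VER_MAJOR__: " << __CUDACC_VER_MAJOR__ << "\n";
  out << "__CUDACC_VER_MINOR__: " << __CUDACC_VER_MINOR__ << "\n";
#else
  out << "__CUDACC_VER_MAJOR__: NOT DEFINED\n";
  out << "__CUDACC_VER_MINOR__: NOT DEFINED\n";
#endif

  return out.str();
}
```

Calling `environment_report()` from the example's `main` and printing the result gives output in the same shape as the status block above.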
After I manually commented out all CUTLASS_ARCH_MMA_SM100A_ENABLED checks in the CUDA code, the example works.
So I guess CUTLASS_ARCH_MMA_SM100A_ENABLED is unnecessary when CUDA >= 12.8? If so, is there any chance of removing this check so that it does not block running the example from an unmodified checkout of the repo?