Skip to content

CUTLASS 4.2.1

Latest
Compare
Choose a tag to compare
@hwu36 hwu36 released this 24 Sep 05:23
f3fde58

CuTe DSL

  • Bug fixings and improvements
    • Fixed an issue when running DSL codes with cuda-python 13.0
    • Fixed an issue when running inductor with DSL codes
    • Fixed an issue with unexpected logging when running DSL codes in FlashInfer
    • Fixed the issue reported in #2647
    • Fixed an issue when conditional define of variables outside of dynamic control flow

CUTLASS C++

  • Bypass EVT for nosmem blockwise kernels on Blackwell.
  • Rename cutlass/python/cutlass directory to cutlass/python/cutlass_cppgen.