This post claims SageAttention 1 should work: https://github.com/thu-ml/SageAttention/issues/234#issuecomment-3169590604