Fix: Verify scales are not None for Cutlass FP8 FusedMoE (#1961)

amirkl94 · web-flow · commit 99657eda71f2 · 2025-10-27T14:02:49.000-07:00
## 📌 Description
Verify quant scales for fp8 are non null in cutlass FusedMoE path.
Currently, if these tensors are passed as None from python it will
result in segmentation fault.

&lt;!-- This is an auto-generated comment: release notes by coderabbit.ai
--&gt;
## Summary by CodeRabbit

* **Bug Fixes**
* Enhanced validation for FP8 quantization parameters to improve system
robustness and prevent potential null reference issues during
quantization operations, reducing the risk of runtime errors when
processing quantized model data.
&lt;!-- end of auto-generated comment: release notes by coderabbit.ai --&gt;

---------

Signed-off-by: Amir Klein &lt;203507526+amirkl94@users.noreply.github.com&gt;
diff --git a/csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_sm100_binding.cu b/csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_sm100_binding.cu
@@ -822,6 +822,13 @@ class FusedMoeRunner : public tvm::ffi::ModuleObj {
       auto const fc2_dequant = quant_scales.value()[2];
       auto const fc1_input_dequant = quant_scales.value()[3];
 
+      TVM_FFI_ICHECK(fc1_dequant.get() != nullptr) << "Expecting fc1_dequant to be non null";
+      TVM_FFI_ICHECK(fc2_quant.get() != nullptr) << "Expecting fc2_quant to be non null";
+      TVM_FFI_ICHECK(fc2_dequant.get() != nullptr)
+          << "Expecting fc2_dequant_dequant to be non null";
+      TVM_FFI_ICHECK(fc1_input_dequant.get() != nullptr)
+          << "Expecting fc1_input_dequant to be non null";
+
       // Check types
       CHECK_INPUT_TYPE(fc1_dequant, dl_float32);
       CHECK_INPUT_TYPE(fc2_quant, dl_float32);