introduce "--webnn-ort-ov-gpu-precision" to specify gpu precision #162

mingmingtasd · 2025-02-26T07:37:24Z

Change the --webnn-ort-use-ov-gpu-fp32 flag to --webnn-ort-ov-gpu-precision

You can specify the flag to use FP32 or ACCURACY inference precision:
Usage1: --webnn-ort-ov-gpu-precision=FP32
Usage2: --webnn-ort-ov-gpu-precision=ACCURACY

--webnn-ort-ov-gpu-precision=ACCURACY will let OV EP GPU use ACCURACY execution mode. As my observation, this mode will not decrease the SD Turbo's inference time a lot, but can increase the pass rate of many ops' wpt tests (resolve the accuracy failure). We can test more with this flag.

see https://github.com/microsoft/onnxruntime/blob/main/onnxruntime/core/providers/openvino/backends/basic_backend.cc#L167

PTAL, thanks!
cc/ @BruceDai @lisa0314 @miaobin

huningxin

LGTM with comments

huningxin · 2025-02-26T08:34:18Z

services/webnn/webnn_switches.h

+// the flag to use FP32 or ACCURACY inference precision to get better accuracy,
+// but it may result in decreased performance.
+// Usage1: --webnn-ort-ov-gpu-precision=FP32
+// Usage2: --webnn-ort-ov-gpu-precision=ACCURACY


You may want to explain what the underlying setting to OV execution mode when specifying "FP32" or "ACCURACY". Adding a pointer to OV documentation would be helpful.

huningxin · 2025-02-26T08:35:38Z

services/webnn/ort/graph_impl_ort.cc

@@ -30,6 +30,18 @@ namespace webnn::ort {

 namespace {

+// These keys and values must align with the implementation of the ORT OpenVINO


Could you please point out the valid keys and values available in OV EP?

introduce "--webnn-ort-ov-gpu-precision" to specify gpu precision

64749ed

mingmingtasd requested review from huningxin and shiyi9801 February 26, 2025 07:37

huningxin approved these changes Feb 26, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

introduce "--webnn-ort-ov-gpu-precision" to specify gpu precision #162

introduce "--webnn-ort-ov-gpu-precision" to specify gpu precision #162

mingmingtasd commented Feb 26, 2025 •

edited

Loading

huningxin left a comment

huningxin Feb 26, 2025

huningxin Feb 26, 2025

		@@ -30,6 +30,18 @@ namespace webnn::ort {

		namespace {

		// These keys and values must align with the implementation of the ORT OpenVINO

introduce "--webnn-ort-ov-gpu-precision" to specify gpu precision #162

Are you sure you want to change the base?

introduce "--webnn-ort-ov-gpu-precision" to specify gpu precision #162

Conversation

mingmingtasd commented Feb 26, 2025 • edited Loading

huningxin left a comment

Choose a reason for hiding this comment

huningxin Feb 26, 2025

Choose a reason for hiding this comment

huningxin Feb 26, 2025

Choose a reason for hiding this comment

mingmingtasd commented Feb 26, 2025 •

edited

Loading