
[RELAX][BYOC] OpenCLML offload support for Relax #17654

Merged
11 commits merged into apache:main on Feb 19, 2025

Conversation

srkreddy1238
Contributor

This brings in OpenCLML offloading via the BYOC path for the operators available in Relax.
It adds codegen tests for the mainline CI.
It also brings in pipeline definitions for Adreno targets.


# Verify codegen
clml_mod = OpenCLMLOffLoad()(clml_mod)
verify_codegen(clml_mod, clml_codegen)
Contributor Author

@srkreddy1238 srkreddy1238 Feb 14, 2025


We do the codegen check here (a JSON comparison).

clml_mod = OpenCLMLOffLoad()(clml_mod)
verify_codegen(clml_mod, clml_codegen)

# On Mainline CI
Contributor Author


Mainline CI will not have RPC, hence we don't proceed beyond this point.
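The pattern described above can be sketched as a guard in the test: run the JSON codegen comparison unconditionally, then bail out before any on-device step when no RPC tracker is configured. This is a hypothetical illustration, not the PR's actual code; using `TVM_TRACKER_HOST` as the signal follows TVM's RPC tracker convention but is an assumption of this sketch.

```python
import os

def rpc_available():
    # Hypothetical guard for the note above: mainline CI has no RPC
    # tracker, so runtime verification stops after the codegen check.
    # Keying off TVM_TRACKER_HOST is an assumption of this sketch.
    return bool(os.environ.get("TVM_TRACKER_HOST"))

# In a test: do the codegen (JSON) comparison first, then bail out
# before any on-device compile/run when no tracker is configured.
if not rpc_available():
    print("no RPC tracker configured; skipping on-device run")
```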

@srkreddy1238
Contributor Author

@Hzfengsy for the OptimizeBatchnorm pass:

DecomposeOpsForInference couldn't help here, as it results in Conv2D plus a few elementwise ops.

OptimizeBatchnorm folds the batchnorm attributes into the Conv2D weight and bias, which we can then offload as CLML's fused Conv2D+Bias op.
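The arithmetic behind such a fold can be illustrated in numpy (this is only a sketch of the math; the actual pass operates on Relax IR, and `fold_batchnorm` is a hypothetical helper name). For each output channel, the batchnorm scale `gamma / sqrt(var + eps)` is multiplied into the conv filter, and the mean/beta terms collapse into the bias:

```python
import numpy as np

def fold_batchnorm(weight, bias, gamma, beta, mean, var, eps=1e-5):
    """Fold BatchNorm parameters into a Conv2D weight/bias (OIHW layout).

    Illustrative sketch of the arithmetic only; the TVM pass rewrites
    Relax IR rather than numpy arrays.
    """
    scale = gamma / np.sqrt(var + eps)           # per-output-channel scale
    w = weight * scale[:, None, None, None]      # scale each output filter
    b = (bias - mean) * scale + beta             # fold shift into the bias
    return w, b

# Sanity check: conv-then-BN equals conv with the folded parameters.
# For an all-ones input patch, the conv output per channel is just the
# filter sum plus bias, which makes the identity easy to verify.
rng = rng_state = np.random.default_rng(0)
O, I = 4, 3
weight = rng.standard_normal((O, I, 3, 3))
bias = rng.standard_normal(O)
gamma, beta = rng.standard_normal(O), rng.standard_normal(O)
mean, var = rng.standard_normal(O), rng.random(O) + 0.5

w2, b2 = fold_batchnorm(weight, bias, gamma, beta, mean, var)
y = weight.sum(axis=(1, 2, 3)) + bias                      # conv output
bn_y = gamma / np.sqrt(var + 1e-5) * (y - mean) + beta     # then batchnorm
folded_y = w2.sum(axis=(1, 2, 3)) + b2                     # folded conv
assert np.allclose(bn_y, folded_y)
```

This is exactly what allows the result to be offloaded as a single fused Conv2D+Bias op.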

@Hzfengsy
Member

Thanks for pinging me, @srkreddy1238.

OptimizeBatchnorm folds the batchnorm attributes into the Conv2D weight and bias, which we can then offload as CLML's fused Conv2D+Bias op.

IIUC it works as a CLML-specific pass. If so, could you please move it under the adreno or clml folders, or at least add a comment saying that it works only for CLML.

@srkreddy1238
Contributor Author

IIUC it works as a CLML-specific pass.

This is not CLML-specific. It can be used for fusing Conv2D+BN -> Conv2D (with updated weight and bias) in the inference case.

@srkreddy1238 force-pushed the clml_relax branch 2 times, most recently from 0b220c8 to d65a89a on February 17, 2025 at 08:46
@tqchen
Member

tqchen commented Feb 17, 2025

This is not CLML-specific. It can be used for fusing Conv2D+BN -> Conv2D (with updated weight and bias) in the inference case.

A note on this pass: it is an optimization that can indeed apply more broadly. It is also a special case of the FoldScaleAxis optimization, which folds scales into weights, so we want to add a comment here that it can be replaced by the general FoldScaleAxis in the future. cc @Hzfengsy @srkreddy1238
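The relation to FoldScaleAxis can be seen from the underlying identity: a per-output-channel scale applied after a linear op can always be pre-multiplied into that op's weights. A numpy sketch (a 1x1 "conv" written as a matrix for brevity, not the pass itself):

```python
import numpy as np

# Scale-into-weights identity that FoldScaleAxis exploits in general:
#   scale[o] * sum_i(W[o, i] * x[i]) == sum_i((scale[o] * W[o, i]) * x[i])
# The batchnorm fold above is this identity with scale = gamma / sqrt(var + eps).
rng = np.random.default_rng(1)
W = rng.standard_normal((4, 3))      # weights of a linear/1x1-conv op
x = rng.standard_normal(3)           # input
scale = rng.standard_normal(4)       # per-output-channel scale

assert np.allclose(scale * (W @ x), (scale[:, None] * W) @ x)
```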

@tqchen
Member

tqchen commented Feb 17, 2025

Thanks @srkreddy1238 for the great effort; glad to see that the new target-aware pipeline helps to simplify the flow here. Also cc @MasterJH5574.

@tqchen tqchen merged commit cc2f079 into apache:main Feb 19, 2025
15 checks passed
@tqchen
Member

tqchen commented Feb 19, 2025

Thanks @srkreddy1238 this is merged!

@srkreddy1238
Contributor Author

The codegen tests are added under unity->gpu, but I see that this CI pipeline only builds the GPU image and doesn't run the tests. Is this expected to be enabled soon?

@tqchen
Member

tqchen commented Feb 19, 2025

https://github.com/apache/tvm/blob/main/ci/jenkins/unity_jenkinsfile.groovy#L356 — the build stage of the unity pipeline does run the relax tests.

We can also follow up to fold the unity pipeline into the normal GPU pipeline.
