graph: enable quantized gated mlp dispatch by TaoLv · Pull Request #4838 · uxlfoundation/oneDNN

TaoLv · 2026-03-17T03:49:15Z

The example and the benchdnn test files are updated to use the f32 intermediate data type. (Needs Arch review.)
The graph backend gated mlp kernel now supports quantized inputs.
Quantized gated mlp patterns are dispatched to the quantized gated mlp primitive.

The dispatch is still disabled by default. To enable it, set _ONEDNN_GRAPH_GATED_MLP_FORCE_PRIMITIVE=0.
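For reviewers who want to try the new path locally, a minimal sketch of enabling it follows. The variable name and value come from the description above. The benchdnn binary location and the case file name are assumptions for illustration only and should be adjusted to your build tree and to the inputs added in this PR.

```shell
# Enable the quantized gated mlp dispatch (value taken from the PR description).
# The leading underscore suggests an internal, unsupported knob.
export _ONEDNN_GRAPH_GATED_MLP_FORCE_PRIMITIVE=0

# Hypothetical locations: override BENCHDNN and CASE to match your build and inputs.
BENCHDNN="${BENCHDNN:-./benchdnn}"
CASE="${CASE:-complex_fusion/mlp/gated-mlp-int4.json}"

if [ -x "$BENCHDNN" ]; then
    # Run a single graph case with the variable exported to the child process.
    "$BENCHDNN" --graph --case="$CASE"
else
    echo "benchdnn not found at $BENCHDNN; _ONEDNN_GRAPH_GATED_MLP_FORCE_PRIMITIVE=$_ONEDNN_GRAPH_GATED_MLP_FORCE_PRIMITIVE"
fi
```

Exporting the variable in the shell, rather than setting it inside the application, keeps it in place before the library is loaded; whether the library reads it once or on every partitioning call is not stated in this PR.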

TaoLv · 2026-03-17T04:09:27Z

make test
disable benchdnn_all
enable benchdnn_graph

TaoLv · 2026-03-18T05:24:26Z

make test
disable benchdnn_all
enable benchdnn_graph

TaoLv requested review from a team as code owners March 17, 2026 03:49

github-actions bot added the component:graph-api (Codeowner: @oneapi-src/onednn-graph), component:tests (Codeowner: @oneapi-src/onednn-arch), and component:examples labels Mar 17, 2026

TaoLv added 7 commits March 17, 2026 20:43

examples: graph: fix intermediate type for int4 gated mlp

884860b

graph: backend: dnnl: patterns: support typecast in quant gated mlp

03d6a88

benchdnn: inputs: graph: add cases for int4 gated mlp

7299aa9

graph: backend: dnnl: kernels: supports quantized gated mlp

ebf449b

graph: backend: dnnl: executables: prepare args for quantized gated mlp

cd17f82

graph: backend: dnnl: passes: fuse quantized gated mlp

826a827

graph: backend: dnnl: patterns: enable quantized gated mlp kernels

bd25ee1

TaoLv force-pushed the lvtao/main/quantized-gated-mlp branch from e437944 to bd25ee1 Compare March 18, 2026 03:43

rongzha1 approved these changes Mar 18, 2026

TaoLv wants to merge 7 commits into main from lvtao/main/quantized-gated-mlp

2 participants
