[tensor-descriptor]: Extend support when tensor descriptor created in control flow #4152

etiotto · 2025-05-09T18:07:34Z

Enhance layout propagation and tensor descriptor lowering to support cases where descriptors or pointers are created within control flow constructs.

…ad/descriptor_store operation that uses it Signed-off-by: Tiotto, Ettore <[email protected]>

Signed-off-by: Tiotto, Ettore <[email protected]>

mfrancepillois

The code LGTM, but I'm not sure to understand why we need this PR if the conversion from tensor_descriptor to block_pointer is one of the first passes to run at the ttir level, how can we have tensor_descriptor ops with different layouts given the encodings are assigned to ops when lowering from the ttir level to the ttgir level.
As I understand it, we are not supposed to have tensor_descriptor ops after translating to ttgir dialect, no?

kurapov-peter · 2025-05-14T11:06:40Z

BTW FYI, the existing TMA lowering expects the tensor descriptor to always have a layout.

…o_block_ptr.1

Signed-off-by: Tiotto, Ettore <[email protected]>

PR WIP

…o_block_ptr.1

Signed-off-by: Tiotto, Ettore <[email protected]>

etiotto · 2025-05-15T21:09:40Z

The code LGTM, but I'm not sure to understand why we need this PR if the conversion from tensor_descriptor to block_pointer is one of the first passes to run at the ttir level, how can we have tensor_descriptor ops with different layouts given the encodings are assigned to ops when lowering from the ttir level to the ttgir level. As I understand it, we are not supposed to have tensor_descriptor ops after translating to ttgir dialect, no?

I have updated the code quite a bit to change the loginc used to do the translation. The pass is now simpler and able to handle tesnor descriptors created in control flow (i.e. a branch) inside a loop.

Signed-off-by: Tiotto, Ettore <[email protected]>

etiotto · 2025-05-20T16:24:18Z

No performance degradation in Triton benchmarks: https://github.com/intel-sandbox/applications.python.intel-xpu-backend-for-triton.infrastructure/actions/runs/15142551794/job/42570177577

Copilot

Pull Request Overview

Enhance layout propagation and tensor descriptor lowering to support cases where descriptors or pointers are created within control flow constructs.

Add updateAdvanceOpChain to recursively update chains of AdvanceOp users.
Refactor rewriteStoreOp to trace back through AdvanceOp chains before creating new MakeTensorPtrOp.
Rewrite the TensorDesc-to-block-pointer pass to drop old descriptor lookup, always find or create MakeTensorPtrOp, and handle loop-carried block pointer types.

Reviewed Changes

Copilot reviewed 2 out of 5 changed files in this pull request and generated 2 comments.

File	Description
third_party/intel/lib/TritonIntelGPUTransforms/RemoveLayoutConversions.cpp	Added recursive chain update, improved `rewriteStoreOp`, and added verification asserts after each transformation stage.
third_party/intel/lib/Dialect/Triton/Transforms/TensorDescToBlockPointer.cpp	Removed legacy descriptor lookup, consolidated pointer creation via `findOrCreateMakeTensorPtr`, replaced offset logic, and updated loop argument types.

Files not reviewed (3)

test/Triton/Intel/TensorDescToBlockPointer/basic.mlir: Language not supported
test/Triton/Intel/TensorDescToBlockPointer/loop.mlir: Language not supported
test/TritonIntelGPU/backward_combine_dpas_dot_layout.mlir: Language not supported

Comments suppressed due to low confidence (1)

third_party/intel/lib/TritonIntelGPUTransforms/RemoveLayoutConversions.cpp:793

The variable value is undefined in this scope, leading to a compile error. It should reference the store's value (e.g., storeOp.getValue()) or the converted value extracted earlier.

Value dataToStore = getValueAs(value, encoding);

third_party/intel/lib/TritonIntelGPUTransforms/RemoveLayoutConversions.cpp

third_party/intel/lib/Dialect/Triton/Transforms/TensorDescToBlockPointer.cpp

test/Triton/Intel/TensorDescToBlockPointer/basic.mlir

third_party/intel/lib/Dialect/Triton/Transforms/TensorDescToBlockPointer.cpp

third_party/intel/lib/TritonIntelGPUTransforms/RemoveLayoutConversions.cpp

…o_block_ptr.1

Signed-off-by: Tiotto, Ettore <[email protected]>

…o_block_ptr.1

Signed-off-by: Tiotto, Ettore <[email protected]>

etiotto · 2025-05-22T18:19:51Z

@whitneywhtsang I have split this pull request and move the part that deals with removing layout conversion for store operation that use a block ptr updated by a tt.advance operation in PR #4277

Signed-off-by: Tiotto, Ettore <[email protected]>

whitneywhtsang · 2025-05-22T18:31:12Z

@whitneywhtsang I have split this pull request and move the part that deals with removing layout conversion for store operation that use a block ptr updated by a tt.advance operation in PR #4277

Thanks, this part of the code LGTM.

Ensure block ptr is created with the same layout as the descriptor_lo…

4384ad1

…ad/descriptor_store operation that uses it Signed-off-by: Tiotto, Ettore <[email protected]>

etiotto requested a review from Copilot May 9, 2025 18:07

etiotto self-assigned this May 9, 2025

This comment was marked as outdated.

Sign in to view

Remove naked print and unnecessary headers

f0ce91c

Signed-off-by: Tiotto, Ettore <[email protected]>

etiotto requested review from alexbaden, whitneywhtsang, chengjunlu and a team May 9, 2025 18:24

etiotto linked an issue May 9, 2025 that may be closed by this pull request

[tensor-descriptor]: Improve translation for make_tensor_ptr operations in control flow #4132

Open

etiotto marked this pull request as ready for review May 9, 2025 18:28

chengjunlu approved these changes May 9, 2025

View reviewed changes

mfrancepillois approved these changes May 12, 2025

View reviewed changes

whitneywhtsang previously approved these changes May 12, 2025

View reviewed changes

etiotto added 3 commits May 14, 2025 20:09

Merge remote-tracking branch 'origin/main' into etiotto.tensor_desc_t…

04f1c1d

…o_block_ptr.1

WIP: TensorDescToBlockPtr updates

3543e6e

Signed-off-by: Tiotto, Ettore <[email protected]>

WIP: RemoveLAuoyutConversion improvement for tt.advance operation

38ef6c3

Signed-off-by: Tiotto, Ettore <[email protected]>

etiotto added 2 commits May 15, 2025 19:32

Merge remote-tracking branch 'origin/main' into etiotto.tensor_desc_t…

e4d5d7d

…o_block_ptr.1

WIP: TensorDescToBlockPtr updates

9eb16ec

Signed-off-by: Tiotto, Ettore <[email protected]>

etiotto changed the title ~~Created block_ptr with the layout of the descriptor load/store operation~~ [tensor-descriptor]: Extend support when tensor descriptor created in control flow May 15, 2025

etiotto requested a review from whitneywhtsang May 15, 2025 21:09

etiotto added 4 commits May 15, 2025 21:18

WIP: TensorDescToBlockPtr updates

7f9bbc9

Signed-off-by: Tiotto, Ettore <[email protected]>

WIP: TensorDescToBlockPtr updates

f6ed66a

Signed-off-by: Tiotto, Ettore <[email protected]>

WIP: TensorDescToBlockPtr updates

cb4bb2e

Signed-off-by: Tiotto, Ettore <[email protected]>

Merge branch 'main' into etiotto.tensor_desc_to_block_ptr.1

f6ce50a

etiotto requested a review from Copilot May 20, 2025 16:27

Copilot AI reviewed May 20, 2025

View reviewed changes

third_party/intel/lib/TritonIntelGPUTransforms/RemoveLayoutConversions.cpp Outdated Show resolved Hide resolved

third_party/intel/lib/Dialect/Triton/Transforms/TensorDescToBlockPointer.cpp Show resolved Hide resolved

etiotto requested review from mfrancepillois and chengjunlu May 20, 2025 21:05

whitneywhtsang reviewed May 20, 2025

View reviewed changes

chengjunlu approved these changes May 21, 2025

View reviewed changes

etiotto added 4 commits May 22, 2025 15:19

Merge remote-tracking branch 'origin/main' into etiotto.tensor_desc_t…

b439d24

…o_block_ptr.1

Address code review comments

e5f74b8

Signed-off-by: Tiotto, Ettore <[email protected]>

Merge remote-tracking branch 'origin/main' into etiotto.tensor_desc_t…

ff321f5

…o_block_ptr.1

Add unit test

7fcbe40

Signed-off-by: Tiotto, Ettore <[email protected]>

Split unrelated changes in a separate PR

567d82f

Signed-off-by: Tiotto, Ettore <[email protected]>

etiotto requested a review from whitneywhtsang May 22, 2025 18:28

whitneywhtsang approved these changes May 22, 2025

View reviewed changes

Merge branch 'main' into etiotto.tensor_desc_to_block_ptr.1

75e5ef4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[tensor-descriptor]: Extend support when tensor descriptor created in control flow #4152

[tensor-descriptor]: Extend support when tensor descriptor created in control flow #4152

Uh oh!

etiotto commented May 9, 2025 •

edited

Loading

Uh oh!

This comment was marked as outdated.

Uh oh!

mfrancepillois left a comment •

edited

Loading

Uh oh!

kurapov-peter commented May 14, 2025

Uh oh!

etiotto commented May 15, 2025

Uh oh!

etiotto commented May 20, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

etiotto commented May 22, 2025

Uh oh!

whitneywhtsang commented May 22, 2025

Uh oh!

Uh oh!

[tensor-descriptor]: Extend support when tensor descriptor created in control flow #4152

Are you sure you want to change the base?

[tensor-descriptor]: Extend support when tensor descriptor created in control flow #4152

Uh oh!

Conversation

etiotto commented May 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as outdated.

Uh oh!

mfrancepillois left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kurapov-peter commented May 14, 2025

Uh oh!

etiotto commented May 15, 2025

Uh oh!

etiotto commented May 20, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

etiotto commented May 22, 2025

Uh oh!

whitneywhtsang commented May 22, 2025

Uh oh!

Uh oh!

etiotto commented May 9, 2025 •

edited

Loading

mfrancepillois left a comment •

edited

Loading