ZImageTransformer2D: Only build attention mask if seqlens are not equal #12955

Beinsezii · 2026-01-10T02:08:43Z

What does this PR do?

Fixes a small performance regression for Z Image Turbo.

Sets attn_mask to None when it would otherwise be all ones. This is always the case for Z Image Turbo in typical usage, where guidance_scale==1.

On an H100 this improves performance by about 4%, using AttentionBackendName._NATIVE_CUDNN.
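
A minimal, framework-free sketch of the idea, not the actual diff: the helper name `build_attn_mask` and the list-of-lists mask layout are assumptions for illustration (the real code works on tensors inside ZImageTransformer2D). It only builds a padding mask when at least one sequence is shorter than the longest one, and returns None otherwise so the attention backend can run without a mask.

```python
from typing import List, Optional


def build_attn_mask(seq_lens: List[int]) -> Optional[List[List[bool]]]:
    """Hypothetical helper: return a padding mask, or None if no padding is needed."""
    if not seq_lens:
        return None
    max_len = max(seq_lens)
    # When every sequence already has the maximum length, the mask would be
    # all True (all ones), so skip building it and return None instead.
    if all(n == max_len for n in seq_lens):
        return None
    # Otherwise mark real tokens True and padded positions False.
    return [[i < n for i in range(max_len)] for n in seq_lens]


print(build_attn_mask([8, 8]))  # equal lengths: None
print(build_attn_mask([2, 3]))  # unequal lengths: [[True, True, False], [True, True, True]]
```

With a single prompt per batch, or several prompts of equal length, the first branch is taken and no mask is allocated or passed to attention.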

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@yiyixuxu or @sayakpaul probably

sayakpaul · 2026-01-10T03:30:07Z

Cc: @JerryWu-code who contributed the model.

ZImageTransformer2D: Only build attention mask if seqlens are not equal

02ae19d
