-
Notifications
You must be signed in to change notification settings - Fork 68
Pull requests: intel/intel-xpu-backend-for-triton
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[TritonGEN] Use the sub-group-size of the module instead of hard code number of 16 in block load.
#4764
opened Jul 22, 2025 by
chengjunlu
Loading…
[TritonGEN] Update the unsupported block load SPV interface list.
#4763
opened Jul 22, 2025 by
chengjunlu
Loading…
[RemoveLayout] Remove convert layout op for any layout if the user is tt.store with block pointer
#4751
opened Jul 21, 2025 by
chengjunlu
Loading…
[FlexDecoding] Support M < 8 tt.dot with DPAS to optimize the flex decoding performance.
#4727
opened Jul 16, 2025 by
chengjunlu
Loading…
[EXPERIMENTAL]: Load column major matrix with 2d block io
#4604
opened Jul 2, 2025 by
chengjunlu
•
Draft
Revert "[DEBUG] Revert "Enable
SPV_INTEL_fp_fast_math_mode
(#4058)"(#4473)"
#4576
opened Jun 26, 2025 by
anmyachev
Loading…
[FlashAttention] Sync from upstream tensor desc implementation (part 3)
#4520
opened Jun 17, 2025 by
whitneywhtsang
•
Draft
Clean up Intel specific code in the common TritonGPU dialect source file.
upstream: triton
#4469
opened Jun 10, 2025 by
chengjunlu
•
Draft
Do not use extractvalue if the inserted value is directly reachable
#4212
opened May 15, 2025 by
AndreyPavlenko
•
Draft
Use inline VISA to optimize horizontal batched subgroup reduce
#4171
opened May 12, 2025 by
chengjunlu
•
Draft
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.