Pull requests: pytorch/helion
Pull requests list
[Pallas] Fix codegen for slice indexing when there are squeezed dimensions
CLA Signed
This label is managed by the Meta Open Source bot.
#2027
opened Apr 16, 2026 by
AmesingFlank
Contributor
[Pallas] Remove non-tiled fallback paths which are no longer used after #2007
CLA Signed
#2026
opened Apr 16, 2026 by
AmesingFlank
Contributor
[TPU][Pallas] Add lower bound analytical VMEM estimation and OOM guard for Pallas launchers
CLA Signed
#2024
opened Apr 15, 2026 by
yarongmu-google
[CI] Pin Triton-to-tile-IR to last known-good commit
CLA Signed
[Pallas] Use FakeTensorMode to avoid HBM allocation for output-only tensors
CLA Signed
[cutedsl] Plan grouped-N matmuls and lower atomic tensor indices
CLA Signed
#2020
opened Apr 14, 2026 by
jansel
Contributor
[TPU][Pallas] Fix example/cross_entropy.py on Pallas TPU
CLA Signed
#2019
opened Apr 14, 2026 by
yarongmu-google
[cutedsl] Improve dot with epilogue handling
CLA Signed
#2014
opened Apr 14, 2026 by
jansel
Contributor
[Autotuner] Apply numel constraints on search neighbors
CLA Signed
#2011
opened Apr 13, 2026 by
fulvius31
Collaborator
[cutedsl] Strengthen layout planning pass invariants
CLA Signed
#2009
opened Apr 13, 2026 by
jansel
Contributor
[cutedsl] Refactor reductions to use helper methods
CLA Signed
#2008
opened Apr 13, 2026 by
jansel
Contributor
[Autotuner] Add LLM-seeded hybrid search
CLA Signed
#2004
opened Apr 12, 2026 by
choijon5
Contributor
[Autotuner] Adding LLM-guided search
CLA Signed
#2003
opened Apr 12, 2026 by
choijon5
Contributor
[Pallas] Don't record block_id in dim_map for hl.grid program_id dimensions
CLA Signed
#2001
opened Apr 10, 2026 by
norx1991
Contributor
[Pallas] Exclude output-only tensors from pallas_call inputs
CLA Signed
[Pallas] Fix fori_loop multi-dim inner loop index unflattening
CLA Signed
#1995
opened Apr 9, 2026 by
thcmbs
Collaborator
fix misleading benchmarking for fp8 gemm
CLA Signed
#1980
opened Apr 7, 2026 by
shunting314
Contributor
WIP: fp8 all gather matmul
CLA Signed
#1974
opened Apr 7, 2026 by
shunting314
Contributor
warning for process group name not found conditionally
CLA Signed
#1973
opened Apr 7, 2026 by
shunting314
Contributor
[Pallas] Fix scalar .begin index not collapsing tensor dimensions
CLA Signed
[Compiler] Add reserved_launch_param_names to Backend ABC
CLA Signed
#1970
opened Apr 7, 2026 by
hinriksnaer
Collaborator
[Pallas] Add non-DMA fori_loop fallback for DMA-unaligned inner blocks
CLA Signed
#1969
opened Apr 7, 2026 by
thcmbs
Collaborator
Use persistent+tensor_descriptor defaults for dot kernels on Blackwell (sm100+)
CLA Signed
#1964
opened Apr 7, 2026 by
choijon5
Contributor
fix _clone_args when there are non-tensor input
CLA Signed
#1963
opened Apr 6, 2026 by
shunting314
Contributor