-
Notifications
You must be signed in to change notification settings - Fork 18
Pull requests: vllm-project/tpu-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP][DO NOT REVIEW] [TPU host offload] delta load optimization for tpu connector local
#941
opened Oct 26, 2025 by
saikat-royc
Loading…
[Do not review yet] Fix issues when running multiple tests on the v6e-8 machine.
#926
opened Oct 23, 2025 by
vanbasten23
•
Draft
[Kernel] Refactor ragged_paged_attention to proxy for default and hd64
#918
opened Oct 22, 2025 by
yaochengji
•
Draft
[CI] remove lora_bias_stacked as it is deprecated in vllm
#835
opened Oct 11, 2025 by
bzgoogle
Loading…
feat: Add a procedures to record the vllm and tpu_inference's commit hashes in CI pipeline (WIP)
#795
opened Oct 7, 2025 by
dennisYehCienet
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.