-
-
Notifications
You must be signed in to change notification settings - Fork 5k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[V1] Add BlockTable class
ready
ONLY add when PR is ready to merge/full CI is needed
#11693
opened Jan 2, 2025 by
WoosukKwon
Loading…
[V1][Minor] Optimize token_ids_cpu copy
ready
ONLY add when PR is ready to merge/full CI is needed
#11692
opened Jan 2, 2025 by
WoosukKwon
Loading…
Add split_special_tokens to the Tokenize Endpoint
frontend
#11691
opened Jan 2, 2025 by
ruediste
Loading…
According to vllm.EngineArgs, the name should be distributed_executor_backend
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#11689
opened Jan 2, 2025 by
chunyang-wen
Loading…
[Bugfix] Change kv scaling factor by param json on nvidia gpu
#11688
opened Jan 2, 2025 by
bjmsong
Loading…
k8s-config: Update the secret to use stringData
documentation
Improvements or additions to documentation
#11679
opened Jan 2, 2025 by
surajssd
Loading…
[torch.compile] Hide KV cache behind torch.compile boundary
#11677
opened Jan 2, 2025 by
heheda12345
•
Draft
[Bugfix] Check chain_speculative_sampling before calling it
#11673
opened Jan 1, 2025 by
houseroad
Loading…
[Bugfix][SpecDecode] Adjust Eagle model architecture to align with intended design
#11672
opened Jan 1, 2025 by
llsj14
Loading…
[CI/Build] Update OpenVINO Dockerfile to Ubuntu 24.04
ci/build
#11670
opened Jan 1, 2025 by
ruediste
Loading…
[V1] Simplify Shutdown
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
#11659
opened Dec 31, 2024 by
robertgshaw2-neuralmagic
Loading…
[XPU] Make pp group initilized for pipeline-parallelism
#11648
opened Dec 31, 2024 by
ys950902
Loading…
[Doc] [1/N] Reorganize Getting Started section
documentation
Improvements or additions to documentation
#11645
opened Dec 31, 2024 by
DarkLight1337
Loading…
[Docs] reorganize sponsorship page
documentation
Improvements or additions to documentation
#11639
opened Dec 30, 2024 by
simon-mo
Loading…
[Quantization/Parameter] WIP: Replace parameter subclasses with raw nn.Parameter with additional attributes
#11622
opened Dec 30, 2024 by
cennn
Loading…
[torch.compile] consider relevant code in compilation cache
#11614
opened Dec 30, 2024 by
youkaichao
Loading…
[Do Not Merge] - LoRA V1 Reference PR
needs-rebase
#11613
opened Dec 30, 2024 by
varun-sundar-rabindranath
•
Draft
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.