Pull requests: vllm-project/vllm

Pull requests list

[Docs] Add developer doc about CI failures (labels: documentation)
#18782 opened May 27, 2025 by russellb
Fail request if FSM fails to advance (labels: v1)
#18780 opened May 27, 2025 by atbe
[V1] Support DP with Ray frontend (labels: v1)
#18779 opened May 27, 2025 by ruisearch42
[Perf] Tunings for SM100 FP8 CUTLASS kernel
#18778 opened May 27, 2025 by mgoin
Export NaNs in logits to scheduler_stats if output is corrupted (labels: tpu, v1)
#18777 opened May 27, 2025 by vladmihailescu
[Core] Improve Tensor serialisation (labels: ready, v1)
#18774 opened May 27, 2025 by lgeiger
[Bugfix] Fix for issue 17396
#18773 opened May 27, 2025 by frreiss
[Bugfix] Disable prefix caching by default for benchmark (labels: ready)
#18771 opened May 27, 2025 by cascade812
[Torch Nightly]add missing dependency (labels: ci/build)
#18770 opened May 27, 2025 by yangw-dev
[WIP] Add a metric to track request failures (labels: documentation, frontend)
#18765 opened May 27, 2025 by harche Draft
[rocm] Fix wrong attention log
#18764 opened May 27, 2025 by fxmarty-amd
[Bugfix] Fix nomic max_model_len (labels: documentation, ready)
#18755 opened May 27, 2025 by noooop
[Kernel] GGUF MMVQ kernel for multiple input vectors (labels: ready)
#18754 opened May 27, 2025 by SzymonOzog
[Core] Rework dtype resolution (labels: multi-modality)
#18751 opened May 27, 2025 by DarkLight1337
[CI] improve embed testing
#18747 opened May 27, 2025 by noooop
[Core] Support inplace model weights loading (labels: v1)
#18745 opened May 27, 2025 by 22quinn
Add cuda 12.8 wheel nightly build (labels: ci/build)
#18726 opened May 26, 2025 by atalman