-
-
Notifications
You must be signed in to change notification settings - Fork 7.6k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Docs] Add developer doc about CI failures
documentation
Improvements or additions to documentation
#18782
opened May 27, 2025 by
russellb
Loading…
[V1] [Bugfix] eagle bugfix and enable correct lm_head for multimodal (2)
v1
#18781
opened May 27, 2025 by
RonaldBXu
Loading…
Export NaNs in logits to scheduler_stats if output is corrupted
tpu
Related to Google TPUs
v1
#18777
opened May 27, 2025 by
vladmihailescu
Loading…
[V1] Allocate kv_cache with stride order for V1
v1
#18775
opened May 27, 2025 by
NickLucche
Loading…
[Core] Improve Tensor serialisation
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#18774
opened May 27, 2025 by
lgeiger
Loading…
[Bugfix] Disable prefix caching by default for benchmark
ready
ONLY add when PR is ready to merge/full CI is needed
#18771
opened May 27, 2025 by
cascade812
Loading…
[Bugfix]: correctly propagate errors message caught at the chat_templating step to the client
frontend
#18769
opened May 27, 2025 by
gcalmettes
Loading…
[Feature] A calibration-free RTN-based quantization for accurate and accelerated INT4/INT8 inference
#18768
opened May 27, 2025 by
sakogan
Loading…
[WIP] Add a metric to track request failures
documentation
Improvements or additions to documentation
frontend
[Platform][Dist] Make torch distributed process group extendable
#18763
opened May 27, 2025 by
MengqingCao
Loading…
[WIP][Kernel] Integrate CUTLASS MoE kernel with PPLX
#18762
opened May 27, 2025 by
ElizaWszola
•
Draft
[Bugfix][FailingTest]Fix test_model_load_with_params.py
ci/build
#18758
opened May 27, 2025 by
rabi
Loading…
[Bugfix] Fix nomic max_model_len
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#18755
opened May 27, 2025 by
noooop
Loading…
[Kernel] GGUF MMVQ kernel for multiple input vectors
ready
ONLY add when PR is ready to merge/full CI is needed
#18754
opened May 27, 2025 by
SzymonOzog
Loading…
[Core] Rework dtype resolution
multi-modality
Related to multi-modality (#4194)
#18751
opened May 27, 2025 by
DarkLight1337
Loading…
[Bugfix]: Fix moe_unpermute compatibility by aligning function signatures under CUDA < 12.0
#18749
opened May 27, 2025 by
caibucai22
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.