-
-
Notifications
You must be signed in to change notification settings - Fork 11.8k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Bugfix] Fix HunyuanOCR cross-image contamination in batch processing
#30344
opened Dec 9, 2025 by
anker-c2
Loading…
3 of 5 tasks
[CI] refine more logic when generating and using nightly wheels & indices
ci/build
#30341
opened Dec 9, 2025 by
Harry-Chen
Loading…
3 of 5 tasks
[CMake][Build]: Remove unused ACL CMake env variables
ci/build
#30339
opened Dec 9, 2025 by
Radu2k
Loading…
Fix gigachat3 parser + update tests
frontend
tool-calling
#30338
opened Dec 9, 2025 by
ajpqs
Loading…
3 of 5 tasks
[Bugfix]: Streaming i/o of batch files. Resolves #30268
ci/build
frontend
#30334
opened Dec 9, 2025 by
umgefahren
Loading…
3 of 5 tasks
[BUGFIX] Mistral tool call parser v11+
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
tool-calling
#30332
opened Dec 9, 2025 by
juliendenize
Loading…
5 tasks
[Bugfix] tpu_model_runner: set vllm config context in reset_dynamo_cache()
tpu
Related to Google TPUs
v1
#30331
opened Dec 9, 2025 by
dtrifiro
Loading…
[Bugfix] Fix cuda graph sizes when running with speculative decoding
nvidia
#30330
opened Dec 9, 2025 by
PatrykSaffer
Loading…
[BugFix] Fix hang issue in LMCache mp mode
kv-connector
v1
#30327
opened Dec 9, 2025 by
wz1qqx
Loading…
5 tasks
[Frontend] [Doc] Exclude log deltas feature
frontend
#30322
opened Dec 9, 2025 by
Catacomba
Loading…
3 tasks done
[BugFix] Spec decode with VLLM_ENABLE_V1_MULTIPROCESSING=0
v1
#30319
opened Dec 9, 2025 by
heheda12345
Loading…
5 tasks
Generalize pooling model support with multi-task, multi-layer, multi-label classification that can be pooled from both hidden states and LM head's logits.
#30315
opened Dec 9, 2025 by
kflu
Loading…
3 of 5 tasks
[fix] fix SM check for Flashinfer TRTLLM MOE
nvidia
#30314
opened Dec 9, 2025 by
jiahanc
Loading…
5 tasks
[Misc][Quantization] Clarify the intent of GGUF
FusedMoE weight materialization
#30310
opened Dec 9, 2025 by
a4lg
Loading…
1 of 5 tasks
[bugfix][quantization] fix quark qwen3 kv_cache quantization
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
#30308
opened Dec 9, 2025 by
haoyangli-amd
Loading…
[Model][Quantization] Fix / Add GGUF support for Qwen2 MoE models
qwen
Related to Qwen models
#30307
opened Dec 9, 2025 by
a4lg
Loading…
3 of 5 tasks
Fix incomplete response generation for tool call outputs
deepseek
Related to DeepSeek models
fb-exported
frontend
meta-exported
[ResponsesAPI] Add GPTOSS MCP tool streaming
frontend
gpt-oss
Related to GPT-OSS models
#30301
opened Dec 9, 2025 by
qandrew
Loading…
[Bugfix] Update WSL detection to check for WSL1 compatibility as WSL2…
#30299
opened Dec 9, 2025 by
HoneyBerries
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.