Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix gigachat3 parser + update tests frontend tool-calling
#30338 opened Dec 9, 2025 by ajpqs Loading…
3 of 5 tasks
fix: enhance human_readable_int function
#30337 opened Dec 9, 2025 by andyxning Loading…
5 tasks
[Bugfix] Fix fp8 DeepGemm compilation issues
#30336 opened Dec 9, 2025 by ElizaWszola Loading…
[BUGFIX] Mistral tool call parser v11+ frontend ready ONLY add when PR is ready to merge/full CI is needed tool-calling
#30332 opened Dec 9, 2025 by juliendenize Loading…
5 tasks
[Bugfix] tpu_model_runner: set vllm config context in reset_dynamo_cache() tpu Related to Google TPUs v1
#30331 opened Dec 9, 2025 by dtrifiro Loading…
[Feature][CPU Backend]: Add PyTorch vectorized backend
#30329 opened Dec 9, 2025 by Radu2k Loading…
[BugFix] Fix hang issue in LMCache mp mode kv-connector v1
#30327 opened Dec 9, 2025 by wz1qqx Loading…
5 tasks
[Frontend] [Doc] Exclude log deltas feature frontend
#30322 opened Dec 9, 2025 by Catacomba Loading…
3 tasks done
[fix] fix SM check for Flashinfer TRTLLM MOE nvidia
#30314 opened Dec 9, 2025 by jiahanc Loading…
5 tasks
[Misc][Quantization] Clarify the intent of GGUF FusedMoE weight materialization
#30310 opened Dec 9, 2025 by a4lg Loading…
1 of 5 tasks
[bugfix][quantization] fix quark qwen3 kv_cache quantization qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#30308 opened Dec 9, 2025 by haoyangli-amd Loading…
[Model][Quantization] Fix / Add GGUF support for Qwen2 MoE models qwen Related to Qwen models
#30307 opened Dec 9, 2025 by a4lg Loading…
3 of 5 tasks
[Misc] Pass reasoning to deepseekV32 tokenizer deepseek Related to DeepSeek models frontend
#30302 opened Dec 9, 2025 by kingsmad Draft
5 tasks
[ResponsesAPI] Add GPTOSS MCP tool streaming frontend gpt-oss Related to GPT-OSS models
#30301 opened Dec 9, 2025 by qandrew Loading…
ProTip! Filter pull requests by the default branch with base:main.