Pull requests: vllm-project/vllm

Pull requests list

[BUGFIX] llama_4_scaling wrongly passed to DeepseekAttention
Labels: deepseek, llama, ready
#29908 opened Dec 2, 2025 by juliendenize
5 tasks
[CI/Build] Avoid duplicate empty inputs test for common multimodal generation tests
Labels: multi-modality
#29907 opened Dec 2, 2025 by Isotr0py Draft
3 of 5 tasks
Gigachat 3 tool parser and tests
Labels: documentation, frontend, tool-calling
#29905 opened Dec 2, 2025 by ajpqs
4 of 5 tasks
[Logs] Optimize startup logs 4
Labels: nvidia, v1
#29903 opened Dec 2, 2025 by yewentao256
SigLIP example add chat_template
Labels: documentation, needs-rebase
#29902 opened Dec 2, 2025 by piood
[Compile] Fix torch warning TensorFloat32 tensor cores for float32 matrix multiplication available but not enabled
Labels: ready, v1
#29897 opened Dec 2, 2025 by yewentao256
feat(model): Add BitsAndBytes quantization support for Qwen3-Omni-MoE
Labels: qwen, ready
#29896 opened Dec 2, 2025 by Navanit-git
5 tasks
[DOC] Add Arm to list of compute resouces providers
Labels: documentation
#29894 opened Dec 2, 2025 by fadara01
2 tasks
[BugFix] Fix assert in build_for_cudagraph_capture
Labels: nvidia, ready, v1
#29893 opened Dec 2, 2025 by LucasWilkinson v0.12.0
[Bugfix] Fix FP8 MoE LoRA
#29890 opened Dec 2, 2025 by jeejeelee
5 tasks
optimize topk_topp_sampling.
Labels: v1
#29886 opened Dec 2, 2025 by RanTao123
4 tasks
Improve and supplement LoRA tuning documentation
Labels: performance
#29880 opened Dec 2, 2025 by caozuoba
5 tasks
[BugFix] fix imgs_pos in hunyuan_vl
#29879 opened Dec 2, 2025 by wkcn Draft
5 tasks
Support Deepseekv32 chat
Labels: deepseek, frontend
#29876 opened Dec 2, 2025 by yjc9696 Draft
5 tasks
add toolparser for deepseek v3.2 reusing qwen xml parser
Labels: deepseek, frontend, qwen, tool-calling
#29874 opened Dec 2, 2025 by wenmengzhou