Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix cross attention v1
#28346 opened Nov 8, 2025 by fsx950223 Loading…
5 tasks
[Performance][gpt-oss] Revert gpt-oss max cudagraph size to 1024 gpt-oss Related to GPT-OSS models performance Performance-related issues ready ONLY add when PR is ready to merge/full CI is needed
#28345 opened Nov 8, 2025 by mmangkad Loading…
5 tasks
[Kernel] Fix fused_gdn_gating qwen Related to Qwen models
#28343 opened Nov 8, 2025 by ZJY0516 Loading…
5 tasks
fix: close issue 28338 by fixed python version ci/build
#28339 opened Nov 8, 2025 by yihong0618 Loading…
5 tasks
Remove setuptools upper bound constraint (<80) ci/build ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm
#28337 opened Nov 8, 2025 by ColeMurray Loading…
[Misc] FlattenLogprobs -> FlatLogprobs ready ONLY add when PR is ready to merge/full CI is needed
#28335 opened Nov 8, 2025 by zhuohan123 Loading…
5 tasks
[Frontend] split append tool output frontend gpt-oss Related to GPT-OSS models
#28333 opened Nov 8, 2025 by qandrew Loading…
[Model] Add Afmoe architecture implementation documentation Improvements or additions to documentation new-model Requests to new models
#28332 opened Nov 8, 2025 by pranav4501 Loading…
5 tasks done
[Frontend][2/n] remove empty content from _parse_tool_calls_from_content frontend ready ONLY add when PR is ready to merge/full CI is needed
#28331 opened Nov 7, 2025 by qandrew Loading…
[Misc] Add more scoping for improved trace v1
#28329 opened Nov 7, 2025 by frank-wei Loading…
Enhance run_cluster.sh for multi-NIC support documentation Improvements or additions to documentation
#28328 opened Nov 7, 2025 by evberrypi Loading…
2 of 4 tasks
[Core] Simplify async KV output aggregation kv-connector ready ONLY add when PR is ready to merge/full CI is needed v1
#28327 opened Nov 7, 2025 by njhill Loading…
Fix rotary embedding benchmark script performance Performance-related issues
#28323 opened Nov 7, 2025 by xyang16 Loading…
5 tasks
[ROCm] Add env to enable/disable aiter triton gemm rocm Related to AMD ROCm
#28321 opened Nov 7, 2025 by sarckk Loading…
3 of 5 tasks
Make tests/lora/utils usable by plugins
#28313 opened Nov 7, 2025 by vanbasten23 Loading…
5 tasks
[Core] Cache vllm_is_batch_invariant
#28304 opened Nov 7, 2025 by lgeiger Loading…
[Bugfix] Parse gpt-oss refusals w/ newer openai-harmony ci/build frontend gpt-oss Related to GPT-OSS models
#28303 opened Nov 7, 2025 by bbrowning Loading…
[Model][Qwen3VL] Simplify get_mrope_input_positions using numpy qwen Related to Qwen models
#28302 opened Nov 7, 2025 by lgeiger Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.