-
Notifications
You must be signed in to change notification settings - Fork 592
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Doc] Upgrade multi-node doc
documentation
Improvements or additions to documentation
#4365
opened Nov 23, 2025 by
Potabk
Loading…
add Qwen2.5-VL README
documentation
Improvements or additions to documentation
#4364
opened Nov 23, 2025 by
MrFan-yes
Loading…
Fix the hang issue of multimodal model when running with DP>1 and cud…
#4362
opened Nov 22, 2025 by
wujinyuan1
Loading…
Fix the hang issue of multimodal model when running with DP>1 and cudagraph_mode is FULL_DECODE_ONLY
#4361
opened Nov 22, 2025 by
wujinyuan1
Loading…
[cherry-pick pr-4355] bugfix for mtp>1 when lm_head_tp>1
merge-conflicts
#4360
opened Nov 22, 2025 by
zouyida2052
Loading…
[refact] unified soc_version code
module:core
module:ops
module:quantization
module:tests
#4359
opened Nov 22, 2025 by
zzzzwwjj
Loading…
Add Qwen3-235B tutorial
documentation
Improvements or additions to documentation
#4358
opened Nov 22, 2025 by
JC-ut0
Loading…
[refactor]support gatingtopk operator generalization
module:core
module:ops
module:tests
#4356
opened Nov 22, 2025 by
1092626063
Loading…
[Bugfix] use module-level import for 'chunk_gated_delta_rule' in Qwen3Next
#4354
opened Nov 22, 2025 by
zjchenn
Loading…
[main]Upgrade cann to 8.3rc2
documentation
Improvements or additions to documentation
module:tests
ready
read for review
ready-for-test
start test by label for PR
#4350
opened Nov 21, 2025 by
MrZ20
Loading…
[MM][Patch] Patch
AscendQwen2_5_VisionAttention and remove redundant code
#4349
opened Nov 21, 2025 by
shen-shanshan
Loading…
【fix】ops gatingtopk fix nightly ci error
module:ops
module:tests
#4340
opened Nov 21, 2025 by
1092626063
Loading…
[Benchmark] Fix python build error
performance-test
enable performance test for PR
ready-for-test
start test by label for PR
#4336
opened Nov 21, 2025 by
Potabk
Loading…
[Doc] Context Parallel
documentation
Improvements or additions to documentation
#4330
opened Nov 21, 2025 by
zhenwenqi2024
Loading…
[Doc][Model] Add principles of modeling files in vllm-ascend
documentation
Improvements or additions to documentation
#4327
opened Nov 21, 2025 by
shen-shanshan
Loading…
[Fix] Remove unnecessary NPU synchronization in MTP proposer
ready
read for review
ready-for-test
start test by label for PR
#4325
opened Nov 21, 2025 by
yiz-liu
Loading…
[TEST]Update deepseek mtpx acc cases standard
module:tests
#4321
opened Nov 21, 2025 by
jiangyunfan1
Loading…
[bugfix] adapt to new implemented get_kv_cache_spec in cpuoffload connector
#4311
opened Nov 20, 2025 by
lidenghui1110
Loading…
[task] Add causal_conv1d_update triton kernel
module:ops
#4307
opened Nov 20, 2025 by
OsirisDuan
Loading…
[task] Add layer norm forward triton kernel
module:ops
#4306
opened Nov 20, 2025 by
OsirisDuan
Loading…
[task] Add fused gdn gating triton kernel
module:ops
#4304
opened Nov 20, 2025 by
OsirisDuan
Loading…
[v0.9.1] Upgrade CANN to 8.2.rc2
ci/build
documentation
Improvements or additions to documentation
module:core
module:ops
module:tests
module:tools
#4303
opened Nov 20, 2025 by
MrZ20
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.