Pull requests: vllm-project/vllm-gaudi
Doc changes from main to 0.10.2
documentation
Improvements or additions to documentation
skip-gaudi-tests
#549
opened Nov 7, 2025 by
mhelf-intel
Bulk docs to 0.11.0
documentation
skip-gaudi-tests
#547
opened Nov 7, 2025 by
PatrykWo
[FIX_FOR_VLLM_LATEST] Fix upstream execute_model crash
#546
opened Nov 7, 2025 by
iboiko-habana
[Docs] Readme for bucketing from file + env var added
documentation
skip-gaudi-tests
#545
opened Nov 7, 2025 by
adobrzyn
Refactor part of spec decode structure identical to vLLM
#544
opened Nov 7, 2025 by
jerrychenhf
[SW-228042] Add support for dynamic vLLM kv-cache quantization
#538
opened Nov 6, 2025 by
dudilester
[FIX_FOR_VLLM_LATEST] Fix cpu disable shared_experts VLLM_DISABLE_SHARED_EXPERTS_STREAM
#537
opened Nov 6, 2025 by
pawel-olejniczak
[FIX_FOR_VLLM_LATEST] Fix 'HPUCompressedTensorsWNA16MoEMethod' object has no attribute 'fused_experts'
#535
opened Nov 6, 2025 by
pawel-olejniczak
[Attention Metadata Overhaul 2/N] Move metadata processing outside HPUModelAdapter, prepare biases on CPU
#530
opened Nov 5, 2025 by
kzawora-intel
[Attention Metadata Overhaul 1/N] Extract metadata update to HPUAttentionMetadataProcessor
#526
opened Nov 5, 2025 by
kzawora-intel
reduce graph recompilations in input embeddings for Gemma3
#519
opened Nov 4, 2025 by
skaulintel
Draft
[FIX_FOR_VLLM_LATEST] Apply fix for [Core] Async scheduling + structured outputs compatibility #26866
#512
opened Nov 3, 2025 by
iboiko-habana
Call shutdown_inc to mitigate driver worker teardown order
#511
opened Nov 3, 2025 by
michalkuligowski
Draft
[FIX_FOR_VLLM_LATEST] Hourly 775 fix: 'HPUWorker' object has no attribute 'get_kv_connector…
#510
opened Nov 3, 2025 by
pawel-olejniczak