-
Notifications
You must be signed in to change notification settings - Fork 28
Pull requests: vllm-project/tpu-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Enable Pipeline Parallelism on torchax path
#1055
opened Nov 8, 2025 by
Chenyaaang
Loading…
1 of 8 tasks
Enable Pipeline Parallelism on Jax TPU platform
#1054
opened Nov 8, 2025 by
Chenyaaang
Loading…
1 of 8 tasks
Enable Pipeline Parallelism on Jax runner
#1053
opened Nov 8, 2025 by
Chenyaaang
Loading…
1 of 8 tasks
Enable Pipeline Parallelism on jax worker
#1043
opened Nov 7, 2025 by
Chenyaaang
Loading…
1 of 8 tasks
Add qkv_parallel_linear/row_parallel_linear lora wrapper unit tests
#1036
opened Nov 6, 2025 by
vanbasten23
Loading…
[Docs] fix dead links in multiple documentation pages
#1027
opened Nov 6, 2025 by
mattheliu
Loading…
3 tasks done
[CI] Introduce a default features to pre-set 'pass' status in the support matrix
#1026
opened Nov 6, 2025 by
boe20211
Loading…
[TPU Offloading][Test] add connector scheduler tests
#1023
opened Nov 6, 2025 by
juncgu-google
Loading…
initial commit on compressed-tensors quantization support for fp8
#1011
opened Nov 4, 2025 by
qihqi
Loading…
[GPT-OSS] Load MXFP4 weights directly and dequantize online
#992
opened Oct 31, 2025 by
amishacorns
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-11-05.