[bugfix] adapt to new implemented get_kv_cache_spec in cpuoffload connector #4311

lidenghui1110 · 2025-11-20T09:15:59Z

What this PR does / why we need it?

The get_kv_cache_spec function in model_runner has changed substantially, which causes an error in the CPU offloading connector because the connector carries its own copy of that function. This PR adapts the connector's copy to the new get_kv_cache_spec implementation to fix the error.

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.11.0
vLLM main: vllm-project/vllm@2918c1b

Signed-off-by: lidenghui <lidenghui1110@gmail.com>

github-actions · 2025-11-20T09:16:09Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

A PR should do only one thing; smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by other future PRs.
Write the commit message by filling in the PR description to help reviewers and future developers understand the change.

If CI fails, you can run the linting and testing checks locally according to the Contributing and Testing guides.

Signed-off-by: lidenghui <lidenghui1110@gmail.com>

lidenghui1110 · 2025-11-20T13:20:02Z

@wangxiyuan please take a look at this bugfix, thanks.

adapt to new implemented get_kv_cache_spec in cpuoffload connector

b9cc35d

Signed-off-by: lidenghui <lidenghui1110@gmail.com>

fix lint

eacb9c8

Signed-off-by: lidenghui <lidenghui1110@gmail.com>
