[Model][Qwen3VL] Simplify `get_mrope_input_positions` using numpy #28302

lgeiger · 2025-11-07T15:24:15Z

Purpose

This PR simplifies the Qwen3VL get_mrope_input_positions computation by using np.indices to make the code more readable.

In a profile I also noticed that the torch CPUs ops, especially torch.tensor(list[int]) are slower than their numpy equivalents so this PR also changes the computation to numpy.

Before:

After:

Test Plan

VLLM_WORKER_MULTIPROC_METHOD=spawn lm_eval --model vllm-vlm --model_args "pretrained=Qwen/Qwen3-VL-30B-A3B-Instruct-FP8,max_model_len=10000" --tasks chartqa --batch_size auto --apply_chat_template

Test Result

Before:

Tasks	Version	Filter	Metric		Value		Stderr
chartqa	0	none	anywhere_accuracy	↑	0.8680	±	0.0068
		none	exact_match	↑	0.6340	±	0.0096
		none	relaxed_accuracy	↑	0.8572	±	0.0070

After:

Tasks	Version	Filter	Metric		Value		Stderr
chartqa	0	none	anywhere_accuracy	↑	0.8672	±	0.0068
		none	exact_match	↑	0.6324	±	0.0096
		none	relaxed_accuracy	↑	0.8576	±	0.0070

gemini-code-assist

Code Review

This pull request refactors the get_mrope_input_positions function to use NumPy instead of PyTorch for improved performance and readability. The changes are logical and well-implemented, replacing complex PyTorch operations with more concise NumPy equivalents like np.indices.

I've identified one high-severity issue related to an edge case where empty input_tokens would cause a crash. I've provided a code suggestion to handle this case gracefully. Other than that, the changes look good.

vllm/model_executor/models/qwen3_vl.py

DarkLight1337

Actually I'm thinking of directly using the positions from mm_features instead of having to calculate the mask again, WDYT?

lgeiger · 2025-11-10T13:46:28Z

Actually I'm thinking of directly using the positions from mm_features instead of having to calculate the mask again, WDYT?

I'm not entirely sure how you plan to do this, but not having to compute the mask again does sound sensible.

DarkLight1337 · 2025-11-10T14:09:22Z

We can pass req_state.mm_features which includes mm_position (PlaceholderRange) into get_mrope_input_positions.

DarkLight1337 · 2025-11-10T14:21:35Z

I will open another PR to update the argument list, then we can migrate the models to actually make use of mm_position one by one.

DarkLight1337 · 2025-11-10T14:44:49Z

Opened #28399

mergify · 2025-11-11T12:53:24Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @lgeiger.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

DarkLight1337 · 2025-11-11T14:03:45Z

Feel free to update your PR now

Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>

DarkLight1337

Thanks, LGTM

lgeiger · 2025-11-12T02:20:48Z

@DarkLight1337 Just to make sure there's no confusion: I only rebased this PR so far but haven't made use of mm_position yet. The PR is still valid, though.

I'm not sure when I'll have time to migrate the logic to use mm_position but I'll try to have a look early next week. If you want to make this change soon feel free to make a PR otherwise I'll have a look at it on the weekend.

DarkLight1337 · 2025-11-12T02:31:19Z

Feel free to work on this yourself!

…lm-project#28302) Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com> Signed-off-by: George D. Torres <gdavtor@gmail.com>

…lm-project#28302) Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>

lgeiger requested a review from sighingnow as a code owner November 7, 2025 15:24

mergify bot added the qwen Related to Qwen models label Nov 7, 2025

gemini-code-assist bot reviewed Nov 7, 2025

View reviewed changes

vllm/model_executor/models/qwen3_vl.py Show resolved Hide resolved

DarkLight1337 reviewed Nov 10, 2025

View reviewed changes

mergify bot added the needs-rebase label Nov 11, 2025

[Model][Qwen3VL] Simplify get_mrope_input_positions using numpy

a8494d5

Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>

lgeiger force-pushed the qwen3vl-mrope branch from 4eacf50 to a8494d5 Compare November 11, 2025 20:28

mergify bot removed the needs-rebase label Nov 11, 2025

DarkLight1337 approved these changes Nov 12, 2025

View reviewed changes

DarkLight1337 enabled auto-merge (squash) November 12, 2025 00:48

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Nov 12, 2025

DarkLight1337 merged commit cbb799e into vllm-project:main Nov 12, 2025
55 checks passed

lgeiger deleted the qwen3vl-mrope branch November 14, 2025 15:21

devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request Nov 29, 2025

[Model][Qwen3VL] Simplify get_mrope_input_positions using numpy (vl…

6ac60df

…lm-project#28302) Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>

Uh oh!

[Model][Qwen3VL] Simplify get_mrope_input_positions using numpy #28302

[Model][Qwen3VL] Simplify get_mrope_input_positions using numpy #28302

Uh oh!

Conversation

lgeiger commented Nov 7, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

DarkLight1337 left a comment

Choose a reason for hiding this comment

Uh oh!

lgeiger commented Nov 10, 2025

Uh oh!

DarkLight1337 commented Nov 10, 2025

Uh oh!

DarkLight1337 commented Nov 10, 2025

Uh oh!

DarkLight1337 commented Nov 10, 2025

Uh oh!

mergify bot commented Nov 11, 2025

Uh oh!

DarkLight1337 commented Nov 11, 2025

Uh oh!

DarkLight1337 left a comment

Choose a reason for hiding this comment

Uh oh!

lgeiger commented Nov 12, 2025

Uh oh!

DarkLight1337 commented Nov 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[Model][Qwen3VL] Simplify `get_mrope_input_positions` using numpy #28302

[Model][Qwen3VL] Simplify `get_mrope_input_positions` using numpy #28302

lgeiger commented Nov 7, 2025 •

edited by github-actions bot

Loading

DarkLight1337 commented Nov 12, 2025 •

edited

Loading