[Kernel] Fix fused_gdn_gating #28343

ZJY0516 · 2025-11-08T08:37:18Z

Purpose

Fix fused_gdn_gating accuracy issue

Test Plan

vllm serve Qwen/Qwen3-Next-80B-A3B-Instruct --enable-expert-parallel -tp 4

lm_eval --model local-chat-completions --model_args model=Qwen/Qwen3-Next-80B-A3B-Instruct,base_url=http://localhost:8000/v1/chat/completions,num_concurrent=280 --tasks gsm8k --apply_chat_template --num_fewshot 5

Test Result

Tasks	Version	Filter	n-shot	Metric		Value		Stderr
gsm8k	3	flexible-extract	5	exact_match	↑	0.7839	±	0.0113
		strict-match	5	exact_match	↑	0.6566	±	0.0131

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>

gemini-code-assist

Code Review

This pull request addresses an accuracy issue in the fused_gdn_gating_kernel. The changes are twofold: first, the manual sigmoid implementation is replaced with tl.sigmoid, which should provide better numerical stability. Second, the data type for storing the beta_output is corrected to match the input tensor b's data type. This change ensures that the kernel's behavior correctly mimics b.sigmoid(), resolving a precision mismatch that was likely the cause of the accuracy problem. The fix appears correct and well-targeted. I have no further suggestions.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

vllm/model_executor/models/qwen3_next.py

vadiklyutiy · 2025-11-08T08:47:00Z

Could you add accuracy test before this PR

ZJY0516 · 2025-11-08T08:51:40Z

Could you add accuracy test before this PR

Before this PR, sometimes gsm8k score was zero

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>

ZJY0516 · 2025-11-08T09:02:01Z

@codex review

chatgpt-codex-connector · 2025-11-08T09:05:16Z

Codex Review: Didn't find any major issues. What shall we delve into next?

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

vadiklyutiy · 2025-11-08T09:28:10Z

Could you add accuracy test before this PR

Before this PR, sometimes gsm8k score was zero

what sometimes exactly mean?

use the same cmd and result was randomly 0 or around .80
or different cmd give different results
?

ZJY0516 · 2025-11-08T09:53:27Z

Could you add accuracy test before this PR

Before this PR, sometimes gsm8k score was zero

what sometimes exactly mean?

use the same cmd and result was randomly 0 or around .80

or different cmd give different results

?

same cmd, randomly 0 or around .80

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

init

9f4eac6

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>

ZJY0516 requested a review from sighingnow as a code owner November 8, 2025 08:37

mergify bot added the qwen Related to Qwen models label Nov 8, 2025

gemini-code-assist bot reviewed Nov 8, 2025

View reviewed changes

chatgpt-codex-connector bot reviewed Nov 8, 2025

View reviewed changes

vllm/model_executor/models/qwen3_next.py Outdated Show resolved Hide resolved

ZJY0516 added 2 commits November 8, 2025 16:56

update

8265f17

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>

update

fa57bb6

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>

ZJY0516 requested a review from vadiklyutiy November 8, 2025 10:48

mgoin added the ready ONLY add when PR is ready to merge/full CI is needed label Nov 8, 2025

Merge branch 'main' into fix-fused_gdn_gating

aa8e7d9

mgoin approved these changes Nov 9, 2025

View reviewed changes

mgoin merged commit c4768dc into vllm-project:main Nov 9, 2025
53 checks passed

ZJY0516 deleted the fix-fused_gdn_gating branch November 13, 2025 09:38

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Nov 13, 2025

[Kernel] Fix fused_gdn_gating (vllm-project#28343)

c6b5a58

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Kernel] Fix fused_gdn_gating #28343

[Kernel] Fix fused_gdn_gating #28343

Uh oh!

ZJY0516 commented Nov 8, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

Uh oh!

vadiklyutiy commented Nov 8, 2025

Uh oh!

ZJY0516 commented Nov 8, 2025

Uh oh!

ZJY0516 commented Nov 8, 2025

Uh oh!

chatgpt-codex-connector bot commented Nov 8, 2025

Uh oh!

vadiklyutiy commented Nov 8, 2025

Uh oh!

ZJY0516 commented Nov 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

[Kernel] Fix fused_gdn_gating #28343

[Kernel] Fix fused_gdn_gating #28343

Uh oh!

Conversation

ZJY0516 commented Nov 8, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

vadiklyutiy commented Nov 8, 2025

Uh oh!

ZJY0516 commented Nov 8, 2025

Uh oh!

ZJY0516 commented Nov 8, 2025

Uh oh!

chatgpt-codex-connector bot commented Nov 8, 2025

Uh oh!

vadiklyutiy commented Nov 8, 2025

Uh oh!

ZJY0516 commented Nov 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ZJY0516 commented Nov 8, 2025 •

edited by github-actions bot

Loading