
Commit f6883d1

[bugfix] fix mcore_bridge deepseek-v3 (#6508)
1 parent 0ab69b7 commit f6883d1

1 file changed: +1 -0 lines changed


swift/megatron/model/gpt_bridge.py

Lines changed: 1 addition & 0 deletions
@@ -75,6 +75,7 @@ def _get_tp_split_dim(mg_key: Optional[str]) -> Optional[int]:
 'linear_qkv',
 # mla
 'linear_q_proj',
+'linear_q_up_proj',
 'linear_kv_up_proj'
 }
 # RowLinear
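
For context, here is a minimal sketch of the kind of lookup this hunk touches. It is not the code in swift/megatron/model/gpt_bridge.py: only the four key names come from the diff, while the set name `COLUMN_PARALLEL_KEYS`, the function `get_tp_split_dim_sketch`, and the dotted key format are hypothetical. It assumes column-parallel weights are sharded along dim 0, as in Megatron-Core's ColumnParallelLinear, and omits the RowLinear branch that the diff only hints at.

```python
from typing import Optional

# Hypothetical set name; the four entries are the ones visible in the hunk.
COLUMN_PARALLEL_KEYS = {
    'linear_qkv',
    # mla
    'linear_q_proj',
    'linear_q_up_proj',  # the entry added by this commit
    'linear_kv_up_proj',
}


def get_tp_split_dim_sketch(mg_key: Optional[str]) -> Optional[int]:
    """Return 0 for a column-parallel weight key, None if the key is unknown.

    Assumes keys look like '<prefix>.<module_name>.weight'; the real
    function may parse keys differently.
    """
    if mg_key is None:
        return None
    parts = mg_key.rsplit('.', 2)
    if len(parts) < 2:
        return None
    module_name = parts[-2]
    if module_name in COLUMN_PARALLEL_KEYS:
        # Column-parallel: shard along the output dimension (dim 0).
        return 0
    # Row-parallel and replicated modules are not modelled in this sketch.
    return None


key = 'decoder.layers.0.self_attention.linear_q_up_proj.weight'
print(get_tp_split_dim_sketch(key))
```

Under these assumptions, a key whose module name is missing from the set yields `None`, so its weight would not be treated as tensor-parallel sharded. That is the likely reason the missing `'linear_q_up_proj'` entry mattered for DeepSeek-V3.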
