Commit 9c58b20
[MoE] Nvfp4 Masked Gemm: Add flashinfer grouped_gemm_nt_masked (vllm-project#25990)
Signed-off-by: Shu Wang. <shuw@nvidia.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
1 parent 1e74276 commit 9c58b20
File tree
10 files changed, +1062 -33 lines changed
- .buildkite
- tests/kernels/moe
- vllm
- model_executor/layers
- fused_moe
- quantization
- utils
- utils