Skip to content

Commit 3322e08

Browse files
committed
testing
1 parent db09390 commit 3322e08

File tree

4 files changed

+224350
-15
lines changed

4 files changed

+224350
-15
lines changed

examples/quantization_w4a4_fp4/qwen_30b_a3b.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -55,7 +55,7 @@ def tokenize(sample):
5555
# * calibrate a global_scale for activations, which will be used to
5656
# quantize activations to fp4 on the fly
5757
recipe = QuantizationModifier(
58-
targets="Linear", scheme="NVFP4", ignore=["lm_head", "re:.*mlp.gate$"]
58+
targets="Linear", scheme="W4A16", ignore=["lm_head", "re:.*mlp.gate$"]
5959
)
6060

6161
# Apply quantization.

0 commit comments

Comments
 (0)