Commit a9d56db

making tests run faster
Summary Signed-off-by: HDCharles <charlesdavidhernandez@gmail.com>
1 parent 37c3314 commit a9d56db

4 files changed: 28 additions, 3 deletions

Lines changed: 3 additions & 1 deletion
@@ -1,7 +1,9 @@
 cadence: "nightly"
 test_type: "regression"
 model: Qwen/Qwen3-30B-A3B
+
 scheme: NVFP4
-num_calibration_samples: 20
+
 dataset_id: HuggingFaceH4/ultrachat_200k
 dataset_split: train_sft
+num_calibration_samples: 20
@@ -1,4 +1,5 @@
 cadence: "nightly"
 test_type: "regression"
 model: Qwen/Qwen3-30B-A3B
+
 scheme: FP8_DYNAMIC
@@ -1,7 +1,9 @@
 cadence: "nightly"
 test_type: "regression"
 model: Qwen/Qwen3-30B-A3B
-scheme: W4A16
+
 dataset_id: HuggingFaceH4/ultrachat_200k
 dataset_split: train_sft
-quant_type: "GPTQ"
+num_calibration_samples: 20
+
+recipe: tests/e2e/vLLM/recipes/WNA16/recipe_w4a16_group_quant_first_20_layers.yaml
Lines changed: 20 additions & 0 deletions
@@ -0,0 +1,20 @@
+quant_stage:
+  quant_modifiers:
+    GPTQModifier:
+      ignore: [
+        "lm_head",
+        # Ignore layers (20+)
+        "re:.*model\\.layers\\.([2-9][0-9])\\..*",
+      ]
+      actorder: null
+      config_groups:
+        group_0:
+          weights:
+            num_bits: 4
+            type: "int"
+            symmetric: False
+            strategy: "group"
+            group_size: 128
+          input_activations: null
+          output_activations: null
+          targets: ["Linear"]
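
The `ignore` regex in the recipe above is what restricts GPTQ to the first 20 decoder layers. A minimal Python sketch of which layers it skips follows. It assumes the `re:` prefix is stripped and the rest is matched against the full module name with `re.match`. The module names and the 48-layer count are hypothetical and used only for illustration.

```python
import re

# Pattern from the recipe's ignore list. YAML double quotes turn "\\." into "\.",
# and the "re:" prefix is assumed to be stripped before matching.
PATTERN = re.compile(r".*model\.layers\.([2-9][0-9])\..*")


def is_ignored(module_name: str) -> bool:
    """Return True if the ignore regex matches the module name."""
    return PATTERN.match(module_name) is not None


# Hypothetical module names for a 48-layer decoder (layer count is an assumption).
names = [f"model.layers.{i}.self_attn.q_proj" for i in range(48)]

quantized = [i for i, name in enumerate(names) if not is_ignored(name)]
ignored = [i for i, name in enumerate(names) if is_ignored(name)]

print("quantized layers:", quantized)
print("ignored layers:", ignored)
```

Under these assumptions the pattern skips only two-digit indices from 20 to 99, so layers 0 to 19 are quantized. A model with 100 or more layers would need a wider pattern, since an index such as 100 starts with `1` and is not matched.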
