Commit ca57ae3
committed
Improve test points: flaky and verbose
* Make test point less brittle
`tazureChat/generateWithStructuredOutput` has somewhat frequent spurious failures of this type:
```
================================================================================
Verification failed in tazureChat/generateWithStructuredOutput.
---------------------
Framework Diagnostic:
---------------------
OrConstraint failed.
--> + [First Condition]:
| OrConstraint failed.
| --> + [First Condition]:
| | IsEqualTo failed.
| | --> StringComparator failed.
| | --> The strings are not equal.
| |
| | Actual Value:
| | "western honey bee"
| | Expected Value:
| | "honeybee"
| --> OR
| + [Second Condition]:
| | IsEqualTo failed.
| | --> StringComparator failed.
| | --> The strings are not equal.
| |
| | Actual Value:
| | "western honey bee"
| | Expected Value:
| | "honey bee"
| -+---------------------
--> OR
+ [Second Condition]:
| IsEqualTo failed.
| --> StringComparator failed.
| --> The strings are not equal.
|
| Actual Value:
| "western honey bee"
| Expected Value:
| "bee"
-+---------------------
```
Rather than trying to add more and more valid alternatives, it is probably better to relax the requirements a bit.
* Make test less noisy
`texampleTests/testAnalyzeTextDataUsingParallelFunctionCallwithOllama` was lacking the `evalc` against noise cluttering the output.1 parent 08897d1 commit ca57ae3
2 files changed
+3
-4
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
10 | 10 | | |
11 | 11 | | |
12 | 12 | | |
13 | | - | |
| 13 | + | |
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
17 | 17 | | |
18 | 18 | | |
19 | | - | |
20 | | - | |
| 19 | + | |
21 | 20 | | |
22 | 21 | | |
23 | 22 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
77 | 77 | | |
78 | 78 | | |
79 | 79 | | |
80 | | - | |
| 80 | + | |
81 | 81 | | |
82 | 82 | | |
83 | 83 | | |
| |||
0 commit comments