Commit 78fe00a
Merge main to candle refactoring (#523)
* Update test description from Math to General (#483)
Signed-off-by: carlory <baofa.fan@daocloud.io>
* feat: add HuggingChat support (#477)
* add chat ui to dashboard and docker compose & refactor dashboard/backend/
Signed-off-by: JaredforReal <w13431838023@gmail.com>
* try fix network error
Signed-off-by: JaredforReal <w13431838023@gmail.com>
* more
---------
Signed-off-by: JaredforReal <w13431838023@gmail.com>
Co-authored-by: bitliu <bitliu@tencent.com>
* project: 2025 Q4 roadmap (#487)
* project: q4 roadmap
* project: q4 roadmap
* project: q4 roadmap
* more
* more
* more
* more
* feat: add shelleck precommit hook (#488)
* feat: add shelleck precommit hook
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
* feat: add shelleck precommit hook
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
* feat: add shelleck precommit hook
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
---------
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
* project: add q4 roadmap news (#495)
* fix missing shellcheck in pre-commit image (#497)
Signed-off-by: carlory <baofa.fan@daocloud.io>
* infra: update tools (#501)
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
* feat(demo): enhance OpenShift demo scripts with improved UX (#478)
- Reduce model selection test to 4 categories (2×Model-A, 2×Model-B)
- Add new "Classification Examples" option calling curl-examples.sh
- Update reasoning examples to avoid cache hits from previous tests
- Remove benign examples from PII and Jailbreak tests (show only attacks)
- Enhance live-semantic-router-logs.sh with better color visibility:
- Fix duplicate "WITH SCORE" text in classification output
- Fix CACHE HIT background color extending over timestamp
- Distinguish reasoning enabled vs disabled messages
- Remove redundant "(standard routing)" text
- Add background colors for Model-A/Model-B routing display
These improvements make the live demo clearer and more impactful for
presentations and demonstrations.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Signed-off-by: Yossi Ovadia <yovadia@redhat.com>
Co-authored-by: Claude <noreply@anthropic.com>
* fix: fix precommit Argument list too long error (#502)
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
* feat: enforce milvus dial timeout if set (#503)
Signed-off-by: cryo <zdtna412@gmail.com>
* Add IETF draft publication: Multi-Provider Extensions for Agentic AI Inference APIs (#506)
* Initial plan
* Add new IETF draft publication for Multi-Provider Extensions for Agentic AI Inference APIs
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
* Allow semantic cache similarity threshold to be set at the category level (#493)
* Initial plan
* Add category-level cache settings: enabled and similarity_threshold
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
* Add comprehensive tests for category-level cache settings
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
* Update config files and documentation for category-level cache settings
- Updated 7 config YAML files (development, production, testing, e2e, and 3 recipes) with commented examples of category-level cache settings
- Added comprehensive documentation section explaining category-level cache configuration
- Updated semantic cache overview and in-memory cache docs with category-level examples
- Added best practices for threshold selection and privacy considerations
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
* Remove duplicate code in FindSimilar functions
Refactored FindSimilar() to delegate to FindSimilarWithThreshold() with default threshold instead of duplicating the entire implementation. This eliminates 226 lines of duplicate code across inmemory_cache.go and milvus_cache.go.
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
* Update src/semantic-router/pkg/extproc/request_handler.go
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Revert changes from unsigned commit ae39fe2
Restored the classificationText empty check that was removed in the previous commit.
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
Co-authored-by: Huamin Chen <rootfs@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Allow jailbreak detection and threshold to be configured at the category level (#508)
* Initial plan
* Add category-level jailbreak detection configuration
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
* Add documentation for category-level jailbreak settings
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
* Update documentation for category-level jailbreak detection
- Add category-level jailbreak configuration to jailbreak-protection.md
- Update category configuration docs with jailbreak_enabled parameter
- Add security-focused configuration example
- Update global configuration docs with category override notes
- Update README to mention fine-grained security control
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
* Add category-level jailbreak threshold configuration
- Add JailbreakThreshold field to Category struct
- Add GetJailbreakThresholdForCategory helper method
- Create CheckForJailbreakWithThreshold and AnalyzeContentForJailbreakWithThreshold methods
- Update performSecurityChecks to use category-specific threshold
- Add 5 comprehensive tests for threshold configuration
- Update example configs with threshold tuning examples
- Update documentation with threshold configuration and tuning guidelines
- Add threshold tuning guide with recommendations for different category types
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
* Allow PII detection threshold to be set at the category level (#510)
* Initial plan
* Add category-level PII threshold support
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
* Update documentation with API integration notes
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
* Fix markdown linting issues
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
* Fix: The caller information points to the wrapper function instead of the actual call location (#518)
Signed-off-by: carlory <baofa.fan@daocloud.io>
* feat: Implement hybrid cache that use in-memory index and milvus based doc store (#504)
* feat: add HNSW index to inmemory semantic cache and implement hybrid cache that use in-memory index and milvus based doc store
Signed-off-by: Huamin Chen <hchen@redhat.com>
* chore: run go mod tidy to clean up module dependencies
Signed-off-by: Huamin Chen <hchen@redhat.com>
* conditionally build candle cuda support
Signed-off-by: Huamin Chen <hchen@redhat.com>
* rebuild index upon restart
Signed-off-by: Huamin Chen <hchen@redhat.com>
* precommit fix
Signed-off-by: Huamin Chen <hchen@redhat.com>
* fix precommit
Signed-off-by: Huamin Chen <hchen@redhat.com>
* fix precommit
Signed-off-by: Huamin Chen <hchen@redhat.com>
* fix precommit
Signed-off-by: Huamin Chen <hchen@redhat.com>
* disable cuda build on ci
Signed-off-by: Huamin Chen <hchen@redhat.com>
* review feedback
Signed-off-by: Huamin Chen <hchen@redhat.com>
* review feedback
Signed-off-by: Huamin Chen <hchen@redhat.com>
* review feedback
Signed-off-by: Huamin Chen <hchen@redhat.com>
* review feedback
Signed-off-by: Huamin Chen <hchen@redhat.com>
---------
Signed-off-by: Huamin Chen <hchen@redhat.com>
---------
Signed-off-by: carlory <baofa.fan@daocloud.io>
Signed-off-by: JaredforReal <w13431838023@gmail.com>
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
Signed-off-by: Yossi Ovadia <yovadia@redhat.com>
Signed-off-by: cryo <zdtna412@gmail.com>
Signed-off-by: Huamin Chen <hchen@redhat.com>
Co-authored-by: 杨朱 · Kiki <baofa.fan@daocloud.io>
Co-authored-by: Jared <w13431838023@gmail.com>
Co-authored-by: bitliu <bitliu@tencent.com>
Co-authored-by: shown <yuluo08290126@gmail.com>
Co-authored-by: Yossi Ovadia <yovadia@redhat.com>
Co-authored-by: Claude <noreply@anthropic.com>
Co-authored-by: cryo <zdtna412@gmail.com>
Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com>
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>1 parent 2b72f27 commit 78fe00a
File tree
7 files changed
+86
-38
lines changed- candle-binding
- config
- src/semantic-router/pkg/cache
- tools/make
7 files changed
+86
-38
lines changedSome generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
10 | 10 | | |
11 | 11 | | |
12 | 12 | | |
13 | | - | |
| 13 | + | |
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
24 | 24 | | |
25 | 25 | | |
26 | 26 | | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
27 | 36 | | |
28 | 37 | | |
29 | 38 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
24 | 24 | | |
25 | 25 | | |
26 | 26 | | |
27 | | - | |
28 | | - | |
| 27 | + | |
| 28 | + | |
29 | 29 | | |
30 | 30 | | |
31 | 31 | | |
| |||
37 | 37 | | |
38 | 38 | | |
39 | 39 | | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
40 | 43 | | |
41 | 44 | | |
42 | 45 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
120 | 120 | | |
121 | 121 | | |
122 | 122 | | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
123 | 135 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
52 | 52 | | |
53 | 53 | | |
54 | 54 | | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
55 | 58 | | |
56 | 59 | | |
57 | 60 | | |
| |||
66 | 69 | | |
67 | 70 | | |
68 | 71 | | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
69 | 76 | | |
70 | 77 | | |
71 | 78 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
118 | 118 | | |
119 | 119 | | |
120 | 120 | | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
0 commit comments