Commit 589e03e
Merge main to candle refactoring (#523)
* Update test description from Math to General (#483)
Signed-off-by: carlory <baofa.fan@daocloud.io>
* feat: add HuggingChat support (#477)
* add chat ui to dashboard and docker compose & refactor dashboard/backend/
Signed-off-by: JaredforReal <w13431838023@gmail.com>
* try fix network error
Signed-off-by: JaredforReal <w13431838023@gmail.com>
* more
---------
Signed-off-by: JaredforReal <w13431838023@gmail.com>
Co-authored-by: bitliu <bitliu@tencent.com>
* project: 2025 Q4 roadmap (#487)
* project: q4 roadmap
* project: q4 roadmap
* project: q4 roadmap
* more
* more
* more
* more
* feat: add shelleck precommit hook (#488)
* feat: add shelleck precommit hook
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
* feat: add shelleck precommit hook
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
* feat: add shelleck precommit hook
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
---------
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
* project: add q4 roadmap news (#495)
* fix missing shellcheck in pre-commit image (#497)
Signed-off-by: carlory <baofa.fan@daocloud.io>
* infra: update tools (#501)
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
* feat(demo): enhance OpenShift demo scripts with improved UX (#478)
- Reduce model selection test to 4 categories (2×Model-A, 2×Model-B)
- Add new "Classification Examples" option calling curl-examples.sh
- Update reasoning examples to avoid cache hits from previous tests
- Remove benign examples from PII and Jailbreak tests (show only attacks)
- Enhance live-semantic-router-logs.sh with better color visibility:
- Fix duplicate "WITH SCORE" text in classification output
- Fix CACHE HIT background color extending over timestamp
- Distinguish reasoning enabled vs disabled messages
- Remove redundant "(standard routing)" text
- Add background colors for Model-A/Model-B routing display
These improvements make the live demo clearer and more impactful for
presentations and demonstrations.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Signed-off-by: Yossi Ovadia <yovadia@redhat.com>
Co-authored-by: Claude <noreply@anthropic.com>
* fix: fix precommit Argument list too long error (#502)
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
* feat: enforce milvus dial timeout if set (#503)
Signed-off-by: cryo <zdtna412@gmail.com>
* Add IETF draft publication: Multi-Provider Extensions for Agentic AI Inference APIs (#506)
* Initial plan
* Add new IETF draft publication for Multi-Provider Extensions for Agentic AI Inference APIs
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
* Allow semantic cache similarity threshold to be set at the category level (#493)
* Initial plan
* Add category-level cache settings: enabled and similarity_threshold
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
* Add comprehensive tests for category-level cache settings
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
* Update config files and documentation for category-level cache settings
- Updated 7 config YAML files (development, production, testing, e2e, and 3 recipes) with commented examples of category-level cache settings
- Added comprehensive documentation section explaining category-level cache configuration
- Updated semantic cache overview and in-memory cache docs with category-level examples
- Added best practices for threshold selection and privacy considerations
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
* Remove duplicate code in FindSimilar functions
Refactored FindSimilar() to delegate to FindSimilarWithThreshold() with default threshold instead of duplicating the entire implementation. This eliminates 226 lines of duplicate code across inmemory_cache.go and milvus_cache.go.
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
* Update src/semantic-router/pkg/extproc/request_handler.go
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Revert changes from unsigned commit ae39fe2
Restored the classificationText empty check that was removed in the previous commit.
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
Co-authored-by: Huamin Chen <rootfs@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Allow jailbreak detection and threshold to be configured at the category level (#508)
* Initial plan
* Add category-level jailbreak detection configuration
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
* Add documentation for category-level jailbreak settings
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
* Update documentation for category-level jailbreak detection
- Add category-level jailbreak configuration to jailbreak-protection.md
- Update category configuration docs with jailbreak_enabled parameter
- Add security-focused configuration example
- Update global configuration docs with category override notes
- Update README to mention fine-grained security control
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
* Add category-level jailbreak threshold configuration
- Add JailbreakThreshold field to Category struct
- Add GetJailbreakThresholdForCategory helper method
- Create CheckForJailbreakWithThreshold and AnalyzeContentForJailbreakWithThreshold methods
- Update performSecurityChecks to use category-specific threshold
- Add 5 comprehensive tests for threshold configuration
- Update example configs with threshold tuning examples
- Update documentation with threshold configuration and tuning guidelines
- Add threshold tuning guide with recommendations for different category types
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
* Allow PII detection threshold to be set at the category level (#510)
* Initial plan
* Add category-level PII threshold support
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
* Update documentation with API integration notes
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
* Fix markdown linting issues
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>
* Fix: The caller information points to the wrapper function instead of the actual call location (#518)
Signed-off-by: carlory <baofa.fan@daocloud.io>
* feat: Implement hybrid cache that use in-memory index and milvus based doc store (#504)
* feat: add HNSW index to inmemory semantic cache and implement hybrid cache that use in-memory index and milvus based doc store
Signed-off-by: Huamin Chen <hchen@redhat.com>
* chore: run go mod tidy to clean up module dependencies
Signed-off-by: Huamin Chen <hchen@redhat.com>
* conditionally build candle cuda support
Signed-off-by: Huamin Chen <hchen@redhat.com>
* rebuild index upon restart
Signed-off-by: Huamin Chen <hchen@redhat.com>
* precommit fix
Signed-off-by: Huamin Chen <hchen@redhat.com>
* fix precommit
Signed-off-by: Huamin Chen <hchen@redhat.com>
* fix precommit
Signed-off-by: Huamin Chen <hchen@redhat.com>
* fix precommit
Signed-off-by: Huamin Chen <hchen@redhat.com>
* disable cuda build on ci
Signed-off-by: Huamin Chen <hchen@redhat.com>
* review feedback
Signed-off-by: Huamin Chen <hchen@redhat.com>
* review feedback
Signed-off-by: Huamin Chen <hchen@redhat.com>
* review feedback
Signed-off-by: Huamin Chen <hchen@redhat.com>
* review feedback
Signed-off-by: Huamin Chen <hchen@redhat.com>
---------
Signed-off-by: Huamin Chen <hchen@redhat.com>
---------
Signed-off-by: carlory <baofa.fan@daocloud.io>
Signed-off-by: JaredforReal <w13431838023@gmail.com>
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
Signed-off-by: Yossi Ovadia <yovadia@redhat.com>
Signed-off-by: cryo <zdtna412@gmail.com>
Signed-off-by: Huamin Chen <hchen@redhat.com>
Co-authored-by: 杨朱 · Kiki <baofa.fan@daocloud.io>
Co-authored-by: Jared <w13431838023@gmail.com>
Co-authored-by: bitliu <bitliu@tencent.com>
Co-authored-by: shown <yuluo08290126@gmail.com>
Co-authored-by: Yossi Ovadia <yovadia@redhat.com>
Co-authored-by: Claude <noreply@anthropic.com>
Co-authored-by: cryo <zdtna412@gmail.com>
Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com>
Co-authored-by: rootfs <7062400+rootfs@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Xunzhuo <48784001+Xunzhuo@users.noreply.github.com>1 parent 5ee8a1b commit 589e03e
File tree
98 files changed
+9227
-1019
lines changed- .github/workflows
- bench
- candle-binding
- config
- examples
- dashboard
- backend
- config
- handlers
- middleware
- proxy
- router
- frontend/src
- components
- pages
- deploy
- docker-compose
- openshift
- dashboard
- demo
- e2e-tests
- scripts
- src
- semantic-router
- pkg
- api
- cache
- config
- extproc
- observability
- utils/classification
- training/training_lora
- classifier_model_fine_tuning_lora
- mmlu_pro_solver_lora
- pii_model_fine_tuning_lora
- prompt_guard_fine_tuning_lora
- tools
- linter/shellcheck
- make
- website
- blog
- docs
- installation
- overview/categories
- tutorials
- content-safety
- semantic-cache
- src
- css
- pages
- community
- static/img
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
98 files changed
+9227
-1019
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
97 | 97 | | |
98 | 98 | | |
99 | 99 | | |
| 100 | + | |
| 101 | + | |
100 | 102 | | |
101 | 103 | | |
102 | 104 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
71 | 71 | | |
72 | 72 | | |
73 | 73 | | |
74 | | - | |
| 74 | + | |
75 | 75 | | |
76 | | - | |
| 76 | + | |
77 | 77 | | |
78 | | - | |
| 78 | + | |
79 | 79 | | |
80 | | - | |
| 80 | + | |
81 | 81 | | |
82 | | - | |
| 82 | + | |
83 | 83 | | |
84 | | - | |
| 84 | + | |
85 | 85 | | |
86 | 86 | | |
87 | 87 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
69 | 69 | | |
70 | 70 | | |
71 | 71 | | |
72 | | - | |
73 | | - | |
| 72 | + | |
| 73 | + | |
74 | 74 | | |
75 | 75 | | |
76 | 76 | | |
| |||
86 | 86 | | |
87 | 87 | | |
88 | 88 | | |
| 89 | + | |
89 | 90 | | |
90 | 91 | | |
91 | 92 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
22 | 22 | | |
23 | 23 | | |
24 | 24 | | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
25 | 33 | | |
26 | 34 | | |
27 | 35 | | |
| |||
73 | 81 | | |
74 | 82 | | |
75 | 83 | | |
76 | | - | |
| 84 | + | |
77 | 85 | | |
78 | 86 | | |
79 | 87 | | |
| |||
87 | 95 | | |
88 | 96 | | |
89 | 97 | | |
90 | | - | |
| 98 | + | |
91 | 99 | | |
92 | 100 | | |
93 | 101 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
30 | 30 | | |
31 | 31 | | |
32 | 32 | | |
33 | | - | |
| 33 | + | |
34 | 34 | | |
35 | 35 | | |
36 | 36 | | |
37 | | - | |
| 37 | + | |
38 | 38 | | |
39 | 39 | | |
40 | 40 | | |
41 | 41 | | |
42 | 42 | | |
43 | | - | |
44 | | - | |
| 43 | + | |
| 44 | + | |
45 | 45 | | |
46 | 46 | | |
47 | 47 | | |
48 | 48 | | |
49 | 49 | | |
50 | | - | |
| 50 | + | |
51 | 51 | | |
52 | 52 | | |
53 | 53 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
72 | 72 | | |
73 | 73 | | |
74 | 74 | | |
75 | | - | |
| 75 | + | |
76 | 76 | | |
77 | | - | |
78 | | - | |
| 77 | + | |
| 78 | + | |
79 | 79 | | |
80 | 80 | | |
81 | 81 | | |
82 | | - | |
| 82 | + | |
83 | 83 | | |
84 | 84 | | |
85 | 85 | | |
86 | 86 | | |
87 | | - | |
| 87 | + | |
88 | 88 | | |
89 | | - | |
| 89 | + | |
90 | 90 | | |
91 | 91 | | |
92 | 92 | | |
93 | 93 | | |
94 | 94 | | |
95 | 95 | | |
96 | | - | |
97 | | - | |
| 96 | + | |
| 97 | + | |
98 | 98 | | |
99 | 99 | | |
100 | 100 | | |
| |||
107 | 107 | | |
108 | 108 | | |
109 | 109 | | |
110 | | - | |
| 110 | + | |
111 | 111 | | |
112 | | - | |
| 112 | + | |
113 | 113 | | |
114 | 114 | | |
115 | 115 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
30 | 30 | | |
31 | 31 | | |
32 | 32 | | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
33 | 36 | | |
34 | 37 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
16 | 16 | | |
17 | 17 | | |
18 | 18 | | |
| 19 | + | |
19 | 20 | | |
20 | 21 | | |
21 | 22 | | |
| |||
75 | 76 | | |
76 | 77 | | |
77 | 78 | | |
78 | | - | |
| 79 | + | |
79 | 80 | | |
80 | 81 | | |
81 | 82 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
9 | 9 | | |
10 | 10 | | |
11 | 11 | | |
12 | | - | |
| 12 | + | |
13 | 13 | | |
14 | 14 | | |
15 | 15 | | |
| |||
0 commit comments