You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Note that the provider name is needed to deploy provider-specific resources. If no provider is specified, then only the InferencePool object and the EPP are deployed.
24
24
25
+
### Install with Custom Cmd-line Flags
26
+
27
+
To set cmd-line flags, you can use the `--set` option to set each flag, e.g.,:
Alternatively, you can define flags in the `values.yaml` file:
38
+
39
+
```yaml
40
+
inferenceExtension:
41
+
flags:
42
+
FLAG_NAME: <FLAG_VALUE>
43
+
v: 3## Log verbosity
44
+
...
45
+
```
46
+
25
47
### Install with Custom Environment Variables
26
48
27
49
To set custom environment variables for the EndpointPicker deployment, you can define them as free-form YAML in the `values.yaml` file:
@@ -182,7 +204,7 @@ The following table list the configurable parameters of the chart.
182
204
| `inferenceExtension.env` | List of environment variables to set in the endpoint picker container as free-form YAML. Defaults to `[]`. |
183
205
| `inferenceExtension.extraContainerPorts` | List of additional container ports to expose. Defaults to `[]`. |
184
206
| `inferenceExtension.extraServicePorts` | List of additional service ports to expose. Defaults to `[]`. |
185
-
| `inferenceExtension.flags` | List of flags which are passed through to endpoint picker. Example flags, enable-pprof, grpc-port etc. Refer [runner.go](https://github.com/kubernetes-sigs/gateway-api-inference-extension/blob/main/cmd/epp/runner/runner.go) for complete list. |
207
+
| `inferenceExtension.flags` | map of flags which are passed through to endpoint picker. Example flags, enable-pprof, grpc-port etc. Refer [runner.go](https://github.com/kubernetes-sigs/gateway-api-inference-extension/blob/main/cmd/epp/runner/runner.go) for complete list. |
186
208
| `inferenceExtension.affinity` | Affinity for the endpoint picker. Defaults to `{}`. |
187
209
| `inferenceExtension.tolerations` | Tolerations for the endpoint picker. Defaults to `[]`. |
188
210
| `inferenceExtension.monitoring.interval` | Metrics scraping interval for monitoring. Defaults to `10s`. |
0 commit comments