You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Note that the provider name is needed to deploy provider-specific resources. If no provider is specified, then only the InferencePool object and the EPP are deployed.
25
+
23
26
## Uninstall
24
27
25
28
Run the following command to uninstall the chart:
@@ -34,7 +37,6 @@ The following table list the configurable parameters of the chart.
|`inferencePool.name`| Name for the InferencePool, and endpoint picker deployment and service will be named as `{.Release.name}-epp`. |
38
40
|`inferencePool.targetPortNumber`| Target port number for the vllm backends, will be used to scrape metrics by the inference extension. Defaults to 8000. |
39
41
|`inferencePool.modelServers.matchLabels`| Label selector to match vllm backends managed by the inference pool. |
40
42
|`inferenceExtension.replicas`| Number of replicas for the endpoint picker extension service. Defaults to `1`. |
@@ -43,6 +45,7 @@ The following table list the configurable parameters of the chart.
43
45
|`inferenceExtension.image.tag`| Image tag of the endpoint picker. |
44
46
|`inferenceExtension.image.pullPolicy`| Image pull policy for the container. Possible values: `Always`, `IfNotPresent`, or `Never`. Defaults to `Always`. |
45
47
|`inferenceExtension.extProcPort`| Port where the endpoint picker service is served for external processing. Defaults to `9002`. |
48
+
|`provider.name`| Name of the Inference Gateway implementation being used. Possible values: `gke`. Defaults to `none`. |
0 commit comments