You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
make dynamic lora sidecar health check parameters configurable and force reconcile (kubernetes-sigs#605)
* update benchmarking guide with latest results with vllm v1
* update graph
* make dynamic lora sidecar health check parameters configurable and forrce reconcile
* update screenshots
* make the health and refresh params in sidecar cmd line argument
3.**Mount the configmap and configure sidecar in your pod**
39
-
```yaml
40
-
volumeMounts: # DO NOT USE subPath
41
-
- name: config-volume
42
-
mountPath: /config
43
-
```
44
-
Do not use subPath, since configmap updates are not reflected in the file
44
+
```yaml
45
+
volumeMounts: # DO NOT USE subPath
46
+
- name: config-volume
47
+
mountPath: /config
48
+
```
49
+
Do not use subPath, since configmap updates are not reflected in the file
45
50
46
-
[deployment]: deployment.yaml it uses [sidecar](https://kubernetes.io/docs/concepts/workloads/pods/sidecar-containers/)(`initContainer` with `restartPolicy`set to `always`) which is beta feature enabled by default since k8s version 1.29. They need to be enabled in 1.28 and prior to 1.28 sidecar are not officially supported.
51
+
## Command Line Arguments
52
+
53
+
The sidecar supports the following command-line arguments:
54
+
55
+
- `--health-check-timeout`: Maximum time in seconds to wait for the vLLM server health check (default: 300)
56
+
- `--health-check-interval`: Interval in seconds between health check attempts (default: 2)
57
+
- `--reconcile-trigger`: Time in seconds between forced reconciliation runs (default: 5)
58
+
- `--config`: Path to the config map file (default: value from DYNAMIC_LORA_ROLLOUT_CONFIG env var or "/config/configmap.yaml")
Note: This uses [sidecar](https://kubernetes.io/docs/concepts/workloads/pods/sidecar-containers/)(`initContainer` with `restartPolicy` set to `always`) which is beta feature enabled by default since k8s version 1.29. They need to be enabled in 1.28 and prior to 1.28 sidecar are not officially supported.
66
110
67
111
## Screenshots & Testing
68
112
The sidecar was tested with the Deployment and ConfigMap specified in this repo. Here are screen grabs of the logs from the sidecar and vllm server. One can verify that the adapters were loaded by querying `v1/models` and looking at vllm logs.
0 commit comments