Skip to content

Commit 5f4c962

Browse files
committed
Adding terminationGracePeriodSeconds to match vLLMs
1 parent 4ff391b commit 5f4c962

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

config/manifests/inferencepool-resources.yaml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -42,6 +42,8 @@ spec:
4242
labels:
4343
app: vllm-llama3-8b-instruct-epp
4444
spec:
45+
# Conservatively, this timeout should mirror the longest grace period of the pods within the pool
46+
terminationGracePeriodSeconds: 130
4547
containers:
4648
- name: epp
4749
image: us-central1-docker.pkg.dev/k8s-staging-images/gateway-api-inference-extension/epp:main

0 commit comments

Comments
 (0)