Skip to content

Commit d71f29c

Browse files
authored
Add instructions for configuring timeout
1 parent 484f19f commit d71f29c

File tree

1 file changed

+22
-0
lines changed

1 file changed

+22
-0
lines changed

site-src/guides/index.md

+22
Original file line numberDiff line numberDiff line change
@@ -199,6 +199,28 @@ This quickstart guide is intended for engineers familiar with k8s and model serv
199199
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/httproute.yaml
200200
```
201201

202+
### Configure Timeouts
203+
204+
Given that default timeouts for above implementations may be insufficient for most inference workloads, it is recommended to configure a timeout appropriate for your intended use case.
205+
206+
=== "GKE"
207+
208+
```bash
209+
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/gke/gcp-backend-policy.yaml
210+
```
211+
212+
=== "Istio"
213+
214+
```bash
215+
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/httproute-with-timeout.yaml
216+
```
217+
218+
=== "Kgateway"
219+
220+
```bash
221+
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/httproute-with-timeout.yaml
222+
```
223+
202224
### Try it out
203225

204226
Wait until the gateway is ready.

0 commit comments

Comments
 (0)