Commit 8bd3485

Update site-src/guides/model-server.md

Authored by: liu-congahg-g
Co-authored-by: Abdullah Gharaibeh <[email protected]>
1 parent: 25a61d8

File tree

1 file changed: 1 addition, 1 deletion


site-src/guides/model-server.md (1 addition, 1 deletion)
````diff
@@ -18,7 +18,7 @@ vLLM is configured as the default in the [endpoint picker extension](https://git
 
 ## Triton with TensorRT-LLM Backend
 
-You need to specify the metric names when starting the EPP container. Add the following to the `args` of the [EPP deployment](https://github.com/kubernetes-sigs/gateway-api-inference-extension/blob/296247b07feed430458b8e0e3f496055a88f5e89/config/manifests/inferencepool.yaml#L48).
+Specify the metric names when starting the EPP container by adding the following to the `args` of the [EPP deployment](https://github.com/kubernetes-sigs/gateway-api-inference-extension/blob/296247b07feed430458b8e0e3f496055a88f5e89/config/manifests/inferencepool.yaml#L48).
 
 ```
 - -totalQueuedRequestsMetric
 - "nv_trt_llm_request_metrics{request_type=waiting}"
````
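For context, here is a sketch of how such flags could sit inside the EPP Deployment manifest. The Deployment name, container name, and image below are illustrative placeholders, not taken from this commit; only the two `args` entries come from the documentation being edited.

```yaml
# Illustrative fragment only: metadata names and image are assumptions.
# The real manifest lives at config/manifests/inferencepool.yaml in the repo.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: epp            # placeholder name
spec:
  template:
    spec:
      containers:
        - name: epp    # placeholder container name
          image: epp-image:latest  # placeholder image
          args:
            # Metric names for the Triton/TensorRT-LLM backend, as in the diff:
            - -totalQueuedRequestsMetric
            - "nv_trt_llm_request_metrics{request_type=waiting}"
```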
