Skip to content

Commit dd40239

Browse files
liu-congahg-g
andauthored
Update site-src/guides/model-server.md
Co-authored-by: Abdullah Gharaibeh <[email protected]>
1 parent c880c68 commit dd40239

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

site-src/guides/model-server.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@ Any model server that conform to the [model server protocol](https://github.com/
1212
|vLLM V1|v0.8.0 and above| [commit bc32bc7](https://github.com/vllm-project/vllm/commit/bc32bc73aad076849ac88565cff745b01b17d89c)| |
1313
Triton(TensorRT-LLM)| TODO| Pending [PR](https://github.com/triton-inference-server/tensorrtllm_backend/pull/725). |LoRA affinity feature is not available as the required LoRA metrics haven't been implemented in Triton yet.|
1414

15-
## Use vLLM
15+
## vLLM
1616

1717
vLLM is configured as the default in the [endpoint picker extension](https://github.com/kubernetes-sigs/gateway-api-inference-extension/tree/main/pkg/epp). No further configuration is required.
1818

0 commit comments

Comments
 (0)