Update vllm deployment example to use 1 GPU as tensor parallelism is 1 #28

Merged
k8s-ci-robot merged 1 commit into kubernetes-sigs:main from liu-cong:manifest
Oct 28, 2024

Commits on Oct 22, 2024