Commit 9861129

Add the base model of the cpu vllm sample app to InferenceModel.yaml (#481)
1 parent 32e03ec commit 9861129

1 file changed: +11 -0 lines changed

config/manifests/inferencemodel.yaml

@@ -21,3 +21,14 @@ spec:
   criticality: Critical
   poolRef:
     name: my-pool
+
+---
+apiVersion: inference.networking.x-k8s.io/v1alpha2
+kind: InferenceModel
+metadata:
+  name: inferencemodel-base-model-cpu
+spec:
+  modelName: Qwen/Qwen2.5-1.5B-Instruct
+  criticality: Critical
+  poolRef:
+    name: my-pool

0 commit comments