Skip to content

Commit 1c9786f

Browse files
committed
updating InferencePool wording
1 parent b8f5479 commit 1c9786f

File tree

2 files changed

+10
-1
lines changed

2 files changed

+10
-1
lines changed

api/v1alpha1/inferencepool_types.go

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -21,6 +21,10 @@ import (
2121
)
2222

2323
// InferencePool is the Schema for the InferencePools API.
24+
// The InferencePool object is intended to allow for easy maintenance of a set of model servers.
25+
// Best practice is for every model server to share a base model, or, for every model server to be able to serve every 'modelName' that will be available.
26+
// The InferencePool was made for the Inference Platform Admin: https://github.com/kubernetes-sigs/gateway-api-inference-extension/blob/main/docs/proposals/002-api-proposal/proposal.md#inference-platform-admin
27+
// The InferencePool depends on the K8s Gateway, and relies on the gateway controller to manage reconciliation.
2428
//
2529
// +kubebuilder:object:root=true
2630
// +kubebuilder:subresource:status

config/crd/bases/inference.networking.x-k8s.io_inferencepools.yaml

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,12 @@ spec:
1717
- name: v1alpha1
1818
schema:
1919
openAPIV3Schema:
20-
description: InferencePool is the Schema for the InferencePools API.
20+
description: |-
21+
InferencePool is the Schema for the InferencePools API.
22+
The InferencePool object is intended to allow for easy maintenance of a set of model servers.
23+
Best practice is for every model server to share a base model, or, for every model server to be able to serve every 'modelName' that will be available.
24+
The InferencePool was made for the Inference Platform Admin: https://github.com/kubernetes-sigs/gateway-api-inference-extension/blob/main/docs/proposals/002-api-proposal/proposal.md#inference-platform-admin
25+
The InferencePool depends on the K8s Gateway, and relies on the gateway controller to manage reconciliation.
2126
properties:
2227
apiVersion:
2328
description: |-

0 commit comments

Comments
 (0)