Skip to content

Commit bb516eb

Browse files
committed
updating criticality comment to link to discussion issue
1 parent 1c9786f commit bb516eb

File tree

1 file changed

+1
-4
lines changed

1 file changed

+1
-4
lines changed

api/v1alpha1/inferencemodel_types.go

Lines changed: 1 addition & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -74,10 +74,7 @@ type InferenceModelSpec struct {
7474
ModelName string `json:"modelName"`
7575

7676
// Criticality defines how important it is to serve the model compared to other models referencing the same pool.
77-
// Criticality impacts how traffic is handled in resource constrained situations. It handles this by
78-
// queuing or rejecting requests of lower criticality. InferenceModels of an equivalent Criticality will
79-
// fairly share resources over throughput of tokens. In the future, the metric used to calculate fairness,
80-
// and the proportionality of fairness will be configurable.
77+
// TODO: Update field upon resolution of: https://github.com/kubernetes-sigs/gateway-api-inference-extension/issues/213
8178
//
8279
// Default values for this field will not be set, to allow for future additions of new field that may 'one of' with this field.
8380
// Any implementations that may consume this field may treat an unset value as the 'Standard' range.

0 commit comments

Comments
 (0)