You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: Dockerfile
+2-1Lines changed: 2 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -9,6 +9,7 @@ ENV CGO_ENABLED=0
9
9
ENV GOOS=linux
10
10
ENV GOARCH=amd64
11
11
ARG COMMIT_SHA=unknown
12
+
ARG BUILD_REF
12
13
13
14
# Dependencies
14
15
WORKDIR /src
@@ -21,7 +22,7 @@ COPY pkg/epp ./pkg/epp
21
22
COPY internal ./internal
22
23
COPY api ./api
23
24
WORKDIR /src/cmd
24
-
RUN go build -ldflags="-X sigs.k8s.io/gateway-api-inference-extension/pkg/epp/metrics.CommitSHA=${COMMIT_SHA}" -o /epp
25
+
RUN go build -ldflags="-X sigs.k8s.io/gateway-api-inference-extension/pkg/epp/metrics.CommitSHA=${COMMIT_SHA} -X sigs.k8s.io/gateway-api-inference-extension/pkg/epp/metrics.BuildRef=${BUILD_REF}" -o /epp
| inference_model_running_requests | Gauge | Number of running requests for each model. |`model_name`=<model-name>| ALPHA |
35
35
| inference_pool_average_kv_cache_utilization | Gauge | The average kv cache utilization for an inference server pool. |`name`=<inference-pool-name>| ALPHA |
36
36
| inference_pool_average_queue_size | Gauge | The average number of requests pending in the model server queue. |`name`=<inference-pool-name>| ALPHA |
37
-
| inference_pool_per_pod_queue_size | Gauge | The total number of queue for each model server pod under the inference pool |`model_server_pod`=<model-server-pod-name>`name`=<inference-pool-name>| ALPHA |
37
+
| inference_pool_per_pod_queue_size | Gauge | The total number of queue for each model server pod under the inference pool |`model_server_pod`=<model-server-pod-name><br> `name`=<inference-pool-name>| ALPHA |
38
38
| inference_pool_ready_pods | Gauge | The number of ready pods for an inference server pool. |`name`=<inference-pool-name>| ALPHA |
39
-
| inference_extension_info | Gauge | The general information of the current build. |`commit`=<hash-of-the-build>| ALPHA |
39
+
| inference_extension_info | Gauge | The general information of the current build. |`commit`=<hash-of-the-build><br> `build_ref`=<ref-to-the-build>| ALPHA |
0 commit comments