Commit 5be1f1d

JeffLuoo authored and kfswain committed
[Metrics] Add grafana dashboard for inference extension and vLLM metrics (kubernetes-sigs#237)
1 parent d737d1a commit 5be1f1d

4 files changed: +1382 -0 lines changed

tools/dashboards/README.md

Lines changed: 27 additions & 0 deletions
@@ -0,0 +1,27 @@
# Documentation

This document explains how to set up Grafana dashboards to view metrics emitted by the inference extension and the model servers.

## Requirements

Follow the [metrics](https://github.com/kubernetes-sigs/gateway-api-inference-extension/tree/main/pkg/ext-proc/metrics) page to configure the proxy so that all metrics are enabled.

## Load Inference Extension dashboard into Grafana

Follow the [Grafana instructions](https://grafana.com/docs/grafana/latest/dashboards/build-dashboards/import-dashboards/) to import the dashboard JSON.
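
As an alternative to the UI flow, the import can be scripted against Grafana's HTTP API. The following is a minimal sketch, assuming a reachable Grafana instance and an API token with permission to create dashboards; the helper name `build_import_request`, the URL, the token, and the file name in the usage comment are hypothetical placeholders, not part of this repository. It only builds the request for the `POST /api/dashboards/db` endpoint and does not resolve any data source placeholders the dashboard JSON may contain, which the UI import handles for you.

```python
import json
import urllib.request


def build_import_request(grafana_url, api_token, dashboard):
    """Build a POST request for Grafana's /api/dashboards/db endpoint.

    grafana_url and api_token are placeholders supplied by the caller;
    dashboard is the parsed dashboard JSON (a dict).
    """
    # Clear the numeric id so Grafana creates the dashboard (or matches it
    # by uid) instead of looking up an id from another instance.
    dashboard = dict(dashboard, id=None)
    payload = {"dashboard": dashboard, "overwrite": True}
    return urllib.request.Request(
        url=grafana_url.rstrip("/") + "/api/dashboards/db",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": "Bearer " + api_token,
        },
        method="POST",
    )


# Usage (hypothetical file name, URL, and token; substitute your own):
#
#   with open("dashboard.json") as f:
#       req = build_import_request("http://localhost:3000", "YOUR_TOKEN", json.load(f))
#   with urllib.request.urlopen(req) as resp:
#       print(resp.status)
```

Keeping request construction separate from the network call makes the payload easy to inspect before anything is sent to Grafana.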

## Configure Google Managed Prometheus as source for metrics

If you run the inference gateway with [Google Managed Prometheus](https://cloud.google.com/stackdriver/docs/managed-prometheus), follow the [instructions](https://cloud.google.com/stackdriver/docs/managed-prometheus/query) to configure Google Managed Prometheus as the data source for the Grafana dashboard.

## Troubleshooting

### No data in graph

Set the `scrape_interval` in your Prometheus configuration to a value lower than `15s`. The `rate` function returns an empty result when fewer than two samples fall within the query's range window, which happens when scrapes are too far apart. See https://www.robustperception.io/what-range-should-i-use-with-rate/ for more details.

Example:
```
global:
  scrape_interval: 5s
```
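
To see why a shorter scrape interval helps, the sketch below estimates how many samples a `rate` range window is guaranteed to contain. It is a simplified model, not Prometheus code: the function names are hypothetical, the range values are illustrative rather than taken from the dashboard, and it ignores exact window-boundary alignment and missed scrapes.

```python
def worst_case_samples(range_seconds, scrape_interval_seconds):
    """Fewest samples a window of range_seconds can contain when a target
    is scraped every scrape_interval_seconds (simplified model)."""
    return range_seconds // scrape_interval_seconds


def rate_has_data(range_seconds, scrape_interval_seconds):
    """rate() needs at least two samples inside the window to return a value."""
    return worst_case_samples(range_seconds, scrape_interval_seconds) >= 2


# Illustrative values only: a 20s range with 15s scrapes can leave a single
# sample in the window, while 5s scrapes always leave several.
print(rate_has_data(20, 15))
print(rate_has_data(20, 5))
```

In this model a window must span at least two scrape intervals; the article linked above recommends a wider margin so that an occasional failed scrape does not leave gaps.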
