File tree 1 file changed +7
-3
lines changed
1 file changed +7
-3
lines changed Original file line number Diff line number Diff line change @@ -22,6 +22,11 @@ To enable LoadAwareScorer, the following env vars must be configured:
22
22
export ENABLE_LOAD_AWARE_SCORER=true
23
23
export LOAD_AWARE_SCORER_WEIGHT=1.0
24
24
```
25
+
26
+ To enable PDFilter, the following env var must be configured:
27
+ ```
28
+ export ENABLE_PD_FILTER=true
29
+ ```
25
30
---
26
31
[ Inference Gateways ] :#concepts-and-definitions
27
32
@@ -96,8 +101,8 @@ See our website at https://gateway-api-inference-extension.sigs.k8s.io/ for deta
96
101
## Roadmap
97
102
98
103
As Inference Gateway builds towards a GA release. We will continue to expand our capabilities, namely:
99
- 1 . Prefix-cache aware load balancing with interfaces for remote caches
100
- 1 . Recommended LoRA adapter pipeline for automated rollout
104
+ 1 . Prefix-cache aware load balancing with interfaces for remote caches
105
+ 1 . Recommended LoRA adapter pipeline for automated rollout
101
106
1 . Fairness and priority between workloads within the same criticality band
102
107
1 . HPA support for autoscaling on aggregate metrics derived from the load balancer
103
108
1 . Support for large multi-modal inputs and outputs
@@ -121,4 +126,3 @@ Contributions are readily welcomed, follow the [dev guide](./docs/dev.md) to sta
121
126
### Code of conduct
122
127
123
128
Participation in the Kubernetes community is governed by the [ Kubernetes Code of Conduct] ( code-of-conduct.md ) .
124
-
You can’t perform that action at this time.
0 commit comments