Commit 01c043e

update readme
1 parent e2f398a commit 01c043e

1 file changed: +7 -3 lines changed

README.md (+7 -3)

@@ -22,6 +22,11 @@ To enable LoadAwareScorer, the following env vars must be configured:
 export ENABLE_LOAD_AWARE_SCORER=true
 export LOAD_AWARE_SCORER_WEIGHT=1.0
 ```
+
+To enable PDFilter, the following env var must be configured:
+```
+export ENABLE_PD_FILTER=true
+```
 ---
 [Inference Gateways]:#concepts-and-definitions
 
@@ -96,8 +101,8 @@ See our website at https://gateway-api-inference-extension.sigs.k8s.io/ for deta
 ## Roadmap
 
 As Inference Gateway builds towards a GA release. We will continue to expand our capabilities, namely:
-1. Prefix-cache aware load balancing with interfaces for remote caches
-1. Recommended LoRA adapter pipeline for automated rollout
+1. Prefix-cache aware load balancing with interfaces for remote caches
+1. Recommended LoRA adapter pipeline for automated rollout
 1. Fairness and priority between workloads within the same criticality band
 1. HPA support for autoscaling on aggregate metrics derived from the load balancer
 1. Support for large multi-modal inputs and outputs
@@ -121,4 +126,3 @@ Contributions are readily welcomed, follow the [dev guide](./docs/dev.md) to sta
 ### Code of conduct
 
 Participation in the Kubernetes community is governed by the [Kubernetes Code of Conduct](code-of-conduct.md).
-