InferencePool status should track the number of ready endpoints #342

Closed
ahg-g opened this issue Feb 15, 2025 · 4 comments
Labels
kind/feature Categorizes issue or PR as related to a new feature.

Comments

@ahg-g
Contributor

ahg-g commented Feb 15, 2025

What would you like to be added:

InferencePool status should track the number of ready endpoints

Why is this needed:

To track the health of the InferencePool endpoints.
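
A minimal sketch of what such a status field could look like, following the usual Kubernetes readiness-counter conventions; the field name, API group, and version here are illustrative assumptions, not the actual API:

```go
package v1alpha2 // API group/version assumed for illustration

import (
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// InferencePoolStatus sketches how a ready-endpoint counter could be surfaced.
type InferencePoolStatus struct {
	// ReadyEndpoints is the number of endpoints selected by the pool that are
	// currently passing readiness checks. (Hypothetical field.)
	// +optional
	ReadyEndpoints int32 `json:"readyEndpoints,omitempty"`

	// Conditions describe the current state of the pool.
	// +optional
	Conditions []metav1.Condition `json:"conditions,omitempty"`
}
```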

@kfswain added the kind/feature label Feb 19, 2025
@ahg-g
Contributor Author

ahg-g commented Feb 20, 2025

The question is what component should do that?

@hzxuzhonghu
Member

From my perspective, since the EPP already monitors the model server metrics, it should be able to do the status check asynchronously as well.
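
A minimal sketch of what that could look like in the EPP, assuming a controller-runtime client and an existing source of per-endpoint health; the `readyEndpoints` status field, the `countReady` callback, and the group/version are assumptions for illustration:

```go
package epp // illustrative only

import (
	"context"
	"time"

	"k8s.io/apimachinery/pkg/apis/meta/v1/unstructured"
	"k8s.io/apimachinery/pkg/runtime/schema"
	"sigs.k8s.io/controller-runtime/pkg/client"
)

// inferencePoolGVK identifies the InferencePool resource (group/version assumed).
var inferencePoolGVK = schema.GroupVersionKind{
	Group:   "inference.networking.x-k8s.io",
	Version: "v1alpha2",
	Kind:    "InferencePool",
}

// updateReadyEndpoints periodically writes a ready-endpoint count into the
// pool's status, reusing whatever health signal the EPP already tracks.
func updateReadyEndpoints(ctx context.Context, c client.Client, key client.ObjectKey, countReady func() int64) error {
	ticker := time.NewTicker(10 * time.Second)
	defer ticker.Stop()
	for {
		select {
		case <-ctx.Done():
			return ctx.Err()
		case <-ticker.C:
			pool := &unstructured.Unstructured{}
			pool.SetGroupVersionKind(inferencePoolGVK)
			if err := c.Get(ctx, key, pool); err != nil {
				return err
			}
			// Hypothetical status field; see the sketch above.
			if err := unstructured.SetNestedField(pool.Object, countReady(), "status", "readyEndpoints"); err != nil {
				return err
			}
			if err := c.Status().Update(ctx, pool); err != nil {
				return err
			}
		}
	}
}
```

Using an unstructured object here just keeps the sketch independent of the generated API types.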

@Kuromesi
Contributor

Can we set up our own informer specifically for the InferencePool instead of using the one provided by controller-runtime? That would make it easier to trigger InferencePool updates. I also think we should surface more info in the InferencePool status, such as endpoint metrics, available models, etc., which would be more user-friendly. A customized informer would let us add events that trigger InferencePool reconciliation after endpoints or models are updated.

A customized informer may also help resolve #369, since we would have better control over the lifecycle of the informers.
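
A minimal sketch of such a dedicated informer, assuming client-go's dynamic client; the group/version/resource and the `onChange` callback are assumptions, and wiring the events into a reconcile queue is left out:

```go
package epp // illustrative only

import (
	"time"

	"k8s.io/apimachinery/pkg/runtime/schema"
	"k8s.io/client-go/dynamic"
	"k8s.io/client-go/dynamic/dynamicinformer"
	"k8s.io/client-go/tools/cache"
)

// inferencePoolGVR identifies the InferencePool resource (group/version assumed).
var inferencePoolGVR = schema.GroupVersionResource{
	Group:    "inference.networking.x-k8s.io",
	Version:  "v1alpha2",
	Resource: "inferencepools",
}

// startInferencePoolInformer runs an informer for InferencePool outside
// controller-runtime's shared cache, so its lifecycle and resync period can be
// controlled directly and extra events (endpoint/model changes) can be fed in.
func startInferencePoolInformer(dc dynamic.Interface, stopCh <-chan struct{}, onChange func(obj interface{})) cache.SharedIndexInformer {
	factory := dynamicinformer.NewDynamicSharedInformerFactory(dc, 30*time.Second)
	inf := factory.ForResource(inferencePoolGVR).Informer()
	inf.AddEventHandler(cache.ResourceEventHandlerFuncs{
		AddFunc:    onChange,
		UpdateFunc: func(_, newObj interface{}) { onChange(newObj) },
		DeleteFunc: onChange,
	})
	factory.Start(stopCh)
	factory.WaitForCacheSync(stopCh)
	return inf
}
```

Because the factory is created and started here rather than inside controller-runtime's manager, its resync period and stop channel can be managed independently, which is the lifecycle control mentioned for #369.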

@ahg-g
Contributor Author

ahg-g commented Feb 26, 2025

We discussed in the last community meeting that managing the InferencePool object status is not the responsibility of the EPP; it is the responsibility of the gateway controller.

Also, if we view the InferencePool as a lightweight version of the Service API, then tracking the number of ready endpoints is not actually in scope for the InferencePool status. Users can look at the Deployment status to check that.
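
For reference, a minimal sketch of reading that information from the Deployment backing the pool (client setup and names assumed):

```go
package epp // illustrative only

import (
	"context"
	"fmt"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// printReadyReplicas reports how many replicas of the model-server Deployment
// are ready, which is where this information already lives today.
func printReadyReplicas(ctx context.Context, cs kubernetes.Interface, namespace, name string) error {
	dep, err := cs.AppsV1().Deployments(namespace).Get(ctx, name, metav1.GetOptions{})
	if err != nil {
		return err
	}
	fmt.Printf("%s/%s: %d/%d replicas ready\n", namespace, name, dep.Status.ReadyReplicas, dep.Status.Replicas)
	return nil
}
```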
