Simplify POC installation #8

liu-cong · 2024-09-26T18:12:32Z

This was adapted from the private POC repo.

This PR separates what needs to installed vs. what resources user creates as sample applications. The installation is simplified to a single gatewayclass.yaml file.

@courageJ encountered some issue when trying the POC, and this PR worked for her. Not sure what didn't work before though :)

k8s-ci-robot · 2024-09-26T18:12:42Z

Welcome @liu-cong!

It looks like this is your first PR to kubernetes-sigs/llm-instance-gateway 🎉. Please refer to our pull request process documentation to help your PR have a smooth ride to approval.

You will be prompted by a bot to use commands during the review process. Do not be afraid to follow the prompts! It is okay to experiment. Here is the bot commands documentation.

You can also check if kubernetes-sigs/llm-instance-gateway has its own contribution guidelines.

You may want to refer to our testing guide if you run into trouble with your tests not passing.

If you are having difficulty getting your pull request seen, please follow the recommended escalation practices. Also, for tips and tricks in the contribution process you may want to read the Kubernetes contributor cheat sheet. We want to make sure your contribution gets all the attention it needs!

Thank you, and welcome to Kubernetes. 😃

kfswain

LGTM with the assumption that this has been validated E2E

kfswain · 2024-09-26T20:07:02Z

examples/poc/manifests/gatewayclass.yaml

-apiVersion: gateway.networking.k8s.io/v1
-kind: Gateway
+apiVersion: apps/v1
+kind: Deployment


gatewayclass as the file name may be misleading if we are putting the ext_proc deployment yaml here

The point is that ext_proc is an implementation detail of this specific gateway class

A reasonable perspective.

However, the core of this PoC is the ext-proc implementation, and I don't think can be considered an implementation detail here.

how about just call it installation.yaml which just bundles everything, or can break it down to multiple yaml files

Either SGTM, thanks!

cool, renamed to installation.yaml. And the README explains what's being installed so this should be good!

liu-cong · 2024-09-26T20:58:57Z

LGTM with the assumption that this has been validated E2E

Yes both @courageJ and I tried this.

kfswain · 2024-09-26T21:24:03Z

/lgtm

Joffref · 2024-09-26T23:27:42Z

examples/poc/README.md

+   Wait until the gateway is ready.
+   ```bash
+   IP=$(kubectl get gateway/llm-gateway -o jsonpath='{.status.addresses[0].value}')
+   PORT=8081


Isn't this wrong, llm-instance-gw is listening on 8080. Am I wrong?

Good question! Actually in the POC setup, Envoy is configured the additional 8081 port for ext proc traffic. Updated README

Ho yes, you're right actually. Sorry, for the confusion.

Joffref · 2024-09-26T23:33:38Z

Overall it looks good to me and it's a great addition. Thank you @liu-cong 😀

Update README on envoy 8081 port

kfswain · 2024-09-27T16:50:20Z

/lgtm

Xunzhuo · 2024-09-29T02:56:05Z

examples/poc/README.md

@@ -1,68 +1,55 @@
 # Envoy Ext Proc Gateway with LoRA Integration

-This project sets up an Envoy gateway to handle gRPC calls with integration of LoRA (Low-Rank Adaptation). The configuration aims to manage gRPC traffic through Envoy's external processing and custom routing based on headers and load balancing rules. The setup includes Kubernetes services and deployments for both the gRPC server and the vllm-lora application.
+This project sets up an Envoy gateway with a custom external processing which  implements advanced routing logic tailored for LoRA (Low-Rank Adaptation) adapters. The routing algorithm is based on the model specified (using Open AI API format), and ensuring efficient load balancing based on model server metrics.


one more extra space between which and implements

terrytangyuan

/lgtm
/approve

k8s-ci-robot · 2024-09-29T02:58:53Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: liu-cong, terrytangyuan

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [terrytangyuan]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot requested review from ahg-g and kfswain September 26, 2024 18:12

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Sep 26, 2024

k8s-ci-robot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Sep 26, 2024

kfswain reviewed Sep 26, 2024

View reviewed changes

liu-cong force-pushed the main branch 2 times, most recently from fc9ce95 to 2097027 Compare September 26, 2024 21:17

k8s-ci-robot assigned kfswain Sep 26, 2024

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 26, 2024

Joffref reviewed Sep 26, 2024

View reviewed changes

k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 27, 2024

Simplify POC installation

d59dfaa

Update README on envoy 8081 port

liu-cong force-pushed the main branch from 3c6d5a1 to d59dfaa Compare September 27, 2024 16:49

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 27, 2024

Xunzhuo reviewed Sep 29, 2024

View reviewed changes

terrytangyuan approved these changes Sep 29, 2024

View reviewed changes

k8s-ci-robot assigned terrytangyuan Sep 29, 2024

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Sep 29, 2024

k8s-ci-robot merged commit 947e44d into kubernetes-sigs:main Sep 29, 2024
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplify POC installation #8

Simplify POC installation #8

liu-cong commented Sep 26, 2024 •

edited

Loading

k8s-ci-robot commented Sep 26, 2024

kfswain left a comment

kfswain Sep 26, 2024

liu-cong Sep 26, 2024

kfswain Sep 26, 2024

liu-cong Sep 26, 2024

kfswain Sep 26, 2024

liu-cong Sep 26, 2024

liu-cong commented Sep 26, 2024

kfswain commented Sep 26, 2024

Joffref Sep 26, 2024

liu-cong Sep 27, 2024

Joffref Sep 27, 2024

Joffref commented Sep 26, 2024

kfswain commented Sep 27, 2024

Xunzhuo Sep 29, 2024

terrytangyuan left a comment

k8s-ci-robot commented Sep 29, 2024

Simplify POC installation #8

Simplify POC installation #8

Conversation

liu-cong commented Sep 26, 2024 • edited Loading

k8s-ci-robot commented Sep 26, 2024

kfswain left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

liu-cong commented Sep 26, 2024

kfswain commented Sep 26, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Joffref commented Sep 26, 2024

kfswain commented Sep 27, 2024

Choose a reason for hiding this comment

terrytangyuan left a comment

Choose a reason for hiding this comment

k8s-ci-robot commented Sep 29, 2024

liu-cong commented Sep 26, 2024 •

edited

Loading