Skip to content

remove SchedulingContext, flatten scheduler interfaces #889

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jun 2, 2025

Conversation

nirrozenbaum
Copy link
Contributor

This PR continues the iterative progress of scheduler redesign.
It completely removes the use of SchedulingContext. instead all interfaces were updated with the arguments (instead of wrapping everything inside scheduling context and pass that as one arg). This is aligned with the discussions on scheduler design choices made on #845.
one leftover that is still here is the LLMRequest and LLMResponse structs (should replace with a map[string]any to store arbitrary data), that should be replaced as we continue to make progress. this PR doesn't include getting rid of those and it will be handled in follow up PRs.
additionally, PostCycle has to be removed (after cleaned from Prefix plugin) and PostResponse should be extracted to request control layer. all of this is not part of this PR, as we aim to do the transition iteratively and not one huge PR.

cc: @ahg-g

Copy link

netlify bot commented May 29, 2025

Deploy Preview for gateway-api-inference-extension ready!

Name Link
🔨 Latest commit b6e72a7
🔍 Latest deploy log https://app.netlify.com/projects/gateway-api-inference-extension/deploys/683846eb0e395300080b0103
😎 Deploy Preview https://deploy-preview-889--gateway-api-inference-extension.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@k8s-ci-robot k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label May 29, 2025
@k8s-ci-robot k8s-ci-robot requested review from Jeffwan and robscott May 29, 2025 11:37
@k8s-ci-robot k8s-ci-robot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label May 29, 2025
@ahg-g
Copy link
Contributor

ahg-g commented Jun 2, 2025

/lgtm
/approve

Thanks!

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Jun 2, 2025
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ahg-g, nirrozenbaum

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jun 2, 2025
@k8s-ci-robot k8s-ci-robot merged commit 7617439 into kubernetes-sigs:main Jun 2, 2025
8 checks passed
@nirrozenbaum nirrozenbaum deleted the rm-sched-ctx branch June 3, 2025 08:20
irar2 pushed a commit to irar2/gateway-api-inference-extension that referenced this pull request Jun 3, 2025
shmuelk pushed a commit to shmuelk/gateway-api-inference-extension that referenced this pull request Jun 3, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants