Inference Extension: Use single shared extproc filter #11414

Merged 3 commits into kgateway-dev:v2.0.x from v2.0.x_issue_11389 on Jun 18, 2025

Conversation

danehans (Contributor) commented Jun 14, 2025

Description

Backport of #11390 and the associated #11444, which stabilizes the setup tests.

Change Type

/kind bug_fix

Changelog

NONE

Additional Notes

xref upstream issue: llm-d/llm-d-inference-scheduler#158

@danehans danehans added the kind/bug_fix Categorizes issue or PR as related to a bug. label Jun 14, 2025
danehans (Contributor, Author) commented Jun 17, 2025

Since the v2.0.x branch does not contain any inference extension e2e tests, I cherry-picked #11412 and #11344 from main to test this change against the v2.0.x branch. All tests passed:

# After building and loading the kgateway image into the kind cluster

$ GOLANG_PROTOBUF_REGISTRATION_CONFLICT=ignore go test -v -timeout 600s ./test/kubernetes/e2e/tests -run ^TestInferenceExtension$
...
--- PASS: TestInferenceExtension (225.07s)
    --- PASS: TestInferenceExtension/InferenceExtension (208.00s)
        --- PASS: TestInferenceExtension/InferenceExtension/TestMultiHTTPRouteSingleInferencePool (76.34s)
            --- PASS: TestInferenceExtension/InferenceExtension/TestMultiHTTPRouteSingleInferencePool/OpenAITestCase0 (13.39s)
            --- PASS: TestInferenceExtension/InferenceExtension/TestMultiHTTPRouteSingleInferencePool/OpenAITestCase1 (0.29s)
            --- PASS: TestInferenceExtension/InferenceExtension/TestMultiHTTPRouteSingleInferencePool/OpenAITestCase2 (0.28s)
        --- PASS: TestInferenceExtension/InferenceExtension/TestSingleHTTPRouteMultiInferencePool (74.16s)
            --- PASS: TestInferenceExtension/InferenceExtension/TestSingleHTTPRouteMultiInferencePool/OpenAITestCase0 (13.35s)
            --- PASS: TestInferenceExtension/InferenceExtension/TestSingleHTTPRouteMultiInferencePool/OpenAITestCase1 (0.29s)
            --- PASS: TestInferenceExtension/InferenceExtension/TestSingleHTTPRouteMultiInferencePool/OpenAITestCase2 (0.28s)
        --- PASS: TestInferenceExtension/InferenceExtension/TestSingleHTTPRouteSingleInferencePool (57.49s)
            --- PASS: TestInferenceExtension/InferenceExtension/TestSingleHTTPRouteSingleInferencePool/OpenAITestCase0 (0.30s)
            --- PASS: TestInferenceExtension/InferenceExtension/TestSingleHTTPRouteSingleInferencePool/OpenAITestCase1 (0.16s)
            --- PASS: TestInferenceExtension/InferenceExtension/TestSingleHTTPRouteSingleInferencePool/OpenAITestCase2 (0.14s)
PASS
ok  	github.com/kgateway-dev/kgateway/v2/test/kubernetes/e2e/tests	225.119s

@sam-heilbron sam-heilbron added this pull request to the merge queue Jun 17, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jun 17, 2025
@danehans danehans added this pull request to the merge queue Jun 17, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jun 17, 2025
@danehans danehans added this pull request to the merge queue Jun 17, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jun 17, 2025
@danehans danehans added this pull request to the merge queue Jun 18, 2025
Merged via the queue into kgateway-dev:v2.0.x with commit b4777ea Jun 18, 2025
12 checks passed
@danehans danehans deleted the v2.0.x_issue_11389 branch June 18, 2025 22:24
Labels
kind/bug_fix Categorizes issue or PR as related to a bug.
3 participants