[WIP] Add Support for notebooks/spark operator to manifests #3223

fresende · 2025-08-21T05:28:19Z

Pull Request Template for Kubeflow Manifests

✏️ Summary of Changes

Add Support for notebooks/spark operator to manifests

This is a work in progress to automate the installation of Spark operator integration with notebooks. We are currently having issues when starting the kernel, where it needs to communicate back to the enterprise gateway, but it silently fails. I believe it's related to isito and would appreciate some help.

Connection Flow

Kernel starts and encrypts its connection details.
Kernel sends those details back to Enterprise Gateway.
Enterprise Gateway decrypts and reads the info.
Gateway passes the connection info to the kernel’s proxy.
Gateway uses that info to connect to the kernel’s ports (shell, iopub, stdin, heartbeat, control).

✅ Contributor Checklist

I have tested these changes with kustomize. See Installation Prerequisites.
All commits are signed-off to satisfy the DCO check.
I have considered adding my company to the adopters page to support Kubeflow and help the community, since I expect help from the community for my issue (see 1. and 2.).

Signed-off-by: Fellipe Resende <[email protected]>

google-oss-prow · 2025-08-21T05:28:26Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign juliusvonkohout for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

applications/spark/OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

juliusvonkohout · 2025-08-21T21:19:23Z

Why do you not use the modern standard spark-connect with interactive session support instead of the enterprise gateway?

tarekabouzeid · 2025-08-22T09:19:52Z

Why do you not use the modern standard spark-connect with interactive session support instead of the enterprise gateway?

I think spark connect doesn't cover the full Spark API or multi user isolation needs compared to JEG.

juliusvonkohout · 2025-08-22T10:05:12Z

Why do you not use the modern standard spark-connect with interactive session support instead of the enterprise gateway?

I think spark connect doesn't cover the full Spark API or multi user isolation needs compared to JEG.

We isolate per namespace, so why is spark-connect not multi-tenant? CC @vikas-saxena02

tarekabouzeid · 2025-08-22T11:41:33Z

Why do you not use the modern standard spark-connect with interactive session support instead of the enterprise gateway?

I think spark connect doesn't cover the full Spark API or multi user isolation needs compared to JEG.

We isolate per namespace, so why is spark-connect not multi-tenant? CC @vikas-saxena02

If spark connect is installed per namespace, then yes. But i am not 100% sure if different notebook kernels can then have different spark drivers.

That will be nice to investigate it further. Maybe @fresende have already looked into.

juliusvonkohout · 2025-08-22T11:55:18Z

The goal is to have spark separated per namespace. so spark-cluster and spark-connect is deployed per namespace and only the Jupyterlabs in the namespace can access it.

vikas-saxena02 · 2025-08-22T12:15:13Z

I have experimented with deploying spark-connect deployed separately as a service.. I could do it on a namespace level. But I advent reis it with the new spark-connect crd which was added recently. Thanks and regards, Vikas Saxena.

…

On Fri, 22 Aug 2025, 9:55 pm Julius von Kohout, ***@***.***> wrote: *juliusvonkohout* left a comment (kubeflow/manifests#3223) <#3223 (comment)> The goal is to have spark separated per namespace. so spark-cluster and spark-connect is deployed per namespace and only the Jupyterlabs in the namespace can access it. — Reply to this email directly, view it on GitHub <#3223 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AVSEDXRQCGS3756IR6IFZQD3O4AL3AVCNFSM6AAAAACENQPRMWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZTEMJUGEYDQMZQGI> . You are receiving this because you were mentioned.Message ID: ***@***.***>

tarekabouzeid · 2025-08-24T15:31:01Z

I have experimented with deploying spark-connect deployed separately as a service.. I could do it on a namespace level. But I advent reis it with the new spark-connect crd which was added recently. Thanks and regards, Vikas Saxena.
…
On Fri, 22 Aug 2025, 9:55 pm Julius von Kohout, @.> wrote: juliusvonkohout left a comment (kubeflow/manifests#3223) <#3223 (comment)> The goal is to have spark separated per namespace. so spark-cluster and spark-connect is deployed per namespace and only the Jupyterlabs in the namespace can access it. — Reply to this email directly, view it on GitHub <#3223 (comment)>, or unsubscribe https://github.com/notifications/unsubscribe-auth/AVSEDXRQCGS3756IR6IFZQD3O4AL3AVCNFSM6AAAAACENQPRMWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZTEMJUGEYDQMZQGI . You are receiving this because you were mentioned.Message ID: @.>

Do you mean for each notebook in that namespace will have its own spark connect and spark application?

lresende · 2025-08-24T22:22:05Z

The classic Jupyter + IPython kernel approach with PySpark would allow each user to run their own Spark driver instance, giving them full control over configuration, context, and resource usage without interference from others. It’s a mature, well-proven model that supports the entire PySpark API surface, including advanced features and low-level tuning not yet available through Spark Connect. This setup ensures predictable behavior, easier debugging, and maximum compatibility with existing Spark workflows and Jupyter integrations.

SparkConnect, on the other hand (which I believe came to replace Apache Livy) has its merit and its very good for providing a shared Spark as a service.

We can definitely continue investigating integration with Spark Connect further, but I believe it should happen in parallel as they will probably be used for different user cases.

applications/spark/spark-operator/base/resources.yaml

juliusvonkohout · 2025-09-04T18:18:47Z

applications/spark/spark-operator/base/resources.yaml

This files comes from https://github.com/fresende/kubeflow-manifests/blob/install-eg/scripts/synchronize-spark-operator-manifests.sh and must not be modified. Please create a customize overlay. We also need proper tests.

And you need to sign your commits.

Add Support for notebooks/spark operator to mainfests

12ed156

Signed-off-by: Fellipe Resende <[email protected]>

google-oss-prow bot added the do-not-merge/work-in-progress label Aug 21, 2025

google-oss-prow bot requested review from juliusvonkohout and kimwnasptd August 21, 2025 05:28

google-oss-prow bot added the size/L label Aug 21, 2025

fresende changed the title ~~[WIP] Add Support for notebooks/spark operator to mainfests~~ [WIP] Add Support for notebooks/spark operator to manifests Aug 21, 2025

juliusvonkohout reviewed Aug 30, 2025

View reviewed changes

applications/spark/spark-operator/base/resources.yaml Outdated Show resolved Hide resolved

remove kernel image puller

42ee864

juliusvonkohout reviewed Sep 4, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP] Add Support for notebooks/spark operator to manifests #3223

[WIP] Add Support for notebooks/spark operator to manifests #3223

fresende commented Aug 21, 2025

Uh oh!

google-oss-prow bot commented Aug 21, 2025

Uh oh!

juliusvonkohout commented Aug 21, 2025 •

edited

Loading

Uh oh!

tarekabouzeid commented Aug 22, 2025

Uh oh!

juliusvonkohout commented Aug 22, 2025

Uh oh!

tarekabouzeid commented Aug 22, 2025

Uh oh!

juliusvonkohout commented Aug 22, 2025 •

edited

Loading

Uh oh!

vikas-saxena02 commented Aug 22, 2025 via email

Uh oh!

tarekabouzeid commented Aug 24, 2025

Uh oh!

lresende commented Aug 24, 2025

Uh oh!

Uh oh!

juliusvonkohout Sep 4, 2025

Uh oh!

juliusvonkohout Sep 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[WIP] Add Support for notebooks/spark operator to manifests #3223

Are you sure you want to change the base?

[WIP] Add Support for notebooks/spark operator to manifests #3223

Conversation

fresende commented Aug 21, 2025

Pull Request Template for Kubeflow Manifests

✏️ Summary of Changes

✅ Contributor Checklist

Uh oh!

google-oss-prow bot commented Aug 21, 2025

Uh oh!

juliusvonkohout commented Aug 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tarekabouzeid commented Aug 22, 2025

Uh oh!

juliusvonkohout commented Aug 22, 2025

Uh oh!

tarekabouzeid commented Aug 22, 2025

Uh oh!

juliusvonkohout commented Aug 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vikas-saxena02 commented Aug 22, 2025 via email

Uh oh!

tarekabouzeid commented Aug 24, 2025

Uh oh!

lresende commented Aug 24, 2025

Uh oh!

Uh oh!

juliusvonkohout Sep 4, 2025

Choose a reason for hiding this comment

Uh oh!

juliusvonkohout Sep 4, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

juliusvonkohout commented Aug 21, 2025 •

edited

Loading

juliusvonkohout commented Aug 22, 2025 •

edited

Loading