Problem connecting singleuser pod to spark

Sean_Murphy · December 8, 2021, 1:52pm

I’m having an issue connecting to Spark from a singleuser pod launched via Jupyterhub 2.0. I suspect it’s a networking issue.

I can launch a dedicated jupyterlab pod and connect to the Spark instance; I need to set the spark.driver.host to the IP addr of the pod, but once done, Spark can communicate pack to the jupyterlab instance and the job can run.

When I try to do the same in a Jupyterhub context, it looks like connectivity between the Spark workers and the singleuser instance is blocked. I have disabled network policies and the cloudmetadata container; I tried to add some ingress rules but I guess I’m not doing this correctly.

Any pointers on how to troubleshoot such issues?

Thanks, rgds,
Sean.

Sean_Murphy · December 8, 2021, 2:35pm

As often happens, the solution occurred to me a few mins after submitting this post.

I set the spark.driver.port within the singleuser pod and set the allowedIngressPorts to the same value; in this way, spark can talk back to the singleuser instance.

Hope this helps someone!

Rgds,
Sean.

Topic		Replies	Views
Spark Client Mode Integration Zero to JupyterHub on Kubernetes	2	1002	September 19, 2019
Users pods connection to external spark cluster Zero to JupyterHub on Kubernetes jupyterlab , jupyterhub , how-to , help-wanted	1	72	December 9, 2024
Is it posible to expose or open ports on the singleuser pod? Zero to JupyterHub on Kubernetes how-to	0	439	June 24, 2020
JHub + Spark on K8S / Workers cant connect to Drivers Zero to JupyterHub on Kubernetes	0	598	June 25, 2021
Single user server as driver node in spark cluster k8s Zero to JupyterHub on Kubernetes jupyterlab , jupyterhub , how-to , help-wanted	1	919	April 13, 2022

Problem connecting singleuser pod to spark

Related topics