Opinion on Pipeline Solutions for Long-Running Jupyter Notebooks and Python Scripts with Z2JH

Dear Community,

In our research lab, we have been using Z2JH (on k8s) and have found it to be a valuable tool for our work. I want to express my gratitude for the effort put into creating such a useful tool.

Recently, I have been exploring ways to execute Jupyter Notebooks or Python scripts uninterrupted for days on the same infrastructure as Z2JH. I have found two viable solutions: SLURM on top of Kubernetes and Kubeflow with the Kale plugin. I have included links to relevant resources for each solution.

  • SLURM on top of Kubernetes [link]
  • Kubeflow with the Kale plugin [demo video]

My goal is to find a solution that requires minimal effort to shift from Z2JH to the new pipeline system. However, I am unsure if these solutions are compatible with Z2JH. I would greatly appreciate any thoughts, opinions, or experiences that the community may have regarding these solutions.
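For concreteness, the kind of low-effort hand-off I have in mind is submitting the long-running work as a plain Kubernetes Job on the same cluster, straight from the notebook pod. This is only a sketch of the idea, assuming the `kubernetes` Python client is installed, the single-user pod's service account is allowed to create Jobs (which is not the case with default Z2JH RBAC), and hypothetical image, PVC, and namespace names:

```python
from kubernetes import client, config

# Runs inside the Z2JH single-user pod; assumes the pod's service account may
# create Jobs (NOT granted by default Z2JH RBAC). Image, PVC, and namespace
# names below are hypothetical placeholders.
config.load_incluster_config()

job = client.V1Job(
    metadata=client.V1ObjectMeta(name="grid-search-1"),
    spec=client.V1JobSpec(
        backoff_limit=0,
        template=client.V1PodTemplateSpec(
            spec=client.V1PodSpec(
                restart_policy="Never",
                containers=[client.V1Container(
                    name="runner",
                    image="my-registry/notebook-runner:latest",  # hypothetical image with papermill installed
                    command=["papermill",
                             "/data/grid_search.ipynb",       # input notebook on the shared volume
                             "/data/grid_search_out.ipynb"],  # executed copy, written back for the user
                    volume_mounts=[client.V1VolumeMount(name="data", mount_path="/data")],
                )],
                volumes=[client.V1Volume(
                    name="data",
                    persistent_volume_claim=client.V1PersistentVolumeClaimVolumeSource(
                        claim_name="claim-myuser"  # hypothetical: the user's Z2JH home PVC
                    ),
                )],
            )
        ),
    ),
)

client.BatchV1Api().create_namespaced_job(namespace="jhub", body=job)
```

The Job keeps running after the user logs out, and the executed notebook lands on the same volume their Z2JH session mounts.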

Thank you for your time and consideration, and I apologize for the open-ended nature of this question.

A lot of work has been done on integrating Dask with JupyterHub, though I haven’t used it myself:
https://docs.dask.org/en/stable/deploying-kubernetes.html
It’s probably worth looking at though!
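For the "close the browser and come back later" use case specifically, `dask.distributed`'s `fire_and_forget` plus published datasets might fit. A rough sketch, assuming a Dask scheduler is already running in the cluster (the address below is a placeholder):

```python
from dask.distributed import Client, fire_and_forget

client = Client("tcp://dask-scheduler:8786")  # hypothetical scheduler address

def grid_search():
    ...  # long-running computation goes here
    return "results"

future = client.submit(grid_search, key="grid-search-1")
client.publish_dataset(grid_search_1=future)  # keep a named handle on the scheduler
fire_and_forget(future)  # keep computing even after this client disconnects

# Later, from a fresh notebook session:
# client = Client("tcp://dask-scheduler:8786")
# result = client.get_dataset("grid_search_1").result()
```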

I re-read my question and apologize for not being clear enough. What we have now is Z2JH, and that will probably stay in place for good: it is easy to use for students and researchers. Our primary issue is long-running tasks, which get terminated by the idle-culling service once the user closes the browser. I’m searching for a solution where someone could submit/offload/migrate a computation (e.g., a grid search) to another service with minimal effort and pick up the results after the calculation is complete.
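To make the goal concrete, the kind of script I’d like to offload looks roughly like the following; it persists its results to the user’s shared volume so they can be picked up from a later Z2JH session (the dataset, parameters, and paths are hypothetical stand-ins):

```python
import joblib
from sklearn.datasets import load_digits
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

# Toy stand-in for the real workload.
X, y = load_digits(return_X_y=True)

search = GridSearchCV(
    SVC(),
    param_grid={"C": [0.1, 1, 10], "gamma": ["scale", "auto"]},
    n_jobs=-1,  # use all CPUs granted to the batch pod
)
search.fit(X, y)

# Persist the fitted search on the shared volume so a later interactive
# Z2JH session can load it with joblib.load(). The path is hypothetical.
joblib.dump(search, "/home/jovyan/results/grid_search.joblib")
```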

I came across another solution, Elyra, which adds the ability to run a notebook or Python script as a batch job.
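The underlying notebook-as-batch-job idea can also be tried directly with papermill, which executes a parameterised notebook headlessly and saves an executed copy with all cell outputs. A minimal sketch with hypothetical paths and parameters:

```python
import papermill as pm

pm.execute_notebook(
    "grid_search.ipynb",      # input notebook with a tagged 'parameters' cell
    "grid_search_out.ipynb",  # executed copy, including all cell outputs
    parameters={"n_trials": 500},  # hypothetical parameter override
)
```

Since the executed copy is an ordinary notebook, the results can be inspected later from the regular Z2JH session.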