We are planning to run JupyterHub in a multi-zonal (three availability zones) Google Kubernetes Engine (GKE) cluster in a private Virtual Private Cloud (VPC). JupyterHub and JupyterLab will NOT have external IP addresses. Access to JupyterHub and JupyterLab from web clients will go through an Ingress Controller (an Istio ILB Gateway).
We are doing this for scalability and high availability.
We are planning to deploy the configurable-http-proxy, the Hub, and the users' JupyterLab containers across all three availability zones.
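For context, spreading the user pods across the three zones could be expressed through KubeSpawner (as used by the Zero to JupyterHub Helm chart). A minimal sketch, assuming GKE nodes carry the standard `topology.kubernetes.io/zone` label and that user pods are labeled `component: singleuser-server` as in the chart's defaults; this is illustrative, not our final configuration:

```python
# jupyterhub_config.py fragment (sketch): ask the scheduler to spread
# single-user pods evenly across availability zones.
c.KubeSpawner.extra_pod_config = {
    "topologySpreadConstraints": [
        {
            # Allow at most a difference of 1 pod between zones.
            "maxSkew": 1,
            "topologyKey": "topology.kubernetes.io/zone",
            # Prefer spreading, but still schedule if a zone is full.
            "whenUnsatisfiable": "ScheduleAnyway",
            "labelSelector": {
                "matchLabels": {"component": "singleuser-server"}
            },
        }
    ]
}
```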
We are planning to deploy a single shared Cloud SQL for PostgreSQL instance to store user session state. The hubs running in all three availability zones will read from and write to this shared Cloud SQL instance.
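Concretely, pointing every hub at the shared database would be done through JupyterHub's `db_url` setting. A sketch, assuming the Cloud SQL instance is reachable at a private IP; the host, user, and database name below are placeholders, not real values:

```python
# jupyterhub_config.py fragment (sketch): all hubs, in all three zones,
# share this one Cloud SQL for PostgreSQL database.
import os

# Pull the password from the environment (e.g. mounted from a k8s Secret)
# rather than hard-coding it in the config file.
db_password = os.environ["HUB_DB_PASSWORD"]

c.JupyterHub.db_url = (
    f"postgresql://jupyterhub:{db_password}@10.0.0.5:5432/jupyterhub"
)
```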
- Will this deployment strategy work?
- What are the pitfalls?
- What changes to this strategy are needed to achieve at least some degree of high availability and scalability?
- Should the Ingress Controller use sticky or non-sticky sessions? In a perfect world, a non-sticky approach would be the way to go, but pragmatically it might make more sense to keep each user in the availability zone where they authenticated to the hub. Please advise.