Jupyterhub sharing notebooks between users in Elastic Kubernetes Service

limikmag · September 14, 2022, 2:27pm

Hello Community!
I deployed jupyterhub from the official jupyterhub helm chart version 1.1.3-n470.h217c7977. All works ok but we would like to enhance it with a sharing notebooks feature. We have a few pillars:

we would like to use S3 (or as a last resort EFS),
we use okta auth in our jupyterhub so we would like to have sharing granulation per user(how to integrate okta ID with create/edit/delete S3 permissions?),
we don’t want to download and upload the whole S3 bucket every time, because it can be a huge amount of data. We would like to upload/download only changed files.

I found some similar problem here JupyterHub Object Storage S3. But the solution is not good for a big amount of data and each user download everything.

Thank you in advance!

manics · September 14, 2022, 4:31pm

EFS is the easiest to setup Setting up EFS storage on AWS — Zero to JupyterHub with Kubernetes documentation

Mounting S3 is theoretically possible, you can see an earlier discussion in

but you’ll need to do some development work

Jupyter notebook/lab can use S3 as a ContentsManager: GitHub - danielfrg/s3contents: Jupyter Notebooks in S3 - Jupyter Contents Manager implementation
To setup per-user S3 permissions you’ll need to implement some authenticator or spawner hooks to create the IAM role, and to set the service account corresponding to the IAM role.

yuvipanda · September 14, 2022, 4:45pm

I too would highly suggest using EFS instead of S3! I did, however, recently build GitHub - yuvipanda/jupyterhub-roothooks (with some docs in the README) to be able to mount s3 more easily. Take a look at that if you want, but definitely definitely use S3 as last resort and EFS as first call on AWS

Topic		Replies	Views
IAM role not working for S3 storage of notebook code Zero to JupyterHub on Kubernetes	2	1388	September 25, 2020
JupyterHub Object Storage S3 JupyterHub	2	4253	May 3, 2020
JupyterHub notebook persistence on AWS S3 bucket JupyterHub	1	906	July 1, 2022
S3 contents library is not working. Unable to see S3 default directory on notebook. I have S3 bucket in aws and I have already created service account and role which grants permission to the S3 bucket Zero to JupyterHub on Kubernetes jupyterlab , jupyterhub	2	41	February 18, 2025
Is is possible to create a shared folder which is visible to existing users? JupyterHub jupyterhub , how-to , help-wanted	4	1215	October 26, 2022

Jupyterhub sharing notebooks between users in Elastic Kubernetes Service

Related topics