Access S3 Bucket from Jupyter Notebook

Stephen_Park · December 12, 2021, 10:21pm

Hello, I am very new to Jupyterhub and I want to be able to access S3 bucket from my Jupyter Notebook. I have authentication done through Keycloak, and I have found some resources where I can retrieve the Keycloak access token, but I can’t figure out where to make the STS get session token call to retrieve temporary credentials for AWS S3 for the authenticated keycloak user. Could someone provide some insight on this? Thanks.

manics · December 14, 2021, 3:35pm

You should be able to configure a pre_spawn_start hook in your Authenticator that can make arbitrary calls and store the results to auth_state which can be accessed by the spawner.

Starting single-user notebook with our custom ldap docker image - #4 by manics is a bit out of date, but shows the basic principals of a spawner passing user-specific variables to a spawner (in this case a UID) which makes them available to the user as environment variables.

Stephen_Park · December 14, 2021, 9:29pm

So for testing purposes I’ve been able to pass in AWS credentials to the Jupyter Notebook environment with that pre_spawn_start hook and have access to AWS S3, but I would like those credentials to be tied with the authenticated Keycloak user. I was thinking about doing a AssumeRoleWithWebIdentity call with the Keycloak access token to get temporary AWS credentials, but I’m getting a “couldn’t retrieve verification key from identity provider” when trying to make that call. Is there a better way of getting AWS credentials associated with a Keycloak user?

manics · December 14, 2021, 10:25pm

I’ve never integrated KeyCloak with AWS before. Can you share your configuration/code with secrets redacted? If I still can’t help then someone else might be able to.

Topic		Replies	Views
User Specific AWS secrets Zero to JupyterHub on Kubernetes	1	1011	October 20, 2020
IAM role not working for S3 storage of notebook code Zero to JupyterHub on Kubernetes	2	1388	September 25, 2020
Jupyterhub sharing notebooks between users in Elastic Kubernetes Service Zero to JupyterHub on Kubernetes how-to , help-wanted	2	1065	September 14, 2022
JupyterHub notebook persistence on AWS S3 bucket JupyterHub	1	906	July 1, 2022
How to load AWS Profiles to Jupyter PySpark or Spark sessions Notebook	1	1495	April 17, 2021

Access S3 Bucket from Jupyter Notebook

Related topics