[GenericOAuthenticator] Pulling `claim_groups_key` from another token than the one returned by `USERDATA_URL` for AWS Cognito

dpf · May 23, 2022, 3:50pm

Hi there,
First off, thank you for the amazing work and product I am grateful that such a vibrant community is backing it up.

I’m using GenericOAuthenticator and AWS Cognito to manage authentication/authorization to Jupyterhub, but I’m facing the following issue:

GenericOAuthenticator is looking for the group claim in the USERDATA token. I map this to the USERINFO endpoint of AWS Cognito (USERINFO endpoint - Amazon Cognito)
AWS Cognito does not put the group claim in the USERINFO token, but in the ACCESS token (Using the access token - Amazon Cognito), which has to be decoded and verified.

I’ve searched on this forum and elsewhere, but alas I haven’t found any solution or discussion around this issue. Right now I’m using a custom class inheriting from GenericOAuthenticator and patching a method to get the group claim from the ACCESS token, but this does not feel great.

Is there a better way, and if not, is there an appetite and some design ideas for an improvement there?
I’m ready to implement some changes and open a PR for this if needed.

manics · May 25, 2022, 9:07pm

Hi! Could you provide a bit more information on your overall goals? Are you wanting to lookup a user’s groups and only allow login if they’re a member of that group? Are you wanting to somehow map the AWS Cognito group to a JupyterHub group? Or are you wanting to fetch that information and save it for later use?

For the first case one option could be to change the claim_groups_key so that the callable version takes both user_data_resp_json and token_resp_json (might make sense to do the same for username_key?):

github.com

jupyterhub/oauthenticator/blob/c58b9e122071cbe6fdc614651e61f7814703d771/oauthenticator/generic.py#L191-L192

      
        
            if callable(self.claim_groups_key):
                groups = self.claim_groups_key(user_data_resp_json)

dpf · May 30, 2022, 7:51am

FYI We’re running with jupyterhub/ zero-to-jupyterhub-k8s v1.2, AFAIU it doesn’t have any impact on the issue at hand.

The goals of the setup we have working right now are the following:

When a user logs in, get their cognito:groups claim using the claim_groups_key and only allow a certain set of groups using allowed_groups
Keep track of the user auth state all the way to the options_form and present custom spawn options based on their groups.
Keep track of the user auth state all the way to the KubeSpawner to validate that the form selection received is actually authorized.

The last two points are achieved thanks to enable_auth_state, JUPYTERHUB_CRYPT_KEY and a custom auth_state_hook (more or less following this part of the documentation)

The only real issue we have is the hackish way we fetch the claim in the access token (instead of the default userdata token) and then add it to the user_data_resp_json dictionary. From that point on, everything flows.

Might be important design-wise, reading the access token requires a jwt.decode, so some TLS verification ideally.

Topic		Replies	Views
How to access user_data json from OAuthenticator? JupyterHub jupyterhub , help-wanted	5	1048	September 27, 2021
GenericAuthenticator with Cognito: how to check for department match The Littlest JupyterHub	5	623	February 27, 2023
Control admin group membership from an AWS Cognito user group Zero to JupyterHub on Kubernetes	6	414	April 4, 2024
GenericOAuthenticator - Restrict access using claims JupyterHub	2	1272	November 22, 2022
Group whitelist with AWSCognito JupyterHub	0	316	October 7, 2020

[GenericOAuthenticator] Pulling `claim_groups_key` from another token than the one returned by `USERDATA_URL` for AWS Cognito

Related topics