JupyterHub: can a large number of users start their servers at the same time?

I used the REST API to start each user's server, and this error occurred:
tornado.web.HTTPError: HTTP 429: Too Many Requests (Too many users trying to log in right now. Try again in 60 seconds.)

When I start each user's server with the API every five seconds, it all works.
I would like to know how many concurrent requests JupyterHub can support, and how I can increase that limit.
My JupyterHub is deployed on AKS (Kubernetes 1.19).

You're being rate limited, which is a safety feature to avoid overloading the hub; see the config docs:

https://jupyterhub.readthedocs.io/en/stable/api/app.html#jupyterhub.app.JupyterHub.concurrent_spawn_limit
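That limit defaults to 100 concurrent spawns. If your cluster can absorb more, you can raise it in `jupyterhub_config.py` (the value here is illustrative, not a recommendation):

```python
# jupyterhub_config.py -- `c` is the config object JupyterHub provides.
# Requests beyond this cap get the HTTP 429 you saw; 0 disables the
# limit entirely (not advised unless you know your cluster can take it).
c.JupyterHub.concurrent_spawn_limit = 200
```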

> I would like to know how many concurrent requests JupyterHub can support.

There are a lot of variables here, so there isn't any one right answer. It depends on your setup (are you using zero-to-jupyterhub-k8s with KubeSpawner?), the image and resource limits you've put in place, whether you're pre-pulling images on the user nodes, and whether you're pre-scaling user node capacity with placeholders:

https://zero-to-jupyterhub.readthedocs.io/en/latest/administrator/optimization.html#scaling-up-in-time-user-placeholders
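As a rough sketch of the placeholder approach (the replica count is illustrative; see the z2jh docs above for details):

```yaml
# values.yaml (z2jh Helm chart) -- keep 10 low-priority placeholder pods
# warm, so a real user displaces a placeholder instead of waiting for
# the cluster autoscaler to add a node.
scheduling:
  podPriority:
    enabled: true
  userPlaceholder:
    enabled: true
    replicas: 10
```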

Etc. There have been several threads related to this topic, and my team has developed some open-source tooling that we use to stress-test our deployments:

You might find that useful for your testing. If nothing else, you can refer to the other forum threads linked from this topic.
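For a quick client-side test, the pace-and-back-off pattern (essentially what your "every five seconds" workaround does) can be sketched like this. The `post` callable and names are illustrative, not the tooling mentioned above; in practice it would wrap a POST to `/hub/api/users/{name}/server` with an API token and return the HTTP status code:

```python
import time

def start_with_backoff(post, user, max_retries=5, base_delay=1.0,
                       sleep=time.sleep):
    """Start one user's server via the Hub REST API, retrying on HTTP 429.

    `post(user)` is expected to issue the spawn request and return the
    status code; it is injected so this sketch stays independent of any
    particular HTTP library.
    """
    for attempt in range(max_retries):
        status = post(user)
        if status != 429:  # e.g. 201 = started, 202 = spawn pending
            return status
        # Rate limited: wait with exponential backoff before retrying.
        sleep(base_delay * 2 ** attempt)
    raise RuntimeError(
        f"hub kept rate-limiting {user!r} after {max_retries} attempts")
```

Calling this in a loop over your users, instead of firing all spawns at once, keeps you under the hub's spawn throttle.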


Also, depending on what you're using in your setup (z2jh/KubeSpawner), you may want to disable this setting:

https://jupyterhub.readthedocs.io/en/stable/api/spawner.html#jupyterhub.spawner.Spawner.consecutive_failure_limit

z2jh defaults that to 5.

We had to disable it because, during high-load events with lots of users logging in at once, we sometimes hit consecutive spawn failures, for example when the node autoscaler was slow to catch up and spawns failed while waiting for an available node. Restarting the hub didn't help in that case, and in our environment and usage patterns it's an expected scenario when we aren't pre-scaled enough, so we just disabled that config.
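In z2jh, disabling it amounts to setting the limit to 0 via the chart's config passthrough; a sketch, assuming a Helm-based deployment:

```yaml
# values.yaml -- override the chart's default of 5.
# 0 disables the "shut down the hub after N consecutive spawn failures"
# behavior, which otherwise fires during autoscaler lag under high load.
hub:
  config:
    Spawner:
      consecutive_failure_limit: 0
```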
