Core component resilience/reliability

You can use the latest dev version of zero-to-jupyterhub (from https://jupyterhub.github.io/helm-chart/#development-releases-jupyterhub). It includes kubespawner 0.13, which has the PR mentioned here.
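In case it helps, here is a rough sketch of how one might pull a development release from that chart repository. It assumes Helm 3 and an existing deployment. The release name, namespace, values file, and version string are hypothetical placeholders, not values from this thread, so substitute your own.

```bash
# Hypothetical placeholders: adjust these to match your own deployment.
RELEASE=jhub
NAMESPACE=jhub
CHART_VERSION="<dev-version-from-search-output>"

# Register the JupyterHub chart repository and refresh the local index.
helm repo add jupyterhub https://jupyterhub.github.io/helm-chart/
helm repo update

# List chart versions, including development (pre-release) ones.
helm search repo jupyterhub/jupyterhub --devel --versions

# Upgrade (or install) the release using the chosen development version.
helm upgrade --install "$RELEASE" jupyterhub/jupyterhub \
  --namespace "$NAMESPACE" \
  --version "$CHART_VERSION" \
  --values config.yaml
```

Pinning `--version` explicitly is probably the safer choice here, since development releases change frequently.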

@rmoe’s PR is beautiful. Look at the CPU usage reduction here:

The change was deployed sometime on 09/05, and you can see the big difference it makes.

More importantly, you can see the change in response latencies.

We were seeing a large number of requests with latencies of 1s or more! This basically made the hub unavailable: it was dropping requests on the floor, so many of them never even made it to the hub.

UC Berkeley’s infra is now stable thanks to @rmoe’s work. THANK YOU
