K8s nodes going bad?

davidread · February 5, 2020, 8:45am

Does anyone else suffer from k8s nodes going bad? We find that about one node per week becomes faulty - either k8s is aware of the problem, or the pods' networking just seizes up or pods won't start properly. We find it really disruptive when a node is full of Jupyter pods - we have to disrupt all the users on the node by draining it, and they have to restart on another node. Is it just us with a rubbish k8s cluster, or do others find this too?

It makes me wonder whether k8s is stable enough for running stateful apps like Jupyter, which can’t have multiple replicas.
