We have setup a bare-metal Kubernetes cluster, and have deployed JupyterHub on it for students here at UC Davis to use. There has been a recurring issue that is affecting our JupyterHub service. User pods sometime get stuck in the terminating, and that seems to affect other pods running on the same node as well. Our solution so far has been to use
kubectl delete pod <pod-name> --grace-period=0 --force, and to drain the affected node. We are not sure what is the root cause of this problem. Has anyone else ran into a similar problems? Hi @choldgraf, have you run into this problem with the deployments at Berkeley? Thank you in advance.