Hi!
We have a situation where we see:
error killing pod: [failed to "KillContainer" for "notebook" with
KillContainerError: "rpc error: code = DeadlineExceeded desc = context
deadline exceeded", failed to "KillPodSandbox" for
"5df9642a-8b7f-48bf-8056-af1119c4b555" with KillPodSandboxError: "rpc error:
code = DeadlineExceeded desc = context deadline exceeded"]
for users. Then later they try to spin up a node with jupyterhub and it fails (picking a different node in the cluster). They get multi-attachment errors.
A manual way to fix is deleting pvs, pvcs, and the offending node. But it’s happening to multiple users now.
Not sure how to proceed.
We are currently using Jupyterhub / helm, 3.3.7