Self-replying, and leaving this here for anyone in the same boat:
the problem was a wrong configuration of the NVIDIA container runtime.
/etc/nvidia-container-runtime/config.toml should have the following lines:
accept-nvidia-visible-devices-as-volume-mounts = true
accept-nvidia-visible-devices-envvar-when-unprivileged = false
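For reference, these are top-level keys (not nested under any section), so the edited file ends up looking roughly like the sketch below, assuming an otherwise default config shipped by the nvidia-container-toolkit package:

```toml
# /etc/nvidia-container-runtime/config.toml -- only the relevant top-level keys shown;
# leave the rest of the distro-provided file (e.g. the [nvidia-container-cli] and
# [nvidia-container-runtime] sections) as it is
accept-nvidia-visible-devices-as-volume-mounts = true
accept-nvidia-visible-devices-envvar-when-unprivileged = false
```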
And then the NVIDIA device plugin should be deployed with:
compatWithCPUManager: true
deviceListStrategy: volume-mounts
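If you deploy the plugin through its Helm chart, a minimal values file carrying those two settings could look like this sketch (release and repo names below are just examples, adapt to however you install the plugin):

```yaml
# values.yaml for the nvidia-device-plugin Helm chart (example)
compatWithCPUManager: true         # pass device specs so the plugin plays nicely with the CPU Manager
deviceListStrategy: volume-mounts  # advertise GPUs as volume mounts instead of NVIDIA_VISIBLE_DEVICES
```

and apply it with something like `helm upgrade -i nvdp nvdp/nvidia-device-plugin -n kube-system -f values.yaml`. If you run the static DaemonSet manifest instead, these should correspond to the plugin's `--pass-device-specs` and `--device-list-strategy` flags, as far as I can tell.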
This (apparently) prevents containers from seeing all GPUs when no GPU allocation is specified, and correctly exposes only 1 GPU when one is assigned via resource limits.
This is “clearly” documented here: [External] Read list of GPU devices from volume mounts instead of NVIDIA_VISIBLE_DEVICES - Google Docs
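To sanity-check the behaviour described above, a throwaway pod along these lines (pod name and image are just examples) should now see exactly one GPU in nvidia-smi instead of all of them:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: gpu-limit-test                            # example name
spec:
  restartPolicy: Never
  containers:
  - name: cuda
    image: nvidia/cuda:12.2.0-base-ubuntu22.04    # any CUDA base image will do
    command: ["nvidia-smi"]
    resources:
      limits:
        nvidia.com/gpu: 1                         # with the config above, only this one device is mounted
```

And a pod without any `nvidia.com/gpu` limit should no longer see any GPUs at all.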