JupyterHUB on top of EMR and GPU nodes

weldpua2008 · July 25, 2019, 12:39pm

Hello,
We are using a managed Hadoop service by AWS with JupyterHUB.
We want to limit notebooks with TensorFlow on EMR Nodes with GPUs.
I failed to find any mention about how to send Notebooks (PySpark) on specific nodes (with specific tags).

kevin-bates · July 25, 2019, 2:36pm

Would the spark.yarn.{am,executor}.nodeLabelExpression tags applied in the spark-submit call satisfy this requirement? If so, you could create a specific kernel specification (kernel.json file) that invokes spark-submit. If you needed the notebook kernel (spark driver) to also run on the gpu node (i.e., cluster mode), then you’d also need to insert Enterprise Gateway into the picture and configure your Notebook servers to proxy their kernel management operations to EG.

If this is something that sounds promising, with or without EG, we can work on crafting up a kernel.json (and shell script) that invokes spark-submit to launch the IPython kernel. EG provides sample kernelspecs and yours would probably look similar to the spark_python_yarn_{client,cluster} specs.

weldpua2008 · July 31, 2019, 11:18am

Looks like I will implement it with spark.yarn.{am,executor}.nodeLabelExpression

sudo yarn rmadmin -addToClusterNodeLabels "GPU(exclusive=false)"
Add the following properties to /etc/hadoop/conf/capacity-scheduler.xml 
<property>
<name>yarn.scheduler.capacity.root.accessible-node-labels.GPU.capacity</name>
<value>100</value>
</property>

<property>
<name>yarn.scheduler.capacity.root.default.accessible-node-labels.GPU.capacity</name>
<value>100</value>
</property>

and then stop and start the ResourceManager like: 
sudo stop hadoop-yarn-resourcemanager
sudo start hadoop-yarn-resourcemanager


Add the following properties to the YARN configuration file i.e. yarn-site.xml file in the nodes of the GPU Task instance group:

<property>
<name>yarn.nodemanager.node-labels.provider</name>
<value>config</value>
</property>

<property>
<name>yarn.nodemanager.node-labels.provider.configured-node-partition</name>
<value>GPU</value>
</property>

Once these properties are added, restart the NodeManager on the respective nodes:
sudo stop hadoop-yarn-nodemanager
sudo start hadoop-yarn-nodemanager

Topic		Replies	Views
Running Jupyter Notebook Yarn Cluster Mode in HPE Data Fabric Kernels jupyterlab , jupyterhub	1	903	June 21, 2022
Help running spark jobs on a cluster that is external to K8 Zero to JupyterHub on Kubernetes help-wanted	2	719	October 30, 2024
Remote execution of code using GPUs in Jupyter discuss jupyterhub , how-to , help-wanted	2	1913	September 16, 2022
Connecting to external cluster JupyterHub jupyterhub , how-to , help-wanted	0	539	February 18, 2021
Executing PySpark code in a Jupyter Notebook using Z2JK where the user configures the Spark Session with the spark deployment mode set to "client" with the Spark executors running in their own dedicated Kubernetes cluster Zero to JupyterHub on Kubernetes jupyterlab , jupyterhub	1	833	June 16, 2022

JupyterHUB on top of EMR and GPU nodes

Related topics