I am trying to configure my kernel.json so that I can run Spark kernels in both YARN client and cluster modes inside HPE Data Fabric. Could anyone please help with the right configurations?
This is the kernel configuration I am currently using:
{
  "language": "python",
  "display_name": "Spark - Python (YARN Cluster Mode)",
  "metadata": {
    "process_proxy": {
      "class_name": "enterprise_gateway.services.processproxies.yarn.YarnClusterProcessProxy"
    }
  },
  "env": {
    "SPARK_HOME": "/opt/mapr/spark/spark-3.2.0",
    "PYSPARK_PYTHON": "/opt/anaconda3/envs/spark_py38/bin/python",
    "PYTHONPATH": "/opt/anaconda3/envs/spark_py38/lib/python3.8/site-packages/:/opt/mapr/spark/spark-3.2.0/python:/opt/mapr/spark/spark-3.2.0/python/lib/py4j-0.10.9.2-src.zip",
    "SPARK_OPTS": "--master yarn --deploy-mode cluster --name ${KERNEL_ID:-ERROR__NO__KERNEL_ID} --conf spark.yarn.submit.waitAppCompletion=false --conf spark.yarn.appMasterEnv.PYTHONUSERBASE=/home/${KERNEL_USERNAME}/.local --conf spark.yarn.appMasterEnv.PYTHONPATH=/opt/anaconda3/envs/spark_py38/lib/python3.8/site-packages/:/opt/mapr/spark/spark-3.2.0/python:/opt/mapr/spark/spark-3.2.0/python/lib/py4j-0.10.9.2-src.zip --conf spark.yarn.appMasterEnv.PATH=/opt/anaconda3/envs/spark_py38/bin:$PATH ${KERNEL_EXTRA_SPARK_OPTS}",
    "LAUNCH_OPTS": ""
  },
  "argv": [
    "/opt/anaconda3/envs/spark_py38/share/jupyter/kernels/spark_python_yarn_cluster/bin/run.sh",
    "--RemoteProcessProxy.kernel-id",
    "{kernel_id}",
    "--RemoteProcessProxy.response-address",
    "{response_address}",
    "--RemoteProcessProxy.port-range",
    "{port_range}",
    "--RemoteProcessProxy.spark-context-initialization-mode",
    "lazy"
  ]
}
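
For the client-mode side, my understanding from the sample kernelspecs shipped with Enterprise Gateway is that I would need a second kernelspec that uses the DistributedProcessProxy and --deploy-mode client, since in client mode the driver runs where the gateway runs and the spark.yarn.appMasterEnv settings would no longer apply. Below is my rough sketch of that variant (the spark_python_yarn_client directory name and the run.sh path are just my guesses, not something I have verified). Is this the right direction, or am I missing something?

{
  "language": "python",
  "display_name": "Spark - Python (YARN Client Mode)",
  "metadata": {
    "process_proxy": {
      "class_name": "enterprise_gateway.services.processproxies.distributed.DistributedProcessProxy"
    }
  },
  "env": {
    "SPARK_HOME": "/opt/mapr/spark/spark-3.2.0",
    "PYSPARK_PYTHON": "/opt/anaconda3/envs/spark_py38/bin/python",
    "PYTHONPATH": "/opt/anaconda3/envs/spark_py38/lib/python3.8/site-packages/:/opt/mapr/spark/spark-3.2.0/python:/opt/mapr/spark/spark-3.2.0/python/lib/py4j-0.10.9.2-src.zip",
    "SPARK_OPTS": "--master yarn --deploy-mode client --name ${KERNEL_ID:-ERROR__NO__KERNEL_ID} ${KERNEL_EXTRA_SPARK_OPTS}",
    "LAUNCH_OPTS": ""
  },
  "argv": [
    "/opt/anaconda3/envs/spark_py38/share/jupyter/kernels/spark_python_yarn_client/bin/run.sh",
    "--RemoteProcessProxy.kernel-id",
    "{kernel_id}",
    "--RemoteProcessProxy.response-address",
    "{response_address}",
    "--RemoteProcessProxy.port-range",
    "{port_range}",
    "--RemoteProcessProxy.spark-context-initialization-mode",
    "lazy"
  ]
}

In particular, I am not sure whether a single kernel.json can cover both deploy modes, or whether two separate kernelspecs (one per mode) is the expected setup on HPE Data Fabric.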