Running Zeppelin on CDH

Download and Build Zeppelin Go to the download page and get the latest source package. Untar the source package and create a git repo to make bower happy: $ tar zxvf zeppelin-0.5.6-incubating.tgz $ cd zeppelin-0.5.6-incubating $ git init Before building from source first determine the Hadoop version by running the following command on the edge node: $ hadoop version Hadoop 2.6.0-cdh5.4.8 ... This command was run using /opt/cloudera/parcels/CDH-5.4.8-1.cdh5.4.8.p0.4/lib/hadoop/hadoop-common-2.6.0-cdh5.4.8.jar Build Zeppelin with YARN support enabled using the Maven profile corresponding to the Hadoop version found above: »

Installing a Custom Spark Version on CDH

Since Spark can be run as a YARN application it is possible to run a Spark version other than the one provided by the Cloudera platform (CDH). This document lists the instructions for how to compile a specific Spark version against the Hadoop version supported by CDH. The instructions are based on the post Running Spark 1.5.1 on CDH. Determine the version of CDH and Hadoop by running the following command on the edge node: »