What is Zeppelin?

Zeppelin is a web-based interactive notebook for data processing (exploration, transformation, model building, etc.) and visualization.

Where to download?

You can download the latest release of Zeppelin from https://zeppelin.apache.org/download.html.

Which package should I download?

In this tutorial I will download the “Binary package with Spark interpreter and interpreter net-install script”. If you are interested in interpreters other than Spark you can download the “Binary package with all interpreters” and if you are interested in building the tool from its source code you can download the source package.

Step by step guide

Download the package

cd /path/to/download/zeppelin
wget http://apache.claz.org/zeppelin/zeppelin-0.7.1/zeppelin-0.7.1-bin-netinst.tgz

Extract the archive

tar xzf zeppelin-0.7.1-bin-netinst.tgz

Configuring Zeppelin

cp conf/zeppelin-site.xml.template conf/zeppelin-site.xml
cp conf/zeppelin-env.sh.template conf/zeppelin-env.sh

find “zeppelin.server.port” and change the port number as you wish

edit zeppelin-env.sh and add the following lines: export SPARK_HOME=/usr/local/spark export SPARK_SUBMIT_OPTIONS="–driver-class-path /opt/hadoopgpl/lib/hadoop-lzo.jar"