What is Zeppelin?
Zeppelin is a web-based interactive notebook for data processing (exploration, transformation, model building, etc.) and visualization.
Where to download?
You can download the latest release of Zeppelin from https://zeppelin.apache.org/download.html.
Which package should I download?
In this tutorial I will download the “Binary package with Spark interpreter and interpreter net-install script”. If you are interested in interpreters other than Spark you can download the “Binary package with all interpreters” and if you are interested in building the tool from its source code you can download the source package.
Step by step guide
Download the package
cd /path/to/download/zeppelin
wget http://apache.claz.org/zeppelin/zeppelin-0.7.1/zeppelin-0.7.1-bin-netinst.tgz
Extract the archive
tar xzf zeppelin-0.7.1-bin-netinst.tgz
Configuring Zeppelin
cp conf/zeppelin-site.xml.template conf/zeppelin-site.xml
cp conf/zeppelin-env.sh.template conf/zeppelin-env.sh
find “zeppelin.server.port” and change the port number as you wish
edit zeppelin-env.sh and add the following lines: export SPARK_HOME=/usr/local/spark export SPARK_SUBMIT_OPTIONS="–driver-class-path /opt/hadoopgpl/lib/hadoop-lzo.jar"