Introduction
Kettle is Pentaho’s ETL tool, which is also called Pentaho Data Integration (PDI).
Installing Kettle is extremely simple.
Install Java
Follow the JDK installation instructions that are listed in the following post: Install Java JDK 6.0 update 31 on Ubuntu 12.04 LTS
Download
To download the Kettle either run the following command, or follow the bulleted steps below.
wget http://downloads.sourceforge.net/project/pentaho/Data%20Integration/4.3.0-stable/pdi-ce-4.3.0-stable.tar.gz
Or follow the steps below if you don’t want to use the wget command shown above.
- Open a web browser to http://wiki.pentaho.com/display/COM/Latest+Stable+Builds.
- Click 4.3.0-stable.
- Click pdi-ce-4.3.0-stable.tar.gz.
Installation
Next, open a terminal and enter the following commands:
cd ~/Downloads
tar -xzf pdi-ce-4.3.0-stable.tar.gz
mv data-integration ~/bin/pdi-ce-4.3.0
cd ~/bin
ln -s pdi-ce-4.3.0 data-integration
cd ~/bin/data-integration
To run Spoon:
./spoon.sh
Additional Reading
There is a lot of good documentation installed with PDI.
cd ~/bin/data-integration/docs/English
Open your favorite PDF viewer, or type:
evince getting_started_with_pdi.pdf
Image may be NSFW.
Clik here to view.

Clik here to view.
