Friday, 21 September 2012

Hadoop Installation

Leave a Comment


Steps are following:

1.      The steps are tested on Linux Ubuntu 10.

2.      Download Hadoop from http://hadoop.apache.org/common/releases.html . This document is done with 23 May, 2012: Release 2.0.0-alpha available

3.      Unzip Hadoop in a directory. This will create a direcotry "hadoop-2.0.0-alpha"

5.      Set the environment variable for Java and Hadoop and also put Hadoop in the path. For bashrc shell it lools like:
export JAVA_HOME=/usr/lib/jvm/java-6-openjdk/
export HADOOP_INSTALL=/home/enlighted/hadoop-2.0.0-alpha
export PATH=$PATH:/home/enlighted/programs/apache-maven-3.0.4/bin:$HADOOP_INSTALL/bin:$HADOOP_INSTALL/sbin

6.      Check if hadoop is installed properly by using hadoop command
hadoop version
It will print something similar to
Hadoop 2.0.0-alpha
alph... –r1338348

7.      Hadoop can be configured using XML configuration files which are at etc/hadoop directory inside hadoop folder. The improtant onces are:
core-site.xml -> core porperties
hdfs-site.xml -> hdfs properties
yarn-site.xml -> yarn properties which is the next generation of Map Reduce.

8.      By default hadoop is configured to run in standalone mode. The other two modes are pseudo distributed and clustered mode




0 comments:

Post a Comment