Steps are following:
1.
The steps are tested on
Linux Ubuntu 10.
2.
Download Hadoop from http://hadoop.apache.org/common/releases.html .
This document is done with 23 May, 2012:
Release 2.0.0-alpha available
3.
Unzip Hadoop in a
directory. This will create a direcotry "hadoop-2.0.0-alpha"
5.
Set the environment
variable for Java and Hadoop and also put Hadoop in the path. For bashrc shell
it lools like:
export JAVA_HOME=/usr/lib/jvm/java-6-openjdk/
export HADOOP_INSTALL=/home/enlighted/hadoop-2.0.0-alpha
export PATH=$PATH:/home/enlighted/programs/apache-maven-3.0.4/bin:$HADOOP_INSTALL/bin:$HADOOP_INSTALL/sbin
export HADOOP_INSTALL=/home/enlighted/hadoop-2.0.0-alpha
export PATH=$PATH:/home/enlighted/programs/apache-maven-3.0.4/bin:$HADOOP_INSTALL/bin:$HADOOP_INSTALL/sbin
6.
Check if hadoop is
installed properly by using hadoop command
hadoop version
It will print something similar to
Hadoop 2.0.0-alpha
alph... –r1338348
7.
Hadoop can be configured
using XML configuration files which are at etc/hadoop directory inside hadoop
folder. The improtant onces are:
core-site.xml -> core porperties
hdfs-site.xml -> hdfs properties
yarn-site.xml -> yarn properties which is the next generation
of Map Reduce.
8.
By default hadoop is
configured to run in standalone mode. The other two modes are pseudo
distributed and clustered mode

0 comments:
Post a Comment