Hadoop

From Cncz
Revision as of 11:21, 15 April 2013 by Bram (talk | contribs)
Jump to: navigation, search

Running Hadoop in Terminal Rooms

To setup a hadoop hadoop cluster in a terminal room, make sure you have booted some PC's with Ubuntu Linux 12.04. Write down the names of the PC's you want to use as slave nodes. Then run tkhadoop.sh:

/usr/local/bin/tkhadoop.sh [slaves]

For example, when you're physically working with hg137pc01 and pc two and three are available and running linux, use:

tkhadoop.sh hg137pc02.science.ru.nl hg137pc03.science.ru.nl


This will setup the files needed to run a three node hadoop cluster. The host on which you execute tkhadoop will be the master node and will be used as slave as well. You'll find your hadoop installation in:

/scratch/username/hadoop

From within this directory, you can test the examples as documented on the apache hadoop website: