Hadoop

From Cncz
Revision as of 12:21, 15 April 2013 by Bram (talk | contribs)

Running Hadoop in Terminal Rooms

To set up a Hadoop cluster in a terminal room, make sure you have booted some PCs with Ubuntu Linux 12.04. Write down the names of the PCs you want to use as slave nodes. Then run tkhadoop.sh with those names as arguments:

/usr/local/bin/tkhadoop.sh [slaves]

For example, if you are physically working at hg137pc01, and PCs 02 and 03 in the same room are available and running Linux, use:

tkhadoop.sh hg137pc02.science.ru.nl hg137pc03.science.ru.nl
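
Typing full hostnames for many slaves is error-prone, so the argument list can be generated instead. The following is only a sketch: slave_hosts is a hypothetical helper, not part of tkhadoop.sh, and it assumes that every PC follows the <room>pc<NN>.science.ru.nl naming pattern seen in the example above.

```shell
# Hypothetical helper (not part of tkhadoop.sh): print one fully qualified
# hostname per PC number. Assumes the <room>pc<NN>.science.ru.nl pattern.
# Pass numbers without a leading zero (2, not 02); printf adds the padding.
slave_hosts() {
    room="$1"
    shift
    for n in "$@"; do
        printf '%spc%02d.science.ru.nl\n' "$room" "$n"
    done
}

# Print the resulting command so it can be checked before it is run.
# The command substitution is left unquoted on purpose so that each
# hostname becomes a separate argument.
echo tkhadoop.sh $(slave_hosts hg137 2 3)
```

Once the printed command looks right, remove the echo to actually start tkhadoop.sh.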


The example invocation sets up the files needed to run a three-node Hadoop cluster. The host on which you execute tkhadoop.sh becomes the master node and is used as a slave as well. You'll find your Hadoop installation in:

/scratch/username/hadoop
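
A quick way to confirm that the setup succeeded is to check that this directory exists and change into it. This is a minimal sketch: hadoop_home is a hypothetical helper, and it assumes that "username" in the path above stands for your own login name.

```shell
# Hypothetical helper: print the installation path for a given login name.
# Assumption: "username" in /scratch/username/hadoop is your login name;
# with no argument, the name reported by `id -un` is used.
hadoop_home() {
    printf '/scratch/%s/hadoop\n' "${1:-$(id -un)}"
}

dir="$(hadoop_home)"
if [ -d "$dir" ]; then
    # Enter the installation directory and show what was created there.
    cd "$dir" && ls
else
    echo "No Hadoop installation at $dir; has tkhadoop.sh been run on this host?" >&2
fi
```

If the directory exists, the shell ends up inside the installation directory.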

From within this directory, you can test the examples as documented on the Apache Hadoop website: