Hadoop
Jump to navigation
Jump to search
Running Hadoop in Terminal Rooms
To setup a hadoop hadoop cluster in a terminal room, make sure you have booted some PC's with Ubuntu Linux 12.04. Write down the names of the PC's you want to use as slave nodes. Then run tkhadoop.sh:
/usr/local/bin/tkhadoop.sh [slaves]
For example, when you're physically working with hg137pc01 and pc two and three are available and running linux, use:
tkhadoop.sh hg137pc02.science.ru.nl hg137pc03.science.ru.nl
This will setup the files needed to run a three node hadoop cluster. The host on which you execute tkhadoop will be the master node and will be used as slave as well.
You'll find your hadoop installation in:
/scratch/username/hadoop
From within this directory, you can test the examples as documented on the apache hadoop website: