Is it possible to create a hadoop cluster by connecting three single node machines?
If i have three three virtual machines with cloudera hadoop single node installed, is it possible to create a cluster by connecting three of them? like one as namenode and other two as datanodes.
I am following this documentaion...
Of course you can connect them and things should be easy once all hosts run in pseudo-distributed mode ( all the demons on the same host ). In theory all you have to do is configuration change on all 3 hosts. In practice you have to read also this because things are a bit different.
The first external datanode is hard work, any other will follow with no problems.
This tutorial provides exactly what you need. HTH.