Programming
Hadoop’s namenode and datanodes expose a bunch of TCP ports used by Hadoop’s daemons to communicate to each other or listen directly to users’ requests. These ports information are needed ... read more »
After installing Hadoop, we usually run some benchmark programs to test whether the system works well. In the post of Hadoop install tutorial, we show a very simple to grep ... read more »
This post lists pitfalls and lessons learning when configuring and tuning Hadoop.
Hadoop with IPv6
Hadoo doesn’t support IPv6 currently (up to 0.20.2 and 0.21.0): Hadoop and IPv6. The performance of the ... read more »
Hadoop is designed to run on hundreds to thousands of computers inside cluster. However, Hadoop is configured to run things in a non-distributed ... read more »