Tutorials, solutions and tips on technologies related to Cloud Computing.

Please also check Go Peta Scale which shares posts from the community on large scale systems and cloud computing technologies.

Headline
[Jan 15, 2012 | Views: 238]
Hadoop’s namenode and datanodes expose a bunch of TCP ports used by Hadoop’s daemons to communicate to each other or listen directly to users’ requests. These ports information are needed by both the Hadoop users and cluster administrators to write programs or configure firewalls/gateways accordingly. A post written by Philip Zeyliger from ...
Read the full story »
[Jan 15, 2012 | Views: 5,123]
Hadoop mainly consists of two parts: Hadoop MapReduce and HDFS. Hadoop MapReduce is a programming model and software framework for writing applications, which is an open-source variant of MapReduce that is initially designed and implemented by Google for processing and generating large data sets [1]. HDFS is Hadoop’s underlying data ... read more »
[Apr 18, 2012 | Views: 2,424]
The mrcc project’s homepage is here: mrcc project Abstract mrcc[3] is an open source compilation system that uses MapReduce[1] to distribute C code compilation across the servers of the cloud computing platform. mrcc is built to use Hadoop[2] by default, but it is easy to port it to other could computing platforms, such ... read more »
[May 3, 2012 | Views: 2,023]
This post lists important conferences on Cloud Computing in year 2012. This post is kept updating. OSDI 2012 10th USENIX Symposium on Operating Systems Design and Implementation (OSDI ’12) October 8–10, 2012, Hollywood, CA “The tenth OSDI seeks to present innovative, exciting research in computer systems. OSDI brings together professionals from academic and industrial ... read more »
[Apr 6, 2011 | Views: 412]
Hadoop is designed to run on hundreds to thousands of computers inside cluster. However, Hadoop is configured to run things in a non-distributed mode as a single Java process by default. This is specially useful for debugging since distributed debugging is really a ... read more »
[May 11, 2011 | Views: 387]
This post lists important conferences on Cloud Computing in year 2011. ACM Symposium on Cloud Computing October 27 and 28, 2011, Cascais, Portugal Submission Deadline: April 30, 2011 23rd ACM Symposium on Operating Systems Principles (SOSP) October 23-26, 2011, Cascais, Portugal Submission deadline: March 18, 2011, 11:59 PM GMT EuroSys 2011 April 10-13, 2011. Salzburg, Austria. CLOUD COMPUTING ... read more »
[Jan 8, 2012 | Views: 307]
After installing Hadoop, we usually run some benchmark programs to test whether the system works well. In the post of Hadoop install tutorial, we show a very simple to grep strings from a simple sets of files. In this post, we introduce the Sort for testing and benchmarking Hadoop. The ... read more »
[Apr 26, 2011 | Views: 302]
This post lists pitfalls and lessons learning when configuring and tuning Hadoop. Hadoop with IPv6 Hadoo doesn’t support IPv6 currently (up to 0.20.2 and 0.21.0): Hadoop and IPv6. The performance of the cluster may suffer from turning IPv6 on in clusters: mail archive. One good practice is to disable IPv6 on servers in ... read more »
[Jan 15, 2012 | Views: 238]
Hadoop’s namenode and datanodes expose a bunch of TCP ports used by Hadoop’s daemons to communicate to each other or listen directly to users’ requests. These ports information are needed by both the Hadoop users and cluster administrators to write programs or configure firewalls/gateways accordingly. A post written by Philip Zeyliger from ... read more »
Recent
[May 3, 2012 | Views: 2,023]
This post lists important conferences on Cloud Computing in year 2012. This post is kept updating. OSDI 2012 10th USENIX Symposium on Operating Systems Design and Implementation (OSDI ’12) October 8–10, 2012, Hollywood, ... read more »
[Apr 18, 2012 | Views: 2,424]
The mrcc project’s homepage is here: mrcc project Abstract mrcc[3] is an open source compilation system that uses MapReduce[1] to distribute C code compilation across the servers of the cloud computing platform. mrcc ... read more »
[Jan 15, 2012 | Views: 5,123]
Hadoop mainly consists of two parts: Hadoop MapReduce and HDFS. Hadoop MapReduce is a programming model and software framework for writing applications, which is an open-source variant of MapReduce that ... read more »
[Jan 15, 2012 | Views: 238]
Hadoop’s namenode and datanodes expose a bunch of TCP ports used by Hadoop’s daemons to communicate to each other or listen directly to users’ requests. These ports information are needed ... read more »
[Jan 8, 2012 | Views: 307]
After installing Hadoop, we usually run some benchmark programs to test whether the system works well. In the post of Hadoop install tutorial, we show a very simple to grep ... read more »
[May 11, 2011 | Views: 387]
This post lists important conferences on Cloud Computing in year 2011. ACM Symposium on Cloud Computing October 27 and 28, 2011, Cascais, Portugal Submission Deadline: April 30, 2011 23rd ACM Symposium on Operating Systems ... read more »
[Apr 26, 2011 | Views: 302]
This post lists pitfalls and lessons learning when configuring and tuning Hadoop. Hadoop with IPv6 Hadoo doesn’t support IPv6 currently (up to 0.20.2 and 0.21.0): Hadoop and IPv6. The performance of the ... read more »
[Apr 6, 2011 | Views: 412]
Hadoop is designed to run on hundreds to thousands of computers inside cluster. However, Hadoop is configured to run things in a non-distributed ... read more »