This is a mirror of official site: http://jasper-net.blogspot.com/

Running Hadoop On Ubuntu Linux (Multi-Node Cluster)

| Friday, March 18, 2011
Table of Contents:

What we want to do

In this tutorial, I will describe the required steps for setting up a multi-node Hadoop cluster using the Hadoop Distributed File System (HDFS) on Ubuntu Linux.
Are you looking for the single-node cluster tutorial? Just head over there.
Hadoop is a framework written in Java for running applications on large clusters of commodity hardware and incorporates features similar to those of the Google File System and of MapReduce. HDFS is a highly fault-tolerant distributed file system and like Hadoop designed to be deployed on low-cost hardware. It provides high throughput access to application data and is suitable for applications that have large data sets.

Read more: Michael G. Noll

Posted via email from Jasper-net

0 comments: