Category Archives: CDH

Flume Community Office Hours @ Cloudera HQ, 2/28/2011

Categories: CDH Community Flume

On Monday, we held our second Flume Office Hours at Cloudera HQ in Palo Alto.  The intent was to meet informally, to talk about what’s new, to answer questions, and to get feedback from the community to help prioritize features for future releases.

Below is the slide deck from Flume Office Hours:

This time we had an online presence for folks to participate from remote locations.

Read more

CDH3 Beta 4 Now Available

Categories: CDH General

Cloudera is happy to announce the fourth beta release of Cloudera’s Distribution for Apache Hadoop version 3 — CDH3b4. As usual, we’d like to share a few highlights from this release.

Since this will be the last beta before we designate CDH3 stable, our focus for this release has been on stability, security, and scalability.

Stability and ease of use
Since we released CDH3 Beta 3 in October,

Read more

Apache Hadoop Availability

Categories: CDH General Hadoop HDFS MapReduce

A common question on the Apache Hadoop mailing lists is: what’s going on with availability? This post takes a look at availability in the context of Hadoop and gives an overview of the work in progress and where things are headed.

Background

When discussing Hadoop availability, people often start with the NameNode, since it is a single point of failure (SPOF) in HDFS, and most components in the Hadoop ecosystem (MapReduce,

Read more