Tag Archives: analytics

CDH3 and Cloudera Enterprise

Categories: General

Today’s a big day for us at Cloudera. We’re announcing, as part of our activity at Hadoop Summit, two major new releases that we believe substantially advance Apache Hadoop for both the open source community and our enterprise customers.

First, we’re announcing a new release of Cloudera’s Distribution for Hadoop – CDH3 Beta 2. This release, built on more than a year and a half of extensive engagement with real customers in the market,

Read more

Highlights from the First Hadoop Contributors Meeting

Categories: General Hadoop HDFS MapReduce

While the vast majority of Hadoop development discussion takes place on the Apache Jira and various project mailing lists, it’s often useful to meet face to face for high-bandwidth discussion. To that end, Facebook hosted the first Apache Hadoop contributors meeting yesterday at their campus in Palo Alto. Cloudera, Facebook, Yahoo! and the Apache HBase team were well represented.

Read more

Why Europe’s Largest Ad Targeting Platform Uses Apache Hadoop

Categories: General

Richard Hutton, CTO of nugg.ad, authored the following post about how and why his company uses Apache Hadoop.

nugg.ad operates Europe’s largest targeting platform. The company’s core business is to derive targeting recommendations from clicks and surveys. We measure these, store them in log files and later make sense of them all. From 2007 until mid-2009, we used a classical data warehouse solution.

Read more

Hadoop at Twitter (part 1): Splittable LZO Compression

Categories: General

This summer I sent the following tweet: “Had lunch today at Twitter HQ. Thanks for the invite, @kevinweil! Great lunch conversation. Smart, friendly and fun team.” Kevin Weil leads the analytics team at Twitter and is an active member of the Hadoop community, and his colleague Eric Maland leads Operations. Needless to say, Twitter is doing amazing things with Hadoop. This guest blog from Kevin and Eric covers one of Twitter’s open-source projects, which provides splittable LZO compression for Hadoop.

Read more

Apache HBase Available in CDH2

Categories: Community General Hadoop HBase

One of the more common requests we receive from the community is to package Apache HBase with Cloudera’s Distribution for Apache Hadoop. Lately, I’ve been doing a lot of work on making Cloudera’s packages easy to use, and recently, the HBase team has pitched in to help us deliver compatible HBase packages. We’re pretty excited about this, and we’re looking forward to your feedback. A big thanks to Andrew Purtell, a Senior Architect at TrendMicro and HBase Contributor,

Read more