Blog Posts

Title, Author(s) Abstract / Description File Format

May 23, 2013

Webpage View Page

May 22, 2013

Excerpt: Have you ever wished you could upgrade to the latest CDH minor release with just a few mouse clic... more

Webpage View Page

May 21, 2013

Excerpt: Mark your calendars, all you data cyclists! I’m visiting Paris, London, and Edinburgh t... more

Webpage View Page

May 20, 2013

Excerpt: According to Jim Benedetto,... more

Webpage View Page

May 17, 2013

Webpage View Page

May 15, 2013

Excerpt: Contributing to Apache Hadoop or writing custom pluggable modules requires modifying Hadoop’s s... more

Webpage View Page

May 13, 2013

Excerpt: One of the complexities of Apache Hadoop is the need to deploy clusters of servers, potentially o... more

Webpage View Page

May 10, 2013

Excerpt: Our thanks to Etsy developer Brad Greenlee (@bgreenlee) for the post below. We think his Mac... more

Webpage View Page

Top 5 Reasons to Attend HBaseCon 2013

Justin Kestelyn (@kestelyn)

May 9, 2013

Webpage View Page

May 8, 2013

Excerpt: The post below was originally published at... more

Webpage View Page

Cloudera Partners and Impala: Alteryx

Justin Kestelyn (@kestelyn)

May 8, 2013

Excerpt: Our thanks to Brian Dirking, Director of Product Marketing for... more

Webpage View Page

Extending the Data Warehouse with Hadoop

Justin Kestelyn (@kestelyn)

May 7, 2013

Excerpt: "Are data warehouses becoming victims of their own success?", Tony Baer asks in a ... more

Webpage View Page

May 7, 2013

Excerpt: At Cloudera, we have the privilege of helping thousands of developers learn Apache Hadoop, as wel... more

Webpage View Page

Cloudera Impala and Partners: Tableau

Justin Kestelyn (@kestelyn)

May 7, 2013

Excerpt: Our thanks to Ted Wasserman, product manager for Ta... more

Webpage View Page

May 6, 2013

Excerpt: This week, the Cloudera Sessions... more

Webpage View Page

Cloudera Partners and Impala: Talend

Justin Kestelyn (@kestelyn)

May 6, 2013

Excerpt: Our thanks to Yves de Montcheuil, Vice President of Marketing for... more

Webpage View Page

May 3, 2013

Excerpt: Our thanks to Kevin Spurway, Senior Vice President of Marketing for... more

Webpage View Page

May 2, 2013

Excerpt: This week represents quite a milestone for Cloudera and, at least we’d like to believe, the Had... more

Webpage View Page

May 2, 2013

Excerpt: On Monday April 29, Cloudera... more

Webpage View Page

April 30, 2013

Excerpt: It has been an exciting couple of days for new product announcements at Cloudera -- exciting espe... more

Webpage View Page

April 26, 2013

Excerpt: We're very happy to announce the 2.3 release of Hue, the open source... more

Webpage View Page

April 26, 2013

Excerpt: This post was originally published via blogs.apache.... more

Webpage View Page

April 23, 2013

Excerpt: Data scientists, that peculiar... more

Webpage View Page

April 22, 2013

Excerpt: As Cloudera’s keeper of customer stories, it’s dawned on me that others might benefit from th... more

Webpage View Page

April 22, 2013

Excerpt: Thanks to a dazzling array of excellent proposals from across the Apache HBase community, the... more

Webpage View Page

April 18, 2013

Excerpt: Today Cloudera announced a new... more

Webpage View Page

April 18, 2013

Webpage View Page

April 17, 2013

Excerpt: This guest post comes from Alex Giamas, Senio... more

Webpage View Page

April 15, 2013

Excerpt: It’s only Rock and Roll, but I like it!           - Mick Jagger... more

Webpage View Page

April 12, 2013

Excerpt: This how-to is the second in a series that explores the use of the Apache HBase REST interface. ... more

Webpage View Page

April 11, 2013

Excerpt: It's time for me to give you a quarterly update (... more

Webpage View Page

April 9, 2013

Excerpt: This guest post comes to us from David Greco, CTO of Elig... more

Webpage View Page

April 8, 2013

Excerpt: Managing and viewing data in... more

Webpage View Page

Congrats to OSCON 2013 Speakers!

Justin Kestelyn (@kestelyn)

April 5, 2013

Excerpt: Cloudera will be a proud exhibitor at O'... more

Webpage View Page

April 4, 2013

Excerpt: As a follow-up to a previous post about the Impala demo he built during Data Hacking Day, Ala... more

Webpage View Page

We Honor the Champions of Big Data!

Justin Kestelyn (@kestelyn)

April 2, 2013

Webpage View Page

March 29, 2013

Excerpt: Thanks to our friends at KDNuggets for pointing out that Cloudera is the... more

Webpage View Page

Meet the HBaseCon 2013 Program Committee

Justin Kestelyn (@kestelyn)

March 29, 2013

Excerpt: With HBaseCon 2013 (Early Bird registration now open!) pre... more

Webpage View Page

Meet the Engineer: Mark Grover

Justin Kestelyn (@kestelyn)

March 29, 2013

Webpage View Page

Phoenix in 15 Minutes or Less

Justin Kestelyn (@kestelyn)

March 28, 2013

Excerpt: The following FAQ is provided by James Taylor of Salesforce, which recently open-sourced its... more

Webpage View Page

March 26, 2013

Excerpt: Cloudera Man... more

Webpage View Page

March 25, 2013

Excerpt: Hue 2.2 , the open sour... more

Webpage View Page

March 25, 2013

Excerpt: The following guest post comes to you from Alan Gardner of remote database services and consu... more

Webpage View Page

March 22, 2013

Excerpt: In this... more

Webpage View Page

March 22, 2013

Excerpt: Last month, Apache Crunch became the fifth project (along... more

Webpage View Page

March 20, 2013

Excerpt: Hue is an open-source web interface for Apache Hado... more

Webpage View Page

How-to: Use Oozie Shell and Java Actions

Justin Kestelyn (@kestelyn)

March 18, 2013

Excerpt: Apache Oozie, the workflow coordinator for Apache Hadoop, h... more

Webpage View Page

March 15, 2013

Excerpt: Hadoop Summit Europe is coming up in Amsterdam n... more

Webpage View Page

March 13, 2013

Excerpt: Below you'll find the official announcement from Cloudera and Twitter about Parquet, an effic... more

Webpage View Page

March 12, 2013

Excerpt: There are various ways to access and interact with Apache HBase. The... more

Webpage View Page

March 8, 2013

Excerpt: Every growing, dynamic engineering culture needs a hackathon every once in a while.  Ear... more

Webpage View Page

March 7, 2013

Excerpt: The current (4.2) release of CDH -- Cloudera's 100% open-source distribution of Apache Hadoop and... more

Webpage View Page

March 6, 2013

Excerpt: Last week Cloudera released the 4.5 release of... more

Webpage View Page

March 5, 2013

Excerpt: Hadoop network encryption is a feature introduced in Apache Hadoop 2.0.2-alpha and in CDH4.1.... more

Webpage View Page

March 1, 2013

Excerpt: This post is about the new release of Hue, an open... more

Webpage View Page

February 27, 2013

Excerpt: It has been a while since I have blogged, primarily because we have been heads-down working towar... more

Webpage View Page

February 26, 2013

Excerpt: It has been a busy time for announcements coinciding with this week’s Strata conference. There... more

Webpage View Page

February 25, 2013

Excerpt: UPDATED 20130424: The new RHadoop treats output to Streaming a bit differently,... more

Webpage View Page

February 21, 2013

Excerpt: (Added Feb. 25 2013: Early Bird registration is now open - closes April 23, 2013!)... more

Webpage View Page

February 21, 2013

Excerpt: Now that Apache Hadoop is seven years old, use-case patterns for Big Data have emerged. In this p... more

Webpage View Page

February 20, 2013

Excerpt: Last week the Apache Hadoop PMC voted to release... more

Webpage View Page

February 15, 2013

Excerpt: Cloudera is proud to be a sponsor of Big Data... more

Webpage View Page

February 14, 2013

Excerpt: Organizations of all types and sizes are waking up to the idea that integrating the Apache Hadoop... more

Webpage View Page

From Zero to Impala in Minutes

Justin Kestelyn (@kestelyn)

February 7, 2013

Excerpt: This was post was originally published by U.C. Berk... more

Webpage View Page

February 6, 2013

Excerpt: This guest post is provided by Dave Nahmias, Pre-Sales and Partner Solutions Engineer at... more

Webpage View Page

A Ruby Client for Impala

Justin Kestelyn (@kestelyn)

February 4, 2013

Excerpt: Thanks to Stripe's Colin Marc (@colinmarc) for the guest post below, and for his work on the... more

Webpage View Page

January 30, 2013

Excerpt: In Part 1... more

Webpage View Page

January 28, 2013

Excerpt: Are you new to Apache Hadoop and need to start processing data fast and effectively? Have you bee... more

Webpage View Page

January 22, 2013

Excerpt: Clouderans are traveling the United States (and beyond) in droves during the first quarter of 201... more

Webpage View Page

January 18, 2013

Excerpt: I am pleased to announce the release of Cloudera Impala Beta (version 0.4) and Cloudera Manager 4... more

Webpage View Page

January 18, 2013

Excerpt: Our thanks to guest author Jon Natkins (@nattyice) of WibiData for the following post!... more

Webpage View Page

Meet the Instructor: Jesse Anderson

Ryan Goldman (@ClouderaU)

January 15, 2013

Webpage View Page

January 14, 2013

Excerpt: This following post was originally published via... more

Webpage View Page

Understanding MapReduce via Boggle

Jesse Anderson (@jessetanderson)

January 14, 2013

Excerpt: Graph theory is a growing part of Big Dat... more

Webpage View Page

January 11, 2013

Excerpt: The post below was originally published via ... more

Webpage View Page

January 10, 2013

Excerpt: For several good reasons, 2013 is a Happy New Year for Apache Hadoop enthusiasts. In 2012... more

Webpage View Page

January 9, 2013

Excerpt: (Update 2/6/2013 - Sorry, this event is sold out!) With... more

Webpage View Page

Meet the Engineer: Marcel Kornacker

Justin Kestelyn (@kestelyn)

January 8, 2013

Webpage View Page

January 7, 2013

Excerpt: I recently joined Cloudera after working in... more

Webpage View Page

Apache Bigtop 0.5.0 Has Been Released

Justin Kestelyn (@kestelyn)

January 3, 2013

Excerpt: The following post was originally published via... more

Webpage View Page

January 3, 2013

Excerpt: Hue is a web interface for... more

Webpage View Page

How-to: Use the ShareLib in Apache Oozie

Justin Kestelyn (@kestelyn)

December 18, 2012

Excerpt: As Apache Oozie, the workflow engine for Apache Hadoop, con... more

Webpage View Page

December 14, 2012

Excerpt: It’s been an exciting month and a half since the launch of the Cloudera Impala (the new open so... more

Webpage View Page

December 14, 2012

Excerpt: This is the first post in series that will get you going on how to write, compile, and run a simp... more

Webpage View Page

Cloudera Speakers at ApacheCon NA 2013

Justin Kestelyn (@kestelyn)

December 13, 2012

Excerpt: Our hearty congratulations to the Cloudera engineers who have been accepted as... more

Webpage View Page

December 11, 2012

Excerpt: At Cloudera, we put great pride into drinking our own champagne. That pride extends to our suppor... more

Webpage View Page

December 7, 2012

Excerpt: Hue is a web interface for... more

Webpage View Page

December 6, 2012

Excerpt: We are very pleased to introduce new, CDH4.1-aligned versions of the... more

Webpage View Page

December 5, 2012

Excerpt: With the... more

Webpage View Page

December 4, 2012

Excerpt: I am pleased to announce the release of Cloudera Impala Beta (version 0.3) and Cloudera Manager 4... more

Webpage View Page

November 28, 2012

Excerpt: AssignmentManager is a module in the Apache HBase... more

Webpage View Page

November 28, 2012

Webpage View Page

November 27, 2012

Excerpt: The following post was... more

Webpage View Page

This Month in Data Science

Justin Kestelyn (@kestelyn)

November 27, 2012

Excerpt: Data science has been a ubiquitous topic of conversation in the IT and business worlds across the... more

Webpage View Page

November 26, 2012

Excerpt: The following is a guest post from Nils Kübler, the creator of the Hannibal project. He is s... more

Webpage View Page

November 20, 2012

Excerpt: The following is a re-post from... more

Webpage View Page

The "Ask Bigger Questions" Contest!

Justin Kestelyn (@kestelyn)

November 19, 2012

Excerpt: Have you helped your company ask bigger questions? Our mission at Cloudera University is to equip... more

Webpage View Page

November 19, 2012

Excerpt: Apache ZooKeeper release 3.4.5 is now available. This... more

Webpage View Page

November 14, 2012

Excerpt: Since the... more

Webpage View Page

November 13, 2012

Excerpt: I am pleased to announce the release of Cloudera Impala Beta (version 0.2) and Cloudera Manager 4... more

Webpage View Page

November 13, 2012

Excerpt: This is the third article in a series about analyzing Twitter data using some of the components o... more

Webpage View Page

November 7, 2012

Excerpt: (The following is a... more

Webpage View Page

November 6, 2012

Excerpt: [Updated Nov. 26, 2012: Sorry, this event has reached capacity and is now closed.]... more

Webpage View Page

November 5, 2012

Excerpt: The 2012 Strata + Hadoop World conference was w... more

Webpage View Page

November 1, 2012

Webpage View Page

October 31, 2012

Excerpt: Last week at Strata + Hadoop World 2... more

Webpage View Page

October 31, 2012

Excerpt: A few weeks back, Cloudera announced CDH 4.1, the latest update release to Cloudera's Distributio... more

Webpage View Page

October 24, 2012

Excerpt: After a long period of intense engineering effort and user feedback, we are very pleased, and pro... more

Webpage View Page

October 24, 2012

Excerpt: Today we’re proud to announce a new addition to the Apache Hadoop ecosystem:... more

Webpage View Page

MR2 and YARN Briefly Explained

Justin Kestelyn (@kestelyn)

October 24, 2012

Excerpt: With CDH4 onward, the Apache Hadoop component introduced two new terms for Hadoop users to wonder... more

Webpage View Page

Meet the Engineer: Todd Lipcon

Justin Kestelyn (@kestelyn)

October 24, 2012

Webpage View Page

October 21, 2012

Excerpt: Cloudera is co-presenting the sold-out... more

Webpage View Page

October 21, 2012

Excerpt: This is a guest post by Oliver Guinan, VP Ground Software, at Skybox Imaging. Oliver is a 15-... more

Webpage View Page

October 21, 2012

Excerpt: Earlier this month the Apache Hadoop PMC released... more

Webpage View Page

What’s New in CDH4.1 Hue

Justin Kestelyn (@kestelyn)

October 21, 2012

Excerpt: Hue is a Web-based interface that makes it easier t... more

Webpage View Page

What’s New in CDH4.1 Pig

Justin Kestelyn (@kestelyn)

October 21, 2012

Excerpt: Apache Pig is a platform for analyzing large data sets that... more

Webpage View Page

October 21, 2012

Excerpt: Axemblr, purveyors of a cloud-agnostic MapReduce Web Service, h... more

Webpage View Page

October 21, 2012

Excerpt: This is the second article in a series about analyzing Twitter data using some of the components... more

Webpage View Page

HBase at ApacheCon Europe 2012

Justin Kestelyn (@kestelyn)

October 21, 2012

Excerpt: Apache HBase will have a notable profile at ApacheCon Europe... more

Webpage View Page

Meet the Engineer: Todd Lipcon

Justin Kestelyn (@kestelyn)

October 21, 2012

Webpage View Page

New Additions to the Apache HBase Team

Justin Kestelyn (@kestelyn)

October 21, 2012

Excerpt: StumbleUpon (SU) and Cloudera have signed a technology collaboration agreement. Cloudera will sup... more

Webpage View Page

October 21, 2012

Excerpt: Today we bring you one user's experience using Apache Whirr to spin up a CDH cluster in the c... more

Webpage View Page

October 21, 2012

Excerpt: Our video animation factory has been busy lately. The embedded player below contains our two late... more

Webpage View Page

October 21, 2012

Excerpt: We at Cloudera are tremendously excited by the power of data to effect large-scale change in the... more

Webpage View Page

October 21, 2012

Excerpt: Metrics are collections of information about Hadoop daemons, events and measurements; for example... more

Webpage View Page

MR2 and YARN Briefly Explained

Justin Kestelyn (@kestelyn)

October 21, 2012

Excerpt: With CDH4 onward, the Apache Hadoop component introduced two new terms for Hadoop users to wonder... more

Webpage View Page

Applying Parallel Prediction to Big Data

Justin Kestelyn (@kestelyn)

October 5, 2012

Excerpt: This guest post is provided by Dan McClary, Principal Product Manager for Big Data and H... more

Webpage View Page

Data Science: Hot or Not?

Justin Kestelyn (@kestelyn)

October 4, 2012

Excerpt: You may have noticed that Harvard Business Review is calling data science... more

Webpage View Page

CDH4.1 Now Released!

Charles Zedlewski

October 1, 2012

Excerpt: Update time!  As a reminder, Cloudera releases major versions of CDH, our 100% open source distr... more

Webpage View Page

September 28, 2012

Excerpt: For those of you new to it, the Duke's Choice Awards... more

Webpage View Page

September 27, 2012

Excerpt: The post below was originally published via... more

Webpage View Page

September 25, 2012

Excerpt: With the default Apache HBase configuration, everyone is a... more

Webpage View Page

September 24, 2012

Excerpt: Apache ZooKeeper release 3.4.4 is now... more

Webpage View Page

Meet the Engineer: Jon Natkins

Justin Kestelyn (@kestelyn)

September 21, 2012

Webpage View Page

Analyzing Twitter Data with Apache Hadoop

Jonathan Natkins (@nattybnatkins)

September 19, 2012

Excerpt: Social media has gained immense popularity with marketing teams, and Twitter is an effective tool... more

Webpage View Page

September 14, 2012

Excerpt: This guest post comes to us courtesy of Gwen Shapira (@gwenshap), a database consultant for... more

Webpage View Page

September 11, 2012

Excerpt: What's to love about Cloudera Ent... more

Webpage View Page

September 10, 2012

Excerpt: API access was a new feature introduced in Cloudera Manager 4.0 (download free edition... more

Webpage View Page

Meet the Engineer: Eric Sammer

Justin Kestelyn (@kestelyn)

September 7, 2012

Webpage View Page

September 5, 2012

Excerpt: Organizations in diverse industries have adopted Apache Hadoop-based systems for large-scale data... more

Webpage View Page

The Action on "HBase in Action"

Justin Kestelyn (@kestelyn)

September 4, 2012

Webpage View Page

August 30, 2012

Excerpt: Learn how to configure a basic Maven project that will be able to build applications agai... more

Webpage View Page

August 27, 2012

Excerpt: Today ZDNet has very helpfully published a... more

Webpage View Page

Meet the Engineer: Aaron T. Myers

Justin Kestelyn (@kestelyn)

August 23, 2012

Webpage View Page

August 21, 2012

Excerpt: Cloudera Manager 4.0.4 and Cloudera Manager 3.7.8 are now available! These are enhancement releas... more

Webpage View Page

August 21, 2012

Excerpt: The following is a guest post kindly offered by Adam Kawa, a 26-year old Hadoop developer fro... more

Webpage View Page

August 20, 2012

Excerpt: In June 2012, Eli Collins (@elicollins), from Cloudera's Platforms team, led a session at... more

Webpage View Page

August 16, 2012

Excerpt: This is the second blogpost about Apache HBase replication. The... more

Webpage View Page

August 15, 2012

Excerpt: Hello World: This is my first post as the new guy facilitating and coordinating developer communi... more

Webpage View Page

August 13, 2012

Excerpt: We are happy to announce the general availability of CDH3 update 5. This update is a maintenance... more

Webpage View Page

August 7, 2012

Excerpt: HttpFS is an HTTP gateway/proxy for Apache Hadoop FileSystem implementations. HttpFS comes with C... more

Webpage View Page

Column Statistics in Apache Hive

Shreepadma Venugopalan

August 3, 2012

Excerpt: Over the last couple of months the Hive team at Cloudera has been working hard to bring a bunch o... more

Webpage View Page

August 2, 2012

Excerpt: Apache ZooKeeper release 3.... more

Webpage View Page

August 2, 2012

Excerpt: Up to this point, we’ve described our reasons for using Hadoop and Hi... more

Webpage View Page

July 31, 2012

Excerpt: Introduction In this three-part series of posts, we will share our experiences tackling... more

Webpage View Page

July 30, 2012

Excerpt: Apache HBase Replication is a way of copying data from one HBase cluster to a different and possi... more

Webpage View Page

July 25, 2012

Excerpt: It’s not often the case that I have a chance to concur with my colleague E14 over at Hortonwork... more

Webpage View Page

July 19, 2012

Excerpt: We are pleased to announce the availability of Cloudera Manager 4.0.3. This is an enhancement rel... more

Webpage View Page

July 16, 2012

Excerpt: In ... more

Webpage View Page

July 12, 2012

Excerpt: In the recent blog post about the... more

Webpage View Page

July 11, 2012

Excerpt: At 5 pm PDT on June 30, a leap second was added to the Universal Coordinated Time (UTC). Within a... more

Webpage View Page

July 9, 2012

Excerpt: This is a guest re-post from Datameer's Director of Marketing, Rich Taylor. The original post... more

Webpage View Page

July 3, 2012

Excerpt: Apache Flume is a scalable, reliable, fault-tolerant, distributed system designed to collect, tra... more

Webpage View Page

July 2, 2012

Excerpt: Introduction Ever since Cloudera decided to contribute the code and resources for what... more

Webpage View Page

June 29, 2012

Excerpt: Introduction Apache HBase is the Hadoop open-source, distributed, versioned storage man... more

Webpage View Page

June 29, 2012

Excerpt: This blog was originally posted on the... more

Webpage View Page

June 26, 2012

Excerpt: This week, a team of researchers at Google will be presenting a paper describing a system they de... more

Webpage View Page

June 19, 2012

Excerpt: HBaseCon 2012 summation provided by Michael Stack, PMC Chair of the Apache HBase Project. HBa... more

Webpage View Page

June 18, 2012

Excerpt: Apache HBase is the Hadoop database, and is based on the Hadoop Distributed File... more

Webpage View Page

June 14, 2012

Excerpt: On Tuesday, June 12th The Churchill Club of Silicon Valley hosted a panel discussion on Hadoop's... more

Webpage View Page

June 11, 2012

Excerpt: Overview One of the major features of the upcoming Apache HBase 0.96 release is improve... more

Webpage View Page

June 5, 2012

Excerpt: I’m very pleased to... more

Webpage View Page

June 4, 2012

Excerpt: Hue 2.0.1 has just been... more

Webpage View Page

June 4, 2012

Excerpt: CopyTable is a simple Apache HBase utility that, unsurprisingly, can be used for copying individu... more

Webpage View Page

June 4, 2012

Excerpt: We are pleased to announce that Cloudera Manager 3.7.6 is now available! The most notable updates in... more

Webpage View Page

May 16, 2012

Excerpt: Apache HBase 0.94.0 has been released! This is the first major release since the January 22nd HBa... more

Webpage View Page

May 14, 2012

Excerpt: Today’s interview features Todd Lipcon, software engineer for Cloudera. Todd will be presenting... more

Webpage View Page

May 14, 2012

Excerpt: We're happy to announce the Beta release of Cloudera Manager 4.0.  This version of Clo... more

Webpage View Page

May 9, 2012

Excerpt: We are happy to officially announce the general availability of CDH3 update 4. This update consis... more

Webpage View Page

May 7, 2012

Excerpt: This was originally posted on the Hadoop Summit 2012... more

Webpage View Page

May 4, 2012

Excerpt: This past Monday marked the official release of Apache Hive 0.9.0. Users interested in taking t... more

Webpage View Page

May 3, 2012

Excerpt: This is a guest post by Assaf Yardeni, Head of R&D for Treato, an online social healthcar... more

Webpage View Page

May 1, 2012

Excerpt: This post was originally posted on the... more

Webpage View Page

April 25, 2012

Excerpt: HBaseCon 2012 is only a month away! The conference takes p... more

Webpage View Page

Introducing CDH4 Beta 2

Charles Zedlewski

April 24, 2012

Excerpt: I'm pleased to inform our users and customers that we have released the Cloudera's Distribution I... more

Webpage View Page

April 12, 2012

Excerpt: HBaseCon 2012 is nea... more

Webpage View Page

April 11, 2012

Excerpt: San Francisco seems to be having an unusually high number of... more

Webpage View Page

April 10, 2012

Excerpt: This blog was originally posted on the Apache Blog:... more

Webpage View Page

April 6, 2012

Excerpt: Cloudera will be hosting an Apache HBase... more

Webpage View Page

April 3, 2012

Excerpt: Apache Bigtop 0.3.0 (incubating) is now available. This is the first fully integrated, community-... more

Webpage View Page

April 2, 2012

Excerpt: This blog was originally posted on the Apache Blog: ... more

Webpage View Page

April 1, 2012

Excerpt: Introduction A few months ago, my colleague Charles Zedlewski wrote a... more

Webpage View Page

March 23, 2012

Excerpt: What's new? Apache HBase 0.92.1 is now available... more

Webpage View Page

March 21, 2012

Excerpt: Apache ZooKeeper release 3.... more

Webpage View Page

March 20, 2012

Excerpt: One of the more confusing topics in Hadoop is how authorization and authentication work in the sy... more

Webpage View Page

March 19, 2012

Excerpt: Apache HBase 0.90.6 is now available. It is a bug fix rele... more

Webpage View Page

March 14, 2012

Excerpt: Introduction Some of the configuration properties found in Apache Hadoop have a direct... more

Webpage View Page

March 8, 2012

Excerpt: We’re excited to host the first ever HB... more

Webpage View Page

March 7, 2012

Excerpt: Background Apache Hadoop consists of two primary components: H... more

Webpage View Page

March 5, 2012

Excerpt: Cloudera and Cisco jointly announced a reference architecture for running Cloudera's Distribution... more

Webpage View Page

March 2, 2012

Excerpt: Several weeks ago, I set about to demonstrate the ease with which... more

Webpage View Page

February 24, 2012

Excerpt: In... more

Webpage View Page

February 14, 2012

Excerpt: Apache ZooKeeper release 3.4.3 is now available. This is a bug fix release covering 18  issues, one of whi... more

Webpage View Page

February 14, 2012

Excerpt: Service and Configuration Management (Part I & II) We’ve recently recorded a series of demo videos int... more

Webpage View Page

Introducing CDH4

Charles Zedlewski

February 13, 2012

Excerpt: I’m pleased to inform our users and customers that Cloudera has released its 4th version of Cloudera’s... more

Webpage View Page

February 7, 2012

Excerpt: Earlier today, Cloudera proudly released the Cloudera Connector for Tableau. The availability of this connect... more

Webpage View Page

January 30, 2012

Excerpt: Keeping with our release policy for Cloudera’s Distribution Including Apache Hadoop (CDH) I’m plea... more

Webpage View Page

January 25, 2012

Excerpt: More than 150 people attended the San Francisco Bay Area HBase User Group meetup last Thursday, January 19th,... more

Webpage View Page

January 25, 2012

Excerpt: When most people first hear about data science, it’s usually in the context of how prominent web compani... more

Webpage View Page

January 24, 2012

Excerpt: Today the Apache HBase community has proudly released Apache HBase 0.92.0, a major new version of the scalable... more

Webpage View Page

January 18, 2012

Excerpt: Last November in New York City, Hadoop World, the largest conference of Apache Hadoop practitioners, developer... more

Webpage View Page

January 13, 2012

Excerpt: This blog was originally posted on the Apache Blog: https://blogs.apache.org/sqoop/entry/apache_sqoop_highlig... more

Webpage View Page

January 12, 2012

Excerpt: If you’re like a myriad of other systems administrators out there, you may be running a production Hadoo... more

Webpage View Page

January 11, 2012

Excerpt: Bala Venkatrao is the Director of Product Management at Cloudera . As many of you know, we recently launc... more

Webpage View Page

January 10, 2012

Excerpt: Cloudera users gain more choice, tighter Oracle integration. Cloudera partners gain increased validation of th... more

Webpage View Page

January 9, 2012

Excerpt: Great news! The InfoWorld Tech Center has chosen Apache Hadoop for a 2012 Technology of the Year Award . Judg... more

Webpage View Page

January 9, 2012

Excerpt: Great news! The InfoWorld Tech Center has chosen Apache Hadoop for a... more

Webpage View Page

Hadoop in 2011

Rob Weltman

January 9, 2012

Excerpt: 2011 was a breakthrough year for Apache Hadoop as many more mainstream organizations large and small turned to... more

Webpage View Page

January 9, 2012

Excerpt: 2011 was a breakthrough year for Apache Hadoop as many more mainstream organizations large and sm... more

Webpage View Page

January 8, 2012

Excerpt: Some users & customers have asked about the most recent release of Apache Hadoop, v1.0: what’s in it,... more

Webpage View Page

January 6, 2012

Excerpt: This was my summer internship project at Cloudera, and I’m very thankful for the level of support and me... more

Webpage View Page

January 6, 2012

Excerpt: This was my summer internship project at Cloudera, and I'm very thankful for the level of sup... more

Webpage View Page

January 5, 2012

Excerpt: Apache Sqoop (incubating) provides an efficient approach for transferring big data between Hadoop related sys... more

Webpage View Page

January 3, 2012

Excerpt: Part 1 of this post covered how to convert and store email messages for archival purposes using Apache Hadoop... more

Webpage View Page

January 3, 2012

Excerpt: Part... more

Webpage View Page

January 2, 2012

Excerpt: This blog was originally posted on the Apache Blog . Apache Sqoop recently celebrates its first incubator... more

Webpage View Page

December 30, 2011

Excerpt: Apache ZooKeeper release 3.4.2 is now available. This is a bug fix release covering 2 issues, one of w... more

Webpage View Page

December 28, 2011

Excerpt: Apache HBase 0.90.5 is now available.  This release of the scalable distributed data store ins... more

Webpage View Page

December 28, 2011

Excerpt: Apache HBase 0.90.5 is now available.  This is release of the scalable distributed data store... more

Webpage View Page

December 28, 2011

Excerpt: This is a guest post contributed by Loren Siebert. Loren is a San Francisco entrepreneur and software develope... more

Webpage View Page

December 28, 2011

Excerpt: This is a guest post contributed by Loren Siebert. Loren is a San Francisco entrepreneur and... more

Webpage View Page

December 27, 2011

Excerpt: Apache Whirr release 0.7.0 is now available. It includes changes covering over  50 issues , four... more

Webpage View Page

December 27, 2011

Excerpt: Apache Whirr release... more

Webpage View Page

December 22, 2011

Excerpt: This is a guest post from RichRelevance Principal Architect and Apache Avro PMC Chair Scott Carey. In Early... more

Webpage View Page

December 21, 2011

Excerpt: This blog was originally posted on the Apache Blog: https://blogs.apache.org/flume/entry/apache_flume_hackat... more

Webpage View Page

December 21, 2011

Excerpt: This blog was originally posted on the Apache Blog:... more

Webpage View Page

December 20, 2011

Excerpt: David joined us as part of our intern program , and built the prototype for the distributed log search functi... more

Webpage View Page

December 19, 2011

Excerpt: Apache ZooKeeper release 3.4.1 is now available: this is a fix release covering 7 issues, 2 of which w... more

Webpage View Page

December 13, 2011

Excerpt: Aparna Ramani is the Director of Engineering for Cloudera Enterprise. Cloudera Manager 3.7, a major new ver... more

Webpage View Page

December 9, 2011

Excerpt: This blog was originally posted on the Apache Blog: https://blogs.apache.org/flume/entry/flume_ng_architectur... more

Webpage View Page

December 9, 2011

Excerpt: This guide is intended to be an introduction to Crunch. Introduction Crunch is used for processing data. C... more

Webpage View Page

December 6, 2011

Excerpt: This guest blog post is from Alex Loddengaard , creator of FoneDoktor , an Android app that monitors phone u... more

Webpage View Page

December 2, 2011

Excerpt: San Francisco, Salesforce.com HQ - Recently there was an Apache HBase Pow-wow where project contributors gath... more

Webpage View Page

November 30, 2011

Excerpt: The amount of information we are exposed to on a daily basis is far outstripping our ability to consume it, le... more

Webpage View Page

November 29, 2011

Excerpt: Apache ZooKeeper release 3.3.4 is now available: this is a fix release covering 22 issues , 9 of w... more

Webpage View Page

November 23, 2011

Excerpt: Apache ZooKeeper release 3.4.0 is now available: it includes changes covering over  150 issues , 27 of... more

Webpage View Page

November 23, 2011

Excerpt: This blog was originally posted on the Apache Blog:   https://blogs.apache.org/sqoop/entry/inaugural_sqoo... more

Webpage View Page

November 17, 2011

Excerpt: The Apache Hive team is hard at work putting the finishing touches on the 0.8.0 release. While the release h... more

Webpage View Page

November 16, 2011

Excerpt: Last month at the Web 2.0 Summit in San Francisco, Cloudera CEO Mike Olson  presented some work the Clou... more

Webpage View Page

November 16, 2011

Excerpt: The third annual  Hadoop World conference has come and gone. The two days of conference keynotes and ses... more

Webpage View Page

November 16, 2011

Excerpt: A number of architectural changes have been added to Hadoop MapReduce. The new MapReduce system is called MR2... more

Webpage View Page

November 16, 2011

Excerpt: A number of architectural changes have been added to Hadoop MapReduce. The new MapReduce system i... more

Webpage View Page

November 15, 2011

Excerpt: The Apache Hadoop PMC has voted to release Apache Hadoop 0.23.0 . This release is significant since it is the... more

Webpage View Page

November 3, 2011

Excerpt: Cloudera believes that the flexibility and power of Apache Mahout (http://mahout.apache.org/) in conjunct... more

Webpage View Page

October 27, 2011

Excerpt: Several meetups for Apache Hadoop and Hadoop-related projects are scheduled for the evenings surrounding Ha... more

Webpage View Page

CDH3 update 2 is released

Charles Zedlewski

October 21, 2011

Excerpt: Continuing with our practice from Cloudera’s Distribution Including Apache Hadoop v2 (CDH2), our goal is... more

Webpage View Page

October 19, 2011

Excerpt: Check out the Hadoop World 2011 conference agenda! Find sessions of interest and begin planning your Hadoop... more

Webpage View Page

October 13, 2011

Excerpt: This post was contributed by Bob Gourley, editor, CTOvision.com . The missions and data of gover... more

Webpage View Page

October 12, 2011

Excerpt: The Development track at Hadoop World is a technical deep dive dedicated to discussion about Apache Hadoop and... more

Webpage View Page

October 10, 2011

Excerpt: As a data scientist at Cloudera, I work with customers across a wide range of industries that use Hadoop to so... more

Webpage View Page

October 10, 2011

Excerpt: As a data scientist at Cloudera, I work with customers across a wide range of industries that use... more

Webpage View Page

October 6, 2011

Excerpt: This post provides a high-level overview of Apache Sqoop (incubating). It discusses the general problem addres... more

Webpage View Page

October 4, 2011

Excerpt: The Enterprise Architecture track at Hadoop World 2011 will provide insight into how Hadoop is powering tod... more

Webpage View Page

October 3, 2011

Excerpt: Owen O’Malley recently collected and analyzed information in the Apache Hadoop project commit logs and... more

Webpage View Page

October 3, 2011

Excerpt: This post was written by Daniel Jackoway following his internship at Cloudera during the summer of 2011. Wh... more

Webpage View Page

September 29, 2011

Excerpt: Business Solutions is a Hadoop World 2011 track geared towards business strategists and decision makers. Sess... more

Webpage View Page

September 28, 2011

Excerpt: This post will explore a specific use case for Apache Hadoop, one that is not commonly recognized, but is gain... more

Webpage View Page

September 28, 2011

Excerpt: This post will explore a specific use case for Apache Hadoop, one that is not commonly recognized... more

Webpage View Page

September 27, 2011

Excerpt: The Hadoop World train is approaching the station! Remember to mark November 8 th and 9 th in your calendars... more

Webpage View Page

September 20, 2011

Excerpt: BusinessWeek recently published a fascinating article on Hadoop and Big Data, interviewing several Cloudera... more

Webpage View Page

September 20, 2011

Excerpt: BusinessWeek recently published a fascinating... more

Webpage View Page

September 20, 2011

Excerpt: Unstructured data is the fastest growing type of data generated today. The growth rate of text, documents, ima... more

Webpage View Page

September 15, 2011

Excerpt: Snappy is a compression library developed at Google, and, like many technologies that come from Google, Snappy... more

Webpage View Page

September 13, 2011

Excerpt: Make the most of your week in New York City by combining the Hadoop World 2011 conference with training class... more

Webpage View Page

September 13, 2011

Excerpt: Make the most of your week in New York City by combining the Hadoop World 2011 conference with... more

Webpage View Page

September 7, 2011

Excerpt: Attendees of Hadoop World will receive a free copy of either  Hadoop, The Definitive Guide (2nd edition)... more

Webpage View Page

August 30, 2011

Excerpt: The 3rd annual Hadoop World conference takes place on November 8th and 9th in New York City. Cloudera invites... more

Webpage View Page

August 10, 2011

Excerpt: Ari Rabkin is a summer intern at Cloudera, working with the engineering team to help make Hadoop more usable a... more

Webpage View Page

CDH3 Update 1 Released

Charles Zedlewski

July 22, 2011

Excerpt: Announcing an update to CDH3.... more

Webpage View Page

July 20, 2011

Excerpt: What is Hoop? Hoop provides access to all Hadoop Distributed File System (HDFS) operations (read and write)... more

Webpage View Page

July 13, 2011

Excerpt: This post was contributed by Michael Cafarella, an assistant professor of computer science at the University o... more

Webpage View Page

July 12, 2011

Excerpt: Pero works on research and development in new technologies for online advertising at Aol Advertising R&D... more

Webpage View Page

July 11, 2011

Excerpt: Philip Zeyliger is a software engineer at Cloudera and started the SCM project. Two weeks ago, at Hadoop S... more

Webpage View Page

July 5, 2011

Excerpt: The ecosystem around Apache Hadoop has grown at a tremendous rate. Folks now can use many different pieces of... more

Webpage View Page

July 5, 2011

Excerpt: Phil Langdale is a software engineer at Cloudera and the technical lead for Cloudera’s SCM Express produc... more

Webpage View Page

July 5, 2011

Excerpt: Drew O’Brien is a product marketing manager at Cloudera We’re excited to share the news about the... more

Webpage View Page

June 28, 2011

Excerpt: This is a guest repost from Shopzilla’s Tech Blog written by Andrew Look, a Software Engineer at Shop... more

Webpage View Page

June 24, 2011

Excerpt: Ed Albanese leads business development for Cloudera. He is responsible for identifying new markets, revenue op... more

Webpage View Page

June 24, 2011

Excerpt: Bala Venkatrao is the director of product management at Cloudera . I had the pleasure of attending Enzee U... more

Webpage View Page

June 22, 2011

Excerpt: This post was contributed by Jennie Cochran-Chinn and Joe Crobak. They are part of the team building out Adco... more

Webpage View Page

June 22, 2011

Excerpt: This post was contributed by Jennie Cochran-Chinn and Joe Crobak. They are part of the team b... more

Webpage View Page

June 21, 2011

Excerpt: This post was contributed by The Global Biodiversity Information Facility development team. The Global Bio... more

Webpage View Page

June 21, 2011

Excerpt:   This post was contributed by The Global Biodiversity Information Facility deve... more

Webpage View Page

June 2, 2011

Excerpt: The first task is to ensure that your system is up-to-date. This procedure has been tested on the following... more

Webpage View Page

May 25, 2011

Excerpt: Take advantage of the opportunity to become a Cloudera Certified Developer or Administrator for Apache Hadoop... more

Webpage View Page

May 25, 2011

Excerpt: Take advantage of the opportunity to become a Cloudera Certified Developer or Administrator for A... more

Webpage View Page

May 15, 2011

Excerpt: Background Klout’s goal is to be the standard for influence. The advent of social media has created... more

Webpage View Page

May 15, 2011

Excerpt: Background Klout's goal is to be the... more

Webpage View Page

May 13, 2011

Excerpt: This is a guest repost from the DataXu blog. Click here to view the original post. I recently evaluated... more

Webpage View Page

May 11, 2011

Excerpt: Cloudera is offering several training courses for Apache Hadoop over the dates surrounding Hadoop Summit. Th... more

Webpage View Page

April 28, 2011

Excerpt: This is a guest post from Mike Segel, an attendee of Chicago Data Summit. Earlier this week, Cloudera hoste... more

Webpage View Page

April 25, 2011

Excerpt: Do you know the answer? Many prominent projects (e.g. Hive, Pig) were sub-projects of Hadoop before becomi... more

Webpage View Page

April 20, 2011

Excerpt: I recently gave a talk at the LA Hadoop User Group about HBase Do’s and Don’ts . The audience was... more

Webpage View Page

April 20, 2011

Excerpt: I recently gave a talk at the LA Hadoop U... more

Webpage View Page

CDH3 goes GA

Mike Olson

April 12, 2011

Excerpt: I am very pleased to announce the general availability of Cloudera’s Distribution including Apache Hadoo... more

Webpage View Page

April 11, 2011

Excerpt: Simple Moving Average, Secondary Sort, and MapReduce (Part 3) by Josh Patterson... more

Webpage View Page

April 5, 2011

Excerpt: Adopting Apache Hadoop in the Federal Government by Jon Zuanich April 05... more

Webpage View Page

MapIncrease

ibmwatson

April 1, 2011

Excerpt: Puny humans. SSL and Wordpress authorization will keep me out of your blog question mark. I do not think so.... more

Webpage View Page

March 30, 2011

Excerpt: London Apache Hadoop User Group Meeting Summarized by Jon Zuanich March... more

Webpage View Page

March 29, 2011

Excerpt: If you find yourself in the Chicago area later this month, please join us at the Chicago Data Summit on Apri... more

Webpage View Page

We messed up.

Mike Olson

March 25, 2011

Excerpt: We messed up. by Mike Olson March 25, 2011 no comments... more

Webpage View Page

March 23, 2011

Excerpt: Rapleaf Uses Hadoop to Efficiently Scale with Terabytes of Data by Jon Zuanich... more

Webpage View Page

March 16, 2011

Excerpt: Simple Moving Average, Secondary Sort, and MapReduce (Part 2) by Josh Patterson... more

Webpage View Page

March 14, 2011

Excerpt: Simple Moving Average, Secondary Sort, and MapReduce (Part 1) by Josh Patterson... more

Webpage View Page

March 7, 2011

Excerpt: This is the third and final post in a series detailing a recent improvement in Apache HBase that helps to redu... more

Webpage View Page

March 7, 2011

Excerpt: This is the third and final post in a series detailing a recent improvement in Apache HBase that... more

Webpage View Page

March 1, 2011

Excerpt: Flume Community Office Hours @ Cloudera HQ, 2/28/2011 by Jonathan Hsieh... more

Webpage View Page

February 28, 2011

Excerpt: This is the second post in a series detailing a recent improvement in Apache HBase that helps to reduce the fr... more

Webpage View Page

February 25, 2011

Excerpt: Supported Operating Systems in CDH3 by Eli Collins February 25, 2011... more

Webpage View Page

February 25, 2011

Excerpt: While Cloudera's Distribution including Apache Hadoop (CDH) operating system support is... more

Webpage View Page

February 25, 2011

Excerpt: Gratuitous Hadoop: Stress Testing on the Cheap with Hadoop Streaming and EC2 by Jo... more

Webpage View Page

February 24, 2011

Excerpt: Today, rather than discussing new projects or use cases built on top of CDH, I'd like to switch gears a bit an... more

Webpage View Page

February 24, 2011

Excerpt: Today, rather than discussing new projects or use cases built on top of CDH, I'd like to switch g... more

Webpage View Page

February 22, 2011

Excerpt: CDH3 Beta 4 Now Available by Todd Lipcon February 22, 2011 1 c... more

Webpage View Page

February 17, 2011

Excerpt: Log Event Processing with HBase by Jon Zuanich February 17, 2011... more

Webpage View Page

February 17, 2011

Excerpt: This post was authored by Dmitry Chechik, a software engineer at TellApart, the leading Custo... more

Webpage View Page

February 16, 2011

Excerpt: An emerging data management architectural pattern behind interactive web applications... more

Webpage View Page

February 16, 2011

Excerpt: The user-data connection is driving NoSQL database-Hadoop pairing... more

Webpage View Page

February 15, 2011

Excerpt: Strategies for Exploiting Large-scale Data in the Federal Government by Jon Zuanic... more

Webpage View Page

February 14, 2011

Excerpt: Cloudera in The Cube with Silicon Angle TV at Strata Conference 2011 by Jon Zuanic... more

Webpage View Page

February 11, 2011

Excerpt: Wordnik Bypasses Processing Bottleneck with Hadoop by Jon Zuanich Februa... more

Webpage View Page

February 11, 2011

Excerpt: This post is courtesy of Kumanan Rajamanikkam, Lead Engineer at... more

Webpage View Page

February 10, 2011

Excerpt: Hadoop Availability by Eli Collins February 10, 2011 1 comment... more

Webpage View Page

February 10, 2011

Excerpt: A common question on the Apache Hadoop mail... more

Webpage View Page

February 7, 2011

Excerpt: Distributed Flume Setup With an S3 Sink by Jonathan Hsieh February 07, 2... more

Webpage View Page

February 3, 2011

Excerpt: Make your Hadoop voice heard! by Jon Zuanich February 03, 2011... more

Webpage View Page

February 3, 2011

Excerpt: Apache Hadoop is increasingly being adopted for storage and processing of large-scale complex dat... more

Webpage View Page

February 2, 2011

Excerpt: Upcoming Apache Hadoop Training Sessions by Jon Zuanich February 02, 201... more

Webpage View Page

February 2, 2011

Excerpt: Some News Related to the Apache Hadoop Project by Charles Zedlewski Febr... more

Webpage View Page

January 28, 2011

Excerpt: CDH2 Update 3 Now Available by Eli Collins January 28, 2011 1... more

Webpage View Page

January 26, 2011

Excerpt: Lessons Learned from Cloudera’s Hadoop Developer Training Course by Jon Zuan... more

Webpage View Page

January 21, 2011

Excerpt: Introducing Alfredo, Kerberos HTTP SPNEGO for Java by Alejandro Abdelnur... more

Webpage View Page

January 21, 2011

Excerpt: What is Kerberos & SPNEGO?... more

Webpage View Page

January 19, 2011

Excerpt: We blogged about 104 different topics in 2010 and we recently decided to take a look back and see what folks w... more

Webpage View Page

January 17, 2011

Excerpt: Hadoop I/O: Sequence, Map, Set, Array, BloomMap Files by Jon Zuanich Jan... more

Webpage View Page

January 11, 2011

Excerpt: How to Include Third-Party Libraries in Your Map-Reduce Job by Alex Kozlov... more

Webpage View Page

January 11, 2011

Excerpt: "My library is in the classpath but I still get a Class Not Found exception in a MapReduce job" -... more

Webpage View Page

January 10, 2011

Excerpt: Setting up CDH3 Hadoop on my new Macbook Pro by Jon Zuanich January 10,... more

Webpage View Page

January 7, 2011

Excerpt: Post written by Cloudera Software Engineer Aaron T. Myers. Apache Hadoop has had methods of doing user aut... more

Webpage View Page

Configuring Security Features in CDH3

Jon Zuanich (@jonzuanich)

January 7, 2011

Excerpt: Post written by Cloudera Software Engineer Aaron T. Myers. Apac... more

Webpage View Page

January 6, 2011

Excerpt: 2010 Cloudera Apache Hadoop Webinars by Jon Zuanich January 06, 2011... more

Webpage View Page

January 5, 2011

Excerpt: Map-Reduce With Ruby Using Apache Hadoop by Jon Zuanich January 05, 2011... more

Webpage View Page

December 21, 2010

Excerpt: New Features in Apache Pig 0.8 by John Kreisa December 21, 2010... more

Webpage View Page

December 15, 2010

Excerpt: A profile of Apache Hadoop MapReduce computing efficiency (continued) by Jon Zuani... more

Webpage View Page

December 14, 2010

Excerpt: A profile of Apache Hadoop MapReduce computing efficiency by Jon Zuanich... more

Webpage View Page

December 7, 2010

Excerpt: Cloudera and Pentaho team up to simplify data management and business intelligence... more

Webpage View Page

December 6, 2010

Excerpt: Lessons learned putting Hadoop into production by Jon Zuanich December 0... more

Webpage View Page

December 2, 2010

Excerpt: Hadoop World 2010 Tweet Analysis by Jon Zuanich December 02, 2010... more

Webpage View Page

November 29, 2010

Excerpt: Hadoop Log Location and Retention by Lars George November 29, 2010... more

Webpage View Page

November 24, 2010

Excerpt: Hadoop training coming to new cities in 2011 by Jon Zuanich November 24,... more

Webpage View Page

November 24, 2010

Excerpt: Due to expanding interest and demand for Apache Hadoop knowledge and skills across the mid-west a... more

Webpage View Page

November 18, 2010

Excerpt: Do the Schimmy: Efficient Large-Scale Graph Analysis with Hadoop, Part 2 by Jon Zu... more

Webpage View Page

November 18, 2010

Excerpt: Continued Guest Post from Michael Schatz and... more

Webpage View Page

November 17, 2010

Excerpt: Hadoop and HBase at RIPE NCC by Todd Lipcon November 17, 2010... more

Webpage View Page

November 15, 2010

Excerpt: Do the Schimmy: Efficient Large-Scale Graph Analysis with Hadoop by Jon Zuanich... more

Webpage View Page

November 8, 2010

Excerpt: Integrating Hadoop in your Existing DW and BI Environment by Gretchen Malay... more

Webpage View Page

November 8, 2010

Excerpt: Organizations are looking for a cost-effective way to deal with data that are now arriving in an... more

Webpage View Page

November 4, 2010

Excerpt: Better Workflow Management in CDH with Oozie 2 by Alejandro Abdelnur Nov... more

Webpage View Page

November 2, 2010

Excerpt: Tackling Large Scale Data in Government by Jon Zuanich November 02, 2010... more

Webpage View Page

November 1, 2010

Excerpt: Cloudera Fun & Frightful Halloween Festivities by Jon Zuanich Novem... more

Webpage View Page

October 26, 2010

Excerpt: Hadoop Lab at JavaOne by Jon Zuanich October 26, 2010 no comme... more

Webpage View Page

October 26, 2010

Excerpt: Guest post by Daniel Templeton, Product Manager at Oracl... more

Webpage View Page

October 16, 2010

Excerpt: Hadoop World 2010: An Unqualified Success by Jon Zuanich October 16, 201... more

Webpage View Page

October 12, 2010

Excerpt: CDH3 beta 3 now available by Todd Lipcon October 12, 2010 no c... more

Webpage View Page

October 11, 2010

Excerpt: Hadoop: The Definitive Guide, Second Edition by Tom White October 11, 20... more

Webpage View Page

October 8, 2010

Excerpt: Afternoon Hadoop World — Possible Path Through Great Content by Jon Zuanich... more

Webpage View Page

October 6, 2010

Excerpt: One Possible Hadoop World Morning Path by Jon Zuanich October 06, 2010... more

Webpage View Page

September 30, 2010

Excerpt: Hadoop World: More is better! by Gretchen Malay September 30, 2010... more

Webpage View Page

September 27, 2010

Excerpt: Top 10 Reasons to Attend Hadoop World by Jon Zuanich September 27, 2010... more

Webpage View Page

September 23, 2010

Excerpt: Twitter Analytics Lead, Kevin Weil, and a Presenter at Hadoop World Interviewed by... more

Webpage View Page

September 22, 2010

Excerpt: More on Cloudera Enterprise by Charles Zedlewski September 22, 2010... more

Webpage View Page

September 21, 2010

Excerpt: What’s Going On Surrounding Hadoop World by Jon Zuanich September 2... more

Webpage View Page

September 20, 2010

Excerpt: What is in our Kitchen? by Chad Metcalf September 20, 2010 no... more

Webpage View Page

September 17, 2010

Excerpt: Flume is a flexible, scalable, and reliable system for collecting streaming data.   The  Flume User... more

Webpage View Page

September 16, 2010

Excerpt: HUE SDK Training – NYC by Jon Zuanich September 16, 2010... more

Webpage View Page

September 14, 2010

Excerpt: CDH2 Update 2 Now Available by Eli Collins September 14, 2010... more

Webpage View Page

September 14, 2010

Excerpt: Hadoop World Presentation Track Release by Jon Zuanich September 14, 201... more

Webpage View Page

September 10, 2010

Excerpt: A Summer Internship with Cloudera by Jon Zuanich September 10, 2010... more

Webpage View Page

September 9, 2010

Excerpt: New York Training Session for Managers Interested In Hadoop by Jon Zuanich... more

Webpage View Page

September 8, 2010

Excerpt: Flume community update: September 2010 by jon September 08, 2010... more

Webpage View Page

September 7, 2010

Excerpt: Purdue University’s Saptarshi Guha Interviewed Regarding Hadoop, R and Hadoop World... more

Webpage View Page

September 6, 2010

Excerpt: A Look Back at August Posts by Jon Zuanich September 06, 2010... more

Webpage View Page

September 3, 2010

Excerpt: Tracing with Avro by Jon Zuanich September 03, 2010 no comment... more

Webpage View Page

September 3, 2010

Excerpt: Written by Patrick Wendell, an amazing summer intern with Cloudera and an Avro Commit... more

Webpage View Page

September 2, 2010

Excerpt: Infochimp’s President, Philip Kromer, Interviewed Regarding Hadoop and Hadoop World... more

Webpage View Page

September 1, 2010

Excerpt: Register for Hadoop Training in New York and Get into Hadoop World for Free! by Jo... more

Webpage View Page

August 30, 2010

Excerpt: Hadoop World 2010: Speaker Highlights by Jon Zuanich August 30, 2010... more

Webpage View Page

August 26, 2010

Excerpt: What’s New in Apache Hadoop 0.21 by Tom White August 26, 2010... more

Webpage View Page

August 24, 2010

Excerpt: Learn about fraud and how to prevent it with Hadoop... more

Webpage View Page

August 24, 2010

Excerpt: Fraud has multiple meanings and the term can be easily abused.  The definition of fraud has unde... more

Webpage View Page

August 24, 2010

Excerpt: Hadoop Administrator Training Comes to London by Jon Zuanich August 24,... more

Webpage View Page

August 24, 2010

Excerpt: Cloudera’s... more

Webpage View Page

August 23, 2010

Excerpt: Improving Hotel Search: Hadoop @ Orbitz Worldwide by John Kreisa August... more

Webpage View Page

August 23, 2010

Excerpt: This post was contributed by Jonathan Seidman from... more

Webpage View Page

August 19, 2010

Excerpt: Hadoop Training surrounding Hadoop World: NYC.... more

Webpage View Page

August 17, 2010

Excerpt: Hadoop/HBase Capacity Planning by Alex Kozlov August 17, 2010... more

Webpage View Page

August 17, 2010

Excerpt: Apache Hadoop and Apache HBase are gaining popularity due to their flexibility and tremendous wor... more

Webpage View Page

August 12, 2010

Excerpt: It’s easy to get started with Hadoop administration because Linux system administration is a pretty well... more

Webpage View Page

CDH3b2 Release Recap

Jeff Hammerbacher

August 11, 2010

Excerpt: CDH3b2 Release Recap by Jeff Hammerbacher August 11, 2010 no comments... more

Webpage View Page

August 10, 2010

Excerpt: Cloudera’s Henry Robinson to speak at Hadoop Day in Seattle by Huw Edwards... more

Webpage View Page

August 9, 2010

Excerpt: Hadoop World: early-bird rate ends on August 11 by Huw Edwards August 09... more

Webpage View Page

August 3, 2010

Excerpt: Flume community update – the first 30 days! by phunt August 03, 2010 no c... more

Webpage View Page

Migrating to CDH

Eric Sammer

August 2, 2010

Excerpt: With the recent release of CDH3b2 , many users are more interested than ever to try out Cloudera’s Dist... more

Webpage View Page

July 28, 2010

Excerpt: How to Get a Job at Cloudera by Mike Olson July 28, 2010 no comments... more

Webpage View Page

July 28, 2010

Excerpt: Notes From the Hackathon at Cloudera by Jeff Bean July 28, 2010 no comments... more

Webpage View Page

July 28, 2010

Excerpt: I was positively blown away by the enthusiasm, creativity, and productivity exhibited by the part... more

Webpage View Page

July 28, 2010

Excerpt: Upcoming webinar: 10 Common Hadoop-able Problems by Huw Edwards July 28, 2010 n... more

Webpage View Page

July 28, 2010

Excerpt: Announcing Two New Training Classes from Cloudera: Introduction to HBase and Analyzing Data with Hive and Pig... more

Webpage View Page

July 22, 2010

Excerpt: What’s New in CDH3b2: Hive by Carl Steinbach July 22, 2010 no comments... more

Webpage View Page

July 22, 2010

Excerpt: CDH3 beta 2 includes Apache Hive 0.5.0, the latest v... more

Webpage View Page

July 20, 2010

Excerpt: Developing Applications for HUE by Aaron Newton July 20, 2010 1 comment... more

Webpage View Page

July 20, 2010

Excerpt: Yesterday's post gave an... more

Webpage View Page

July 19, 2010

Excerpt: What’s New in CDH3b2: HUE by bc July 19, 2010 no comments... more

Webpage View Page

July 19, 2010

Excerpt: The HUE (aka. Hadoop User Experience) project [... more

Webpage View Page

July 19, 2010

Excerpt: Rackspace’s OpenStack shows the way for public cloud vendors by Ed Albanese July 1... more

Webpage View Page

July 16, 2010

Excerpt: What’s New in CDH3b2: Sqoop by Aaron Kimball July 16, 2010 no comments... more

Webpage View Page

July 15, 2010

Excerpt: Hacking with Cloudera on CDH by Alex Loddengaard July 15, 2010 no comments... more

Webpage View Page

July 15, 2010

Excerpt: What’s New in CDH3b2: Oozie by Arvind Prabhakar July 15, 2010 no comments... more

Webpage View Page

July 14, 2010

Excerpt: What’s New in CDH3b2: Pig by Carl Steinbach July 14, 2010 no comments... more

Webpage View Page

July 14, 2010

Excerpt: CDH3 beta 2 includes Apache Pig 0.7.0, the latest and... more

Webpage View Page

July 13, 2010

Excerpt: As part of our series of announcements at the recent Hadoop Summit, Cloudera released two of its previously in... more

Webpage View Page

July 12, 2010

Excerpt: CDH3 beta 2 is the first to incorporate Apache ZooKeeper. ZooKeeper is a highly reliable and available coordin... more

Webpage View Page

July 9, 2010

Excerpt: What’s New in CDH3b2: HBase by Todd Lipcon July 09, 2010 no comments... more

Webpage View Page

July 9, 2010

Excerpt: Over the last two years, Cloudera has helped a great number of customers... more

Webpage View Page

July 8, 2010

Excerpt: What’s New in CDH3b2: Core Hadoop by Eli Collins July 08, 2010 no comment... more

Webpage View Page

July 7, 2010

Excerpt: More on Cloudera’s Distribution including Apache Hadoop 3 by Charles Zedlews... more

Webpage View Page

June 29, 2010

Excerpt: CDH3 and Cloudera Enterprise by Mike Olson June 29, 2010 1 com... more

Webpage View Page

June 23, 2010

Excerpt: Are your systems struggling to absorb ever-increasing amounts of data being generated daily? Are you mired in... more

Webpage View Page

June 22, 2010

Excerpt: Cloudera is once again hosting  Hadoop World which will take place in  New York City on  Octo... more

Webpage View Page

June 18, 2010

Excerpt: Will Cloudera be at OSCON this year? Of course, it’s only the premier event for OS technologies on the ma... more

Webpage View Page

June 11, 2010

Excerpt: Integrating Hive and HBase by carl June 11, 2010 no comments... more

Webpage View Page

June 11, 2010

Excerpt: This post was contributed by John Sichi... more

Webpage View Page

June 10, 2010

Excerpt: One word more… by Mike Olson June 10, 2010 no comments... more

Webpage View Page

A transition

Christophe Bisciglia

June 10, 2010

Excerpt: A transition by Christophe Bisciglia June 10, 2010 no comments... more

Webpage View Page

A transition

Christophe Bisciglia

June 10, 2010

Excerpt: For an entrepreneur, it's an incredibly fulfilling experience to start companies and watch them "... more

Webpage View Page

June 4, 2010

Excerpt: A report from the recent UK HUG from Klass Bosteels.... more

Webpage View Page

June 3, 2010

Excerpt: Considerations for Hadoop and BI (part 2 of 2) by Jeff Bean June 03, 2010 no co... more

Webpage View Page

June 3, 2010

Excerpt: Just today we heard another question about integrating Apache Hadoop with Business Intelligence t... more

Webpage View Page

June 1, 2010

Excerpt: The second Apache Hadoop HDFS and MapReduce contributors meeting was held last Friday, May 28 at ClouderaR... more

Webpage View Page

May 25, 2010

Excerpt: Here at Cloudera we have deep knowledge and experience working with Hadoop and related technologies to so... more

Webpage View Page

May 21, 2010

Excerpt: Considerations for Hadoop and BI (part 1 of 2) by Jeff Bean May 21, 2010 no com... more

Webpage View Page

May 21, 2010

Excerpt: We recently met with a customer at... more

Webpage View Page

May 21, 2010

Excerpt: CDH2 Update 1 Now Available by Eli Collins May 21, 2010 no comments... more

Webpage View Page

May 10, 2010

Excerpt: What to Do with Extra Space? by bc May 10, 2010 no comments... more

Webpage View Page

May 7, 2010

Excerpt: Highlights from the First Hadoop Contributors Meeting by Eli Collins May 07, 2010... more

Webpage View Page

May 7, 2010

Excerpt: While the vast majority of the Hadoop development discussion takes place on... more

Webpage View Page

April 30, 2010

Excerpt: Around the globe, more and more companies are turning to Hadoop to tackle data processing problems that don... more

Webpage View Page

April 26, 2010

Excerpt: CAP Confusion: Problems with ‘partition tolerance’ by Henry Robinson April... more

Webpage View Page

April 21, 2010

Excerpt: Get Hadoop Training from Cloudera at the Hadoop Summit by John Kreisa April 21, 2010... more

Webpage View Page

April 13, 2010

Excerpt: Cloudera Hadoop Training Spreads Worldwide by John Kreisa April 13, 2010 no com... more

Webpage View Page

April 12, 2010

Excerpt: Cloudera Has Moved! by John Kreisa April 12, 2010 1 comment... more

Webpage View Page

April 5, 2010

Excerpt: Scaling Social Science with Hadoop by Ed Albanese April 05, 2010 12 comments... more

Webpage View Page

April 5, 2010

Excerpt: This post was contributed by researcher Scott Golder, who... more

Webpage View Page

April 1, 2010

Excerpt: Pushing the Limits of Distributed Processing by omer April 01, 2010 no comments... more

Webpage View Page

March 30, 2010

Excerpt: Cloudera’s Support Team Shares Some Basic Hardware Recommendations by Alex Loddengaard... more

Webpage View Page

March 24, 2010

Excerpt: It’s official – Cloudera’s Distribution for Hadoop Version 2, which we often shorthand as C... more

Webpage View Page

March 24, 2010

Excerpt: It's official - Cloudera's Distribution for Hadoop Version 2, which we ofte... more

Webpage View Page

CDH2 is released

Chad Metcalf

March 24, 2010

Excerpt: We’re proud to announce that Cloudera’s Distribution for Hadoop Version 2 (CDH2) is officially re... more

Webpage View Page

March 22, 2010

Excerpt: How Raytheon BBN Technologies Researchers are Using Hadoop to Build a Scalable, Distributed Triple Store... more

Webpage View Page

March 18, 2010

Excerpt: HBase User Group #9: HBase and HDFS by Todd Lipcon March 18, 2010 no comments... more

Webpage View Page

March 16, 2010

Excerpt: Natural Language Processing with Hadoop and Python by Ed Albanese March 16, 2010... more

Webpage View Page

March 16, 2010

Excerpt: This blog was co-written by Nitin Madnani... more

Webpage View Page

March 10, 2010

Excerpt: Richard Hutton , CTO of nugg.ad , authored the following post about how and why his company uses Hadoop. n... more

Webpage View Page

March 10, 2010

Excerpt: Richard Hutton, CTO of... more

Webpage View Page

March 3, 2010

Excerpt: Trip Report: Utah Java User’s Group by Philip Zeyliger March 03, 2010 no... more

Webpage View Page

Avro 1.3.0

Matt Massie

March 1, 2010

Excerpt: Avro 1.3.0 by Matt Massie March 01, 2010 no comments Avro... more

Webpage View Page

March 1, 2010

Excerpt: Apache Avro was added the to... more

Webpage View Page

February 22, 2010

Excerpt: Cloudera’s Hadoop Training Programs Expand Internationally by Christophe Bisciglia... more

Webpage View Page

February 22, 2010

Excerpt: It's been over a year now since we started offering Hadoop training in the Bay Area, and since th... more

Webpage View Page

February 18, 2010

Excerpt: CDH2: “Testing” Heading Towards “Stable” by Chad Metcalf Februa... more

Webpage View Page

January 19, 2010

Excerpt: Cloudera speaks VMware vCloud API, too. by Mike Olson January 19, 2010 no comme... more

Webpage View Page

January 11, 2010

Excerpt: Hadoop World: Building Data Intensive Apps with Hadoop and EC2 by ed January 11, 2010... more

Webpage View Page

December 23, 2009

Excerpt: Hadoop World: Making Hadoop Easy on Amazon Web Services by Christophe Bisciglia Decembe... more

Webpage View Page

December 22, 2009

Excerpt: Hadoop World: Hadoop Applications at Yahoo! by Christophe Bisciglia December 22, 2009... more

Webpage View Page

December 17, 2009

Excerpt: 7 Tips for Improving MapReduce Performance by Todd Lipcon December 17, 2009 no... more

Webpage View Page

December 15, 2009

Excerpt: Observers: Making ZooKeeper Scale Even Further by Henry Robinson December 15, 2009... more

Webpage View Page

December 10, 2009

Excerpt: Hadoop World: Sqoop – Database Import for Hadoop by Christophe Bisciglia December... more

Webpage View Page

December 8, 2009

Excerpt: Hadoop World: Security and API Compatibility by Christophe Bisciglia December 08, 2009... more

Webpage View Page

December 8, 2009

Excerpt: Today's Hadoop World talk comes from Owen O'Malley and talks about some of the biggest challenges fa... more

Webpage View Page

December 2, 2009

Excerpt: Hadoop World: Hadoop for Bioinformatics by Christophe Bisciglia December 02, 2009... more

Webpage View Page

November 25, 2009

Excerpt: Hadoop World: Practical HBase from Jonathan Gray and Ryan Rawson by Alex Loddengaard No... more

Webpage View Page

November 23, 2009

Excerpt: Hadoop World: Hadoop + Vertica from Omer Trajman by Alex Loddengaard November 23, 2009... more

Webpage View Page

November 20, 2009

Excerpt: Hadoop World: Hadoop + Clojure from Stuart Sierra and Tim Dysinger by Alex Loddengaard... more

Webpage View Page

November 19, 2009

Excerpt: Hadoop World: Protein Alignment from Paul Brown by Alex Loddengaard November 19, 2009... more

Webpage View Page

November 17, 2009

Excerpt: Hadoop at Twitter (part 1): Splittable LZO Compression by Matt Massie November 17, 2009... more

Webpage View Page

November 11, 2009

Excerpt: Hadoop World: Rethinking the Data Warehouse with Hadoop and Hive from Ashish Thusoo by Christop... more

Webpage View Page

November 9, 2009

Excerpt: Today’s Hadoop World video comes from Ed Capriolo, and goes into details about how to effectively monito... more

Webpage View Page

November 2, 2009

Excerpt: Avro is a recent addition to Apache's Hadoop family of projects. Avro defines a data format designed to supp... more

Webpage View Page

November 2, 2009

Excerpt: Apache Avro is a recent addition to Apache's... more

Webpage View Page

October 29, 2009

Excerpt: Hadoop World: NYC – Let the Videos Roll by Christophe Bisciglia October 29, 2009... more

Webpage View Page

October 21, 2009

Excerpt: Around the world, individuals contribute to Hadoop and build community around the technology. This kind of col... more

Webpage View Page

October 19, 2009

Excerpt: Cloudera Desktop and MooTools by Aaron Newton October 19, 2009 7 comments... more

Webpage View Page

October 15, 2009

Excerpt: Analyzing Human Genomes with Hadoop by Christophe Bisciglia October 15, 2009 4... more

Webpage View Page

October 15, 2009

Excerpt: Every day, we hear about people doing amazing things with Apache Hadoop. The va... more

Webpage View Page

October 1, 2009

Excerpt: Today at Hadoop World NYC , we’re announcing the availability of Cloudera Desktop ,  a unified an... more

Webpage View Page

September 30, 2009

Excerpt: At the beginning of September, we announced the first release of CDH2 , our current testing repository. Pac... more

Webpage View Page

September 29, 2009

Excerpt: One of the more common requests we receive from the community is to package HBase with Cloudera’s Distri... more

Webpage View Page

September 29, 2009

Excerpt: One of the more common requests we receive from the community is to package Apa... more

Webpage View Page

September 28, 2009

Excerpt: Grouping Related Trends with Hadoop and Hive by Amr Awadallah September 28, 2009... more

Webpage View Page

September 15, 2009

Excerpt: Apache Hadoop Log Files: Where to find them in CDH, and what info they contain by Alex Loddenga... more

Webpage View Page

September 10, 2009

Excerpt: In March of this year, we released our distribution for Hadoop.  Our initial focus was on stability and m... more

Webpage View Page

September 10, 2009

Excerpt: In March of this year, we released our distribution for Apache Hadoop.  Our initial focus was on... more

Webpage View Page

September 9, 2009

Excerpt: It’s been a crazy few weeks here at Cloudera, and while there is no sign of things letting up before Ha... more

Webpage View Page

Hadoop World: NYC 2009

Christophe Bisciglia

August 19, 2009

Excerpt: To say we were surprised by the quality and quantity of submissions we received for Hadoop World: NYC 2009... more

Webpage View Page

August 14, 2009

Excerpt: Hadoop Default Ports Quick Reference by Philip Zeyliger August 14, 2009... more

Webpage View Page

August 10, 2009

Excerpt: Back in October, I promised to keep marketing and sales out of this blog. We wanted to concentrate on techni... more

Webpage View Page

July 31, 2009

Excerpt: Tracking Trends with Hadoop and Hive on EC2 by Amr Awadallah July 31, 2009 8 co... more

Webpage View Page

July 29, 2009

Excerpt: As Hadoop adoption increases among organizations, companies, and individuals, and as it makes its way into pro... more

Webpage View Page

July 27, 2009

Excerpt: Cloudera’s Training VM is one of the most popular resources on our website. It was created with VMware W... more

Webpage View Page

July 27, 2009

Excerpt: Update (May 1 2013): The post below, which is based on an outdated VM, is deprecated. Rat... more

Webpage View Page

Hadoop HA Configuration

Christophe Bisciglia

July 22, 2009

Excerpt: One of the things we get a lot of questions about is how to make Hadoop highly available. There is still a lot... more

Webpage View Page

July 22, 2009

Excerpt: Disclaimer: Cloudera no longer approves of the recommendations in this post. Ple... more

Webpage View Page

The Project Split

Aaron Kimball

July 17, 2009

Excerpt: Last Wednesday, we hosted a Hadoop meetup, and I gave a short talk about the new project split. How does the s... more

Webpage View Page

July 17, 2009

Excerpt: There is some confusion about the state of the file append operation in HDFS. It was in, now it’s out. W... more

Webpage View Page

Hadoop Graphing with Cacti

Christophe Bisciglia

July 7, 2009

Excerpt: An important part of making sure Hadoop works well for all users is developing and maintaining strong relation... more

Webpage View Page

Hadoop Graphing with Cacti

Christophe Bisciglia

July 7, 2009

Excerpt: An important part of making sure Apache Hadoop works well for all users is deve... more

Webpage View Page

July 3, 2009

Excerpt: The distributed nature of MapReduce programs makes debugging a challenge. Attaching a debugger to a remote pro... more

Webpage View Page

June 30, 2009

Excerpt: Hadoop moves fast. Users often find that they need to upgrade after just a few months. Upgrading can be a daun... more

Webpage View Page

June 30, 2009

Excerpt: Apache Hadoop moves fast. Users often find that they need to upgrade after ju... more

Webpage View Page

June 24, 2009

Excerpt: Yesterday, Chris Goffinet from Digg made a great blog post about LZO and Hadoop. Many users have been frustr... more

Webpage View Page

June 24, 2009

Excerpt: Yesterday, Chris Goffinet from Digg made a great... more

Webpage View Page

June 22, 2009

Excerpt: On June 10th, more than 750 people from around the world descended on the Santa Clara Marriott to share their... more

Webpage View Page

June 22, 2009

Excerpt: On June 10th, more than 750 people from around the world descended on the Santa Clara Marriott to... more

Webpage View Page

June 17, 2009

Excerpt: Analyzing Apache logs with Pig by Amr Awadallah June 17, 2009 5 comments... more

Webpage View Page

June 17, 2009

Excerpt: (guest blog post by Dmitriy Rya... more

Webpage View Page

June 2, 2009

Excerpt: For the last few months, we’ve been working with the TVA to help them manage hundreds of TB of data from... more

Webpage View Page

Introducing Sqoop

Aaron Kimball

June 1, 2009

Excerpt: In addition to providing you with a dependable release of Hadoop that is easy to configure , at Cloudera we... more

Webpage View Page

May 29, 2009

Excerpt: A few months ago we announced the Cloudera Distribution for Hadoop .  We’re happy to report that l... more

Webpage View Page

May 28, 2009

Excerpt: In my first few weeks here at Cloudera , I’ve been tasked with helping out with the Apache ZooKeeper... more

Webpage View Page

May 28, 2009

Excerpt: As Hadoop continues to turn heads at startups and big enterprises alike, Cloudera has received several request... more

Webpage View Page

May 28, 2009

Excerpt: As Apache Hadoop continues to turn heads at startups and big enterprises alike, Cloudera has rece... more

Webpage View Page

May 27, 2009

Excerpt: Lately, we’ve been spending a lot of time on the East Coast, and one thing is clear: Hadoop is everywher... more

Webpage View Page

May 22, 2009

Excerpt: Administrators of HDFS clusters understand that the HDFS metadata is some of the most precious bits they have.... more

Webpage View Page

May 22, 2009

Excerpt: Administrators of HDFS clusters understand that the HDFS metadata is some of the most precious... more

Webpage View Page

May 18, 2009

Excerpt: This piece is based on the talk “Practical MapReduce” that I gave at Hadoop User Group UK on April... more

Webpage View Page

May 14, 2009

Excerpt: 5 Common Questions About Hadoop by Christophe Bisciglia May 14, 2009 11 comment... more

Webpage View Page

May 14, 2009

Excerpt: There’s been a lot of buzz about Apache Hadoop lately. Just the other day, some of our friends... more

Webpage View Page

May 11, 2009

Excerpt: A while back, we noticed a blog post From Arun Jacob over at Evri (if you haven’t seen Evri before,... more

Webpage View Page

May 7, 2009

Excerpt: What’s New in Hadoop Core 0.20 by Tom White May 07, 2009... more

Webpage View Page

May 1, 2009

Excerpt: We asked Brian Bockelman, a Post Doc Research Associate in the Computer Science & Engineering Depar... more

Webpage View Page

April 27, 2009

Excerpt: When we announced Cloudera’s Distribution for Hadoop last month, we asked the community to give us fe... more

Webpage View Page

April 27, 2009

Excerpt: When we announced Cloudera's Distribution for Apache Had... more

Webpage View Page

April 23, 2009

Excerpt: Today I did a web search for “pig training” using my favorite search engine. I was wildly entertai... more

Webpage View Page

April 23, 2009

Excerpt: Today I did a web search for "pig training" using my favorite search engine. I was wildly enterta... more

Webpage View Page

April 22, 2009

Excerpt: Welcome to the first guest post on the Cloudera blog. The other day, we saw Toby from  Swingly tweet... more

Webpage View Page

April 22, 2009

Excerpt: Welcome to the first guest post on the Cloudera blog. The other day, we saw Toby from ... more

Webpage View Page

April 21, 2009

Excerpt: Last Tuesday – on my second day of work at Cloudera – I went to London to check out the second UK... more

Webpage View Page

April 20, 2009

Excerpt: One of the perks of using Java is the availability of functional, cross-platform IDEs.  I use vim for m... more

Webpage View Page

April 20, 2009

Excerpt: Update (added 5/15/2013): The information below is a bit dated; see... more

Webpage View Page

April 15, 2009

Excerpt: In the process of working on a few things here I wanted to add some links to launch Hive and the Hadoop Jobt... more

Webpage View Page

April 15, 2009

Excerpt: In the process of working on a few things here I wanted to add some links to launch... more

Webpage View Page

April 9, 2009

Excerpt: A few weeks ago we announced Cloudera’s Distribution for Hadoop , and I want to spend some time showing... more

Webpage View Page

April 9, 2009

Excerpt: A few weeks ago we announced... more

Webpage View Page

April 3, 2009

Excerpt: Upcoming Functionality in “Fair Scheduler 2.0″ by Amr Awadallah April 03, 2... more

Webpage View Page

March 30, 2009

Excerpt: Configuring a Hadoop cluster is something akin to voodoo. There are a large number of variables in hadoop-def... more

Webpage View Page

March 15, 2009

Excerpt: One of the repeating themes we have heard while working with our customers and the community is that Hadoop co... more

Webpage View Page

March 15, 2009

Excerpt: One of the repeating themes we have heard while working with our customers and the community is t... more

Webpage View Page

March 13, 2009

Excerpt: Exciting news: We’re providing our basic hadoop training for free online . We’ll still... more

Webpage View Page

Hadoop Metrics

Philip Zeyliger

March 12, 2009

Excerpt: Hadoop’s NameNode, SecondaryNameNode, DataNode, JobTracker, and TaskTracker daemons all expose runtime m... more

Webpage View Page

March 6, 2009

Excerpt: Hadoop’s strength is that it enables ad-hoc analysis of unstructured or semi-structured data. Relational... more

Webpage View Page

March 6, 2009

Excerpt: Apache Hadoop's strength is that it enables ad-hoc analysis of unstructured or semi-structured da... more

Webpage View Page

February 10, 2009

Excerpt: You might think that the SecondaryNameNode is a hot backup daemon for the NameNode. You’d be wrong. The... more

Webpage View Page

February 2, 2009

Excerpt: Small files are a big problem in Hadoop — or, at least, they are if the number of questions on the user... more

Webpage View Page

January 14, 2009

Excerpt: HDFS Reliability by Tom White January 14, 2009 4 comments... more

Webpage View Page

January 5, 2009

Excerpt: It’s a new year, the time when we take a moment to look back at the previous one, and forward to what mi... more

Webpage View Page

December 31, 2008

Excerpt: The first release (0.19.0) from the 0.19 branch of Hadoop Core was made on November 24. Many changes go into... more

Webpage View Page

December 31, 2008

Excerpt: The first release (0.19.0) from the 0.19 branch of Apache ... more

Webpage View Page

December 16, 2008

Excerpt: As a developer coming to Hadoop it is important to understand how testing is organized in the project. For the... more

Webpage View Page

December 16, 2008

Excerpt: As a developer coming to Apache Hadoop it is important to understand how testing is organized in... more

Webpage View Page

December 3, 2008

Excerpt: A few weeks ago we ran a Hadoop hackathon. ApacheCon participants were invited to use our 10-node Hadoop clust... more

Webpage View Page

December 3, 2008

Excerpt: A few weeks ago we ran an Apache Hadoop hackathon. ApacheCon participants were invited to use our... more

Webpage View Page

November 23, 2008

Excerpt: Job Scheduling in Hadoop by Amr Awadallah November 23, 2008 3 comments... more

Webpage View Page

November 23, 2008

Excerpt: (guest blog post by... more

Webpage View Page

November 18, 2008

Excerpt: Introducing Hadoop Development Status by Alex Loddengaard November 18, 2008 no... more

Webpage View Page

November 14, 2008

Excerpt: It is common for a MapReduce program to require one or more files to be read by each map or reduce task before... more

Webpage View Page

November 2, 2008

Excerpt: As promised in my post about installing Scribe for log collection , I’m going to cover how to configure... more

Webpage View Page

October 28, 2008

Excerpt: Scribe is a newly released log collection tool that dumps log files from various nodes in a cluster to Scri... more

Webpage View Page

October 24, 2008

Excerpt: Apache Hadoop exists within a rich ecosystem of tools for processing and analyzing large data sets. At Facebo... more

Webpage View Page

October 23, 2008

Excerpt: We’ve created this blog as a place to post tips, tricks and insights on using Hadoop and related project... more

Webpage View Page