Blog Posts

Title, Author(s) Abstract / Description File Format

September 17, 2014

Excerpt: Our thanks to Melanie Imhof, Jonas Looser, Thierry Musy, and Kurt Stockinger of the Zuric... more

Webpage View Page

September 16, 2014

Excerpt: This overview will cover the basic tarball setup for your Mac. If you're... more

Webpage View Page

Apache Kafka for Beginners

Justin Kestelyn (@kestelyn)

September 12, 2014

Excerpt: When used in the right way and for the right use case, Kafka has unique attributes that m... more

Webpage View Page

September 10, 2014

Excerpt: What does a "Big Data engineer" do, and what does "Big Data architecture" look like? In t... more

Webpage View Page

September 8, 2014

Excerpt: Hadoop Security is the latest book from Cloudera engineers in the... more

Webpage View Page

September 5, 2014

Excerpt: Welcome to our 12th (first annual!) edition of "... more

Webpage View Page

September 4, 2014

Excerpt: Our thanks to Mayur Rustagi (... more

Webpage View Page

September 2, 2014

Excerpt: The key to getting the most out of Spark is to understand the differences between its RDD... more

Webpage View Page

August 29, 2014

Excerpt: The versatility of Apache Spark's API for both batch/ETL and streaming workloads brings t... more

Webpage View Page

August 27, 2014

Excerpt: Markov Chain Monte Carlo methods are another example of useful statistical computation fo... more

Webpage View Page

August 26, 2014

Excerpt: Impala 2.0 will add much more complete SQL functionality to what is already the fastest S... more

Webpage View Page

August 22, 2014

Excerpt: Our thanks to Rakesh Rao of Quaero, for allowing us... more

Webpage View Page

August 20, 2014

Excerpt: Congratulations to Hari Shreedharan, Cloudera software engineer and Apache Flume committe... more

Webpage View Page

August 19, 2014

Excerpt: The Transaction Processing Council (TPC), working with Cloudera, recently announced the n... more

Webpage View Page

Running CDH 5 on GlusterFS 3.3

Justin Kestelyn (@kestelyn)

August 18, 2014

Excerpt: The following post was written by Jay Vyas (@jayunit100) and ... more

Webpage View Page

August 18, 2014

Excerpt: The ability to quickly and accurately count complex events is a legitimate business advan... more

Webpage View Page

Apache Hadoop 2.5.0 is Released

Justin Kestelyn (@kestelyn)

August 15, 2014

Excerpt: The Apache Hadoop community has voted to release Apache Hadoop 2.5.0. Ap... more

Webpage View Page

August 14, 2014

Excerpt: IPython Notebook and Spark's Python API are a powerful combination for data science.... more

Webpage View Page

New in CDH 5.1: HDFS Read Caching

Justin Kestelyn (@kestelyn)

August 11, 2014

Excerpt: Applications using HDFS, such as Impala, will be able to read data up to 59x faster thank... more

Webpage View Page

This Month in the Ecosystem (July 2014)

Justin Kestelyn (@kestelyn)

August 8, 2014

Excerpt: Welcome to our 11th edition of "... more

Webpage View Page

August 5, 2014

Excerpt: Cloudera Community forums are proving their value as an important contributor to a rich u... more

Webpage View Page

Meet the Engineer: Sravya Tirukkovalur

Justin Kestelyn (@kestelyn)

August 1, 2014

Excerpt: Meet Sravya Tirukkovalur (@sravsatuluri), a Software Engineer working on Apache Hadoop se... more

Webpage View Page

July 31, 2014

Excerpt: An improved Search app in Hue 3.6 makes the Hadoop user experience even better.... more

Webpage View Page

What’s New in Kite SDK 0.15.0?

Justin Kestelyn (@kestelyn)

July 29, 2014

Excerpt: Kite SDK's new release contains new improvements that make working with data easier.... more

Webpage View Page

July 28, 2014

Excerpt: Spark 1.0 reflects a lot of hard work from a very diverse community. Clo... more

Webpage View Page

July 25, 2014

Excerpt: With this new release, setting up a separate MIT KDC for cluster authentication services... more

Webpage View Page

July 23, 2014

Excerpt: Cloudera Search now supports fine-grain access control via document-level security provid... more

Webpage View Page

July 21, 2014

Excerpt: While the new Spark Developer training from Cloudera University is valuable for developer... more

Webpage View Page

July 16, 2014

Excerpt: It was good to see Jay Kreps (@jaykreps), the LinkedIn engineer who is the tech lead for that com... more

Webpage View Page

July 15, 2014

Excerpt: There’s an important new addition coming to the Apache Hadoop book ecosystem. It’s no... more

Webpage View Page

July 14, 2014

Excerpt: Learn how Spark facilitates the calculation of computationally-intensive statistics such... more

Webpage View Page

This Month in the Ecosystem (June 2014)

Justin Kestelyn (@kestelyn)

July 11, 2014

Excerpt: Welcome to our 10th edition of "... more

Webpage View Page

Jeff Dean’s Talk at Cloudera

Justin Kestelyn (@kestelyn)

July 9, 2014

Excerpt: Google's Jeff Dean -- among the original architects of MapReduce, Bigta... more

Webpage View Page

July 8, 2014

Excerpt: Learn how creating dataflow pipelines for time-series analysis is a lot easier with Apach... more

Webpage View Page

July 1, 2014

Excerpt: Two of the most vibrant communities in the Apache Hadoop ecosystem are now working togeth... more

Webpage View Page

June 25, 2014

Excerpt: Find Cloudera tech talks in Texas, Oregon, Washington DC, Illinois, Georgia, Japan, and a... more

Webpage View Page

June 24, 2014

Excerpt: Prefer IntelliJ IDEA over Eclipse? We've got you covered: learn how to... more

Webpage View Page

Meet the Data Scientist: Sandy Ryza

Justin Kestelyn (@kestelyn)

June 19, 2014

Excerpt: Meet Sandy Ryza (@SandySifting), the newest member of Cloudera's data science team. See S... more

Webpage View Page

June 17, 2014

Excerpt: An update on community efforts to bring at-rest encryption to HDFS -- a major theme of Pr... more

Webpage View Page

June 12, 2014

Excerpt: Unique across all options, Cloudera Manager makes it easy to do what would otherwise be a... more

Webpage View Page

This Month in the Ecosystem (May 2014)

Justin Kestelyn (@kestelyn)

June 9, 2014

Excerpt: Welcome to our ninth edition of "... more

Webpage View Page

June 6, 2014

Excerpt: Thanks to Bill Podell, VP Big Data and BI Practice, MBI Solutions, for the guest post below.... more

Webpage View Page

June 4, 2014

Excerpt: Organizing your data inside Hadoop doesn't have to be hard -- Kite SDK helps you try out... more

Webpage View Page

Apache Spark 1.0 is Released

Justin Kestelyn (@kestelyn)

May 30, 2014

Excerpt: Spark 1.0 is its biggest release yet, with a list of new features for enterprise customer... more

Webpage View Page

May 30, 2014

Excerpt: A concise look at the differences between how Spark and MapReduce manage cluster resource... more

Webpage View Page

May 29, 2014

Excerpt: Impala continues to demonstrate performance leadership compared to alternatives (by 950%... more

Webpage View Page

May 27, 2014

Excerpt: Using an appropriate network representation and the right tool set are the key factors in... more

Webpage View Page

May 23, 2014

Excerpt: Yesterday, Parquet was... more

Webpage View Page

May 21, 2014

Excerpt: Learn how HiveServer, Apache Sentry, and Impala help make Hadoop play nicely with BI tool... more

Webpage View Page

May 19, 2014

Excerpt: Learn how to convert your data to the Parquet columnar format to get big performance gain... more

Webpage View Page

Meet the Data Scientist: Alan Paulsen

Ryan Goldman (@ClouderaU)

May 16, 2014

Excerpt: Meet Alan Paulsen, among the first to earn the CCP: Data Scientist distinction.... more

Webpage View Page

May 16, 2014

Excerpt: Cloudera's new "Designing and Building Big Data Applications" is a great springboard for... more

Webpage View Page

Using Impala at Scale at Allstate

Justin Kestelyn (@kestelyn)

May 15, 2014

Excerpt: Our thanks to Don Drake (@dondrake), an ind... more

Webpage View Page

May 14, 2014

Excerpt: Did you know that using the Crunch API is a powerful option for doing time-series analysi... more

Webpage View Page

May 12, 2014

Excerpt: The internals of Oozie's ShareLib have changed recently (reflected in CDH 5.0.0). Here's... more

Webpage View Page

This Month in the Ecosystem (April 2014)

Justin Kestelyn (@kestelyn)

May 9, 2014

Excerpt: Welcome to our eighth edition of "... more

Webpage View Page

How Apache Hadoop YARN HA Works

Justin Kestelyn (@kestelyn)

May 8, 2014

Excerpt: Thanks to recent work upstream, YARN is now a highly available service. This post explain... more

Webpage View Page

HBaseCon 2014 is a Wrap!

Justin Kestelyn (@kestelyn)

May 7, 2014

Excerpt: HBaseCon 2014 is in the books. Thanks to all attendees, speakers, and sponsors!... more

Webpage View Page

April 30, 2014

Excerpt: The new Python client for Impala will bring smiles to Pythonistas! As a... more

Webpage View Page

April 29, 2014

Excerpt: Thanks to Jonathan Natkins of WibiData for the post below a... more

Webpage View Page

April 28, 2014

Excerpt: More than 300 bug fixes and stable features in Apache Hive 0.13 have already been backpor... more

Webpage View Page

April 25, 2014

Excerpt: Thanks to Alexander Rubin of Percona for allowing us to... more

Webpage View Page

Meet the Engineer: Andrei Savu

Justin Kestelyn (@kestelyn)

April 23, 2014

Excerpt: In this installment of... more

Webpage View Page

April 21, 2014

Excerpt: Understanding some key differences between MR1 and MR2/YARN will make your migration much... more

Webpage View Page

April 18, 2014

Excerpt: The HBaseCon 2014 "Case Studies" track surfaces some of the most interesting (and diverse... more

Webpage View Page

April 17, 2014

Excerpt: Get started with Apache Hadoop and use-case examples online in just seconds.... more

Webpage View Page

April 15, 2014

Excerpt: Our thanks to Prashant Sharma and Matei Zaharia of Databr... more

Webpage View Page

April 15, 2014

Excerpt: Meet Stuart Horsman, among the first to earn the CCP: Data Scientist distinction.... more

Webpage View Page

April 14, 2014

Excerpt: Getting started with Spark (now shipping inside CDH 5) is easy using this simple example.... more

Webpage View Page

April 11, 2014

Excerpt: Improved scheduling capabilities via Oozie in CDH 5 makes for far fewer headaches.... more

Webpage View Page

Hello, Apache Hadoop 2.4.0

Justin Kestelyn (@kestelyn)

April 11, 2014

Excerpt: The community has voted to release Apache Hadoop 2.4.0. Hadoop 2.4.0 inc... more

Webpage View Page

April 10, 2014

Excerpt: The HBaseCon 2014 "Ecosystem" track offers a cross-section view of the most interesting p... more

Webpage View Page

Hue Flies High at Goibibo

Justin Kestelyn (@kestelyn)

April 9, 2014

Excerpt: Our thanks to Amar Parkash, a Software Developer at Goib... more

Webpage View Page

April 8, 2014

Excerpt: Our thanks to Janos Matyas, CTO and Founder of SequenceI... more

Webpage View Page

This Month in the Ecosystem (March 2014)

Justin Kestelyn (@kestelyn)

April 7, 2014

Excerpt: Welcome to our seventh edition of "... more

Webpage View Page

April 4, 2014

Excerpt: The HBaseCon 2014 "Features & Internals" track covers the newest developments in Apac... more

Webpage View Page

April 3, 2014

Excerpt: The GA release of Cloudera Enterprise 5 signifies the evolution of the platform from a me... more

Webpage View Page

April 1, 2014

Excerpt: The conclusion to this series covers how to use scans, and considerations for choosing th... more

Webpage View Page

March 28, 2014

Excerpt: The following post, by Sarah Cannon of Digital Reasoning, was originally... more

Webpage View Page

March 28, 2014

Excerpt: The integration of Apache Sentry with Apache Solr helps... more

Webpage View Page

March 27, 2014

Excerpt: HBaseCon 2014 "Operations" track reveals best practices used by some of the world's large... more

Webpage View Page

March 25, 2014

Excerpt: Meet David F. McCoy, one of the first to have earned the title "CCP: Data Scientist" from... more

Webpage View Page

March 25, 2014

Excerpt: Find Cloudera tech talks in Amsterdam, Boston, Berlin, Sao Paulo, Singapore, Zurich, and... more

Webpage View Page

Letting It Flow with Spark Streaming

Justin Kestelyn (@kestelyn)

March 24, 2014

Excerpt: Our thanks to Russell Cardullo and Michael Ruggiero, Data Infrastructure Engineers at... more

Webpage View Page

March 21, 2014

Excerpt: The CDH software stack lets you use your tool of choice with the Parquet file format - -... more

Webpage View Page

March 20, 2014

Excerpt: The HBaseCon 2014 General Session - with keynotes by Facebook, Google, and Salesforce.com... more

Webpage View Page

March 19, 2014

Excerpt: This quick demo illustrates how easy it is to implement role-based access and control in... more

Webpage View Page

Apache ZooKeeper Resilience at Pinterest

Justin Kestelyn (@kestelyn)

March 14, 2014

Excerpt: The guest post below was originally authored by Pinterest engineer Raghavendra Prabhu... more

Webpage View Page

March 14, 2014

Excerpt: Oozie's new HA qualities help cluster operators sleep well at night. Here's how it works... more

Webpage View Page

Apache Spark: A Delight for Developers

Justin Kestelyn (@kestelyn)

March 13, 2014

Excerpt: Sure, Spark is fast, but it also gives developers a positive experience they won't soon f... more

Webpage View Page

March 12, 2014

Excerpt: Cost-per-performance, not cost-per-capacity, turns out to be the better metric for evalua... more

Webpage View Page

Meet the Instructor: Bruce Martin

Ryan Goldman (@ClouderaU)

March 10, 2014

Excerpt: In this installment of "Meet the Instructor", our interview subject is Bruce Martin.... more

Webpage View Page

March 7, 2014

Excerpt: Welcome to our sixth edition of "... more

Webpage View Page

March 5, 2014

Excerpt: Understanding how checkpointing works in HDFS can make the difference between a healthy c... more

Webpage View Page

March 3, 2014

Excerpt: Spark is a compelling multi-purpose platform for use cases that span investigative, as we... more

Webpage View Page

February 28, 2014

Excerpt: Hue users can learn a lot about new features by following a steady stream of new demos.... more

Webpage View Page

February 25, 2014

Excerpt: Cloudera's own enterprise data hub is yielding great results for providing world-class cu... more

Webpage View Page

February 25, 2014

Excerpt: Hadoop 2.3.0 includes hundreds of new fixes and features, but none more important than HD... more

Webpage View Page

February 21, 2014

Excerpt: Learn how to use Cloudera Search along with RBL-JE to search and index documents in multi... more

Webpage View Page

February 20, 2014

Excerpt: Bringing Parquet support to Hive was a community effort that deserves congratulations!... more

Webpage View Page

February 18, 2014

Excerpt: Integrating Hue with LDAP can help make your secure Hadoop apps as widely consumed as pos... more

Webpage View Page

February 13, 2014

Excerpt: Thanks to the improvements described here, CDH 5 will ship with a version of MapReduce 2... more

Webpage View Page

February 12, 2014

Excerpt: This FAQ contains answers to the most frequently asked questions about the architecture a... more

Webpage View Page

February 10, 2014

Excerpt: Cloudera has released the Beta 2 version of Cloudera Enterprise 5 (comprises CDH 5.0.0 an... more

Webpage View Page

February 10, 2014

Excerpt: Migrating from the Hive CLI to Beeline isn't as simple as changing the executable name, b... more

Webpage View Page

February 7, 2014

Excerpt: Welcome to our fifth edition of... more

Webpage View Page

February 5, 2014

Excerpt: Create a test environment for writing and testing Giraph jobs, or just for playing around... more

Webpage View Page

February 3, 2014

Excerpt: Cloudera is announcing the general availability of support for Spark, bringing interactiv... more

Webpage View Page

January 30, 2014

Excerpt: Thanks to Xavier Clements of Wajam for allowing us to... more

Webpage View Page

January 28, 2014

Excerpt: Set up a CDH-based Hadoop cluster in less than an hour using VirtualBox and Cloudera Mana... more

Webpage View Page

Pro Tips for Pitching an HBaseCon Talk

Justin Kestelyn (@kestelyn)

January 27, 2014

Excerpt: These suggestions from the Program Committee offer an inside track to getting your talk a... more

Webpage View Page

How-to: Get Started Writing Impala UDFs

Justin Kestelyn (@kestelyn)

January 24, 2014

Excerpt: Cloudera provides docs and a sample build environment to help you get easily started writ... more

Webpage View Page

Meet the Engineer: Romain Rigaux

Justin Kestelyn (@kestelyn)

January 17, 2014

Webpage View Page

January 15, 2014

Excerpt: The Cloudera QuickStart VM is an important platform for learning any Hadoop-related curri... more

Webpage View Page

January 14, 2014

Excerpt: The third-annual HBaseCon is now open for business. Submit your paper or register today f... more

Webpage View Page

January 13, 2014

Excerpt: Impala’s speed now beats the fastest SQL-on-Hadoop alternatives. Test for yourself!... more

Webpage View Page

January 10, 2014

Excerpt: Welcome to our sixth edition of... more

Webpage View Page

January 9, 2014

Excerpt: The new Cloudera Developer Newsletter makes its debut in January 2014. D... more

Webpage View Page

January 8, 2014

Excerpt: Find Cloudera tech talks in Berlin, Budapest, London, Stockholm, Tokyo, and across the US... more

Webpage View Page

January 7, 2014

Excerpt: Join us at Cloudera's San Francisco office on Feb. 20 for tech talks, T-shirts, and adult... more

Webpage View Page

The Hadoop FAQ for Oracle DBAs

Gwen Shapira (@gwenshap)

January 6, 2014

Excerpt: Oracle DBAs, get answers to many of your most common questions about getting started with... more

Webpage View Page

A New Web UI for Spark

Justin Kestelyn (@kestelyn)

January 3, 2014

Excerpt: The team behind Hue, the open source Web UI that makes Apache Hadoop easier to use, strik... more

Webpage View Page

Top 10 Blog Posts of 2013

Justin Kestelyn (@kestelyn)

December 23, 2013

Excerpt: From Python, to ZooKeeper, to Impala, to Parquet, blog readers in 2013 were interested in... more

Webpage View Page

Accumulo Comes to CDH

Justin Kestelyn (@kestelyn)

December 20, 2013

Excerpt: Apache Accumulo is now generally available on CDH 4. Cloudera is pleased... more

Webpage View Page

Doing DevOps with Cloudera Manager

Justin Kestelyn (@kestelyn)

December 18, 2013

Excerpt: More and more customers are using automation/configuration management frameworks alongsid... more

Webpage View Page

December 16, 2013

Excerpt: CDK has a new monicker, but the goals remain the same. We are pleased to... more

Webpage View Page

How-to: Use Impala on Amazon EMR

Justin Kestelyn (@kestelyn)

December 16, 2013

Excerpt: Developers, rejoice: Impala is now available on EMR for testing and evaluation.... more

Webpage View Page

December 16, 2013

Excerpt: The new RImpala package brings the speed and interactivity of Impala to queries from R.... more

Webpage View Page

December 12, 2013

Excerpt: Learn the new features and enhancements in Cloudera Manager 5, including support for YARN... more

Webpage View Page

What are HBase Compactions?

Justin Kestelyn (@kestelyn)

December 11, 2013

Excerpt: The compactions model is changing drastically with CDH 5/HBase 0.96. Here's what you need... more

Webpage View Page

December 9, 2013

Excerpt: Welcome to our fifth edition of "This Month in the Ecosystem," a digest of highlights from Novemb... more

Webpage View Page

How-to: Get Started with Sentry in Hive

Justin Kestelyn (@kestelyn)

December 6, 2013

Excerpt: A quick on-ramp (and demo) for using the new Sentry module for RBAC in conjunction with H... more

Webpage View Page

December 5, 2013

Excerpt: Thanks to Marshall Bockrath-Vandegrift of advanced threat detection/malware company (and CDH... more

Webpage View Page

December 4, 2013

Excerpt: The second how-to in a series about using the Apache HBase Thrift API La... more

Webpage View Page

December 2, 2013

Excerpt: An overview of some of Cloudera's contributions to YARN that help support management of m... more

Webpage View Page

Things For Which We Are Thankful

Justin Kestelyn (@kestelyn)

November 27, 2013

Excerpt: Some things for which we are thankful, the 2013 edition (not listed in order): 1. The ent... more

Webpage View Page

November 25, 2013

Excerpt: You can use Hue and Cloudera Search to build your own integrated Big Data search app.... more

Webpage View Page

November 22, 2013

Excerpt: Get an overview of the available mechanisms for backing up data stored in Apache HBase, a... more

Webpage View Page

November 20, 2013

Excerpt: While XML is very good for standardizing the way Apache Oozie... more

Webpage View Page

November 19, 2013

Excerpt: Cloudera Manager 4.7 added support for managing... more

Webpage View Page

November 18, 2013

Webpage View Page

November 15, 2013

Excerpt: Our thanks to Telvis Calhoun, Zach Hanif, and Jason Trost of End... more

Webpage View Page

November 12, 2013

Excerpt: Welcome to our fourth edition of "This Month in the Ecosystem," a digest of highlights from Octob... more

Webpage View Page

November 11, 2013

Excerpt: We at Cloudera University have been busy lately, building and expanding our courses to help data... more

Webpage View Page

November 8, 2013

Excerpt: Our thanks to Concurrent Inc. for the how-to below about using Cascading Pattern with CDH. Cl... more

Webpage View Page

November 7, 2013

Excerpt: Cloudera Manager lets you add a YARN service in the same way you would add any other Clou... more

Webpage View Page

Sqooping Data with Hue

Abraham Elmahrek

November 6, 2013

Excerpt: Hue, the open source Web UI that makes Apache Hadoop easier to us... more

Webpage View Page

November 5, 2013

Excerpt: In the wake of the Strata + Hadoop World 2013 afterglow, speaker slides and video have been poste... more

Webpage View Page

November 5, 2013

Excerpt: In my... more

Webpage View Page

November 4, 2013

Excerpt: In software development, there is no substitute for having choices. Furthermore, freedom of choic... more

Webpage View Page

Tips for Debugging Distributed Systems

Justin Kestelyn (@kestelyn)

November 1, 2013

Excerpt: Among Cloudera's engineer-presenters at Strata + Hadoop World 2013 this week, Philip Zeyliger ("... more

Webpage View Page

Strata + Hadoop World 2013 in Pictures

Justin Kestelyn (@kestelyn)

October 30, 2013

Excerpt: For those of you attending virtually/in spirit, I thought it would be nice to bring you a selecti... more

Webpage View Page

October 30, 2013

Excerpt: Thanks to Victor Bittorf, a visiting graduate computer science student at Stanford University... more

Webpage View Page

October 29, 2013

Excerpt: We are pleased to announce the... more

Webpage View Page

October 25, 2013

Webpage View Page

October 25, 2013

Excerpt: The rise of Big Data has been pushing search engines to handle ever-increasing amounts of data. W... more

Webpage View Page

October 24, 2013

Webpage View Page

What are HBase znodes?

Matteo Bertozzi

October 23, 2013

Excerpt: Apache ZooKeeper is a client/server system for distribu... more

Webpage View Page

HBase 0.96.0 Released!

Justin Kestelyn (@kestelyn)

October 22, 2013

Excerpt: The following post, by Apache HBase 0.96 Release Manager/Cloudera Software Engineer Michael S... more

Webpage View Page

Parquet at Salesforce.com

Justin Kestelyn (@kestelyn)

October 22, 2013

Excerpt: The following Parquet blog post was... more

Webpage View Page

October 21, 2013

Excerpt: There are a number of special "users" with roles to play in the Apache Hadoop environment. For yo... more

Webpage View Page

Enabling SSO Authentication in Hue

Justin Kestelyn (@kestelyn)

October 18, 2013

Excerpt: There’s good news for users of Hue, the open sour... more

Webpage View Page

October 16, 2013

Excerpt: The release of Apache Hadoop 2,... more

Webpage View Page

October 16, 2013

Excerpt: In a fast-moving project like Apache Hadoop, there are always exciting new features introduced in... more

Webpage View Page

October 14, 2013

Excerpt: The following guest post is provided by Artur Barseghyan, a web developer currently employed... more

Webpage View Page

Explore the Impala App in Hue

Justin Kestelyn (@kestelyn)

October 11, 2013

Excerpt: The following post was originally published by the Hue Team at the ... more

Webpage View Page

October 10, 2013

Excerpt: The Cloudera Sessions fall series... more

Webpage View Page

October 9, 2013

Excerpt: Below please find our regularly scheduled quarterly update about where to find tech talks by Clou... more

Webpage View Page

Let a Thousand Hadoop How-Tos Bloom

Justin Kestelyn (@kestelyn)

October 7, 2013

Webpage View Page

October 4, 2013

Excerpt: Welcome to our third edition of "This Month in the Ecosystem," a digest of highlights from Septem... more

Webpage View Page

October 3, 2013

Excerpt: It’s common to hear people describe themselves as being “left-brained” or “right-brained... more

Webpage View Page

Meet the Project Founder: Josh Wills

Justin Kestelyn (@kestelyn)

October 3, 2013

Webpage View Page

October 1, 2013

Excerpt: I've always held a strong bias that education is most effective when the student learns by doing.... more

Webpage View Page

September 30, 2013

Excerpt: In December 2012, we... more

Webpage View Page

How-to: Use HBase Bulk Loading, and Why

Justin Kestelyn (@kestelyn)

September 27, 2013

Excerpt: Apache HBase is all about giving you random, real-time, rea... more

Webpage View Page

Email Indexing Using Cloudera Search

Justin Kestelyn (@kestelyn)

September 25, 2013

Excerpt: Why would any company be interested in searching through its vast trove of email? A better questi... more

Webpage View Page

September 24, 2013

Excerpt: In December 2012, while Cloudera Impala was still in its beta phase, we... more

Webpage View Page

September 23, 2013

Excerpt: This week’s Cloudera Sessions... more

Webpage View Page

How-to: Use the HBase Thrift Interface, Part 1

Jesse Anderson (@jessetanderson)

September 23, 2013

Excerpt: There are various way to access and interact with Apache HBase... more

Webpage View Page

Get Hired as a Certified Data Scientist

Justin Kestelyn (@kestelyn)

September 20, 2013

Excerpt: To paraphrase... more

Webpage View Page

September 19, 2013

Excerpt: Note: This post was originally published at blogs.apache.or... more

Webpage View Page

How-to: Manage HBase Data via Hue

Justin Kestelyn (@kestelyn)

September 18, 2013

Excerpt: The following post was originally published by the Hue Team at the... more

Webpage View Page

September 17, 2013

Excerpt: When building complex workflows in Apache Oozie, it is often useful to parameterize them so they... more

Webpage View Page

September 11, 2013

Excerpt: While Apache HBase adoption for building end-user applications has skyrocketed, many of those app... more

Webpage View Page

September 10, 2013

Excerpt: Welcome to the Cloudera Connect Webinar Series! Cloudera's platform touches every part of... more

Webpage View Page

September 10, 2013

Excerpt: We’re kicking off the second leg of our... more

Webpage View Page

September 10, 2013

Excerpt: Most people would not call a 100GB data file "Big Data" -- especially when preliminary filtering... more

Webpage View Page

Cloudera Manager 4.7 Released

Justin Kestelyn (@kestelyn)

September 6, 2013

Excerpt: Cloudera Manager 4.7 is an update to Cloudera Manager 4 and contains a number of bug fixes and us... more

Webpage View Page

September 6, 2013

Excerpt: Welcome to our second edition of "This Month in the Ecosystem." (See the inaugural edition... more

Webpage View Page

September 5, 2013

Excerpt: After three months of public beta, and months of private beta before that,... more

Webpage View Page

September 4, 2013

Excerpt: StackIQ takes a “software defined infrastructure” approach to provision and manage cluste... more

Webpage View Page

September 3, 2013

Excerpt: Cloudera and Cisco are announcing a joint solution today, the Cisco Validated Design (CVD) for Cl... more

Webpage View Page

September 3, 2013

Excerpt: Our thanks to Kishore Gopalakrishna, staff engineer at LinkedIn and one of the original devel... more

Webpage View Page

August 30, 2013

Excerpt: The guest post below is from Wei Yan, a 2013 summer intern at Cloudera. In this post, he help... more

Webpage View Page

Meet the Instructor: Nathan Neff

Ryan Goldman (@ClouderaU)

August 29, 2013

Webpage View Page

August 28, 2013

Excerpt: One of the first questions Cloudera customers raise when getting started with Apache Hadoop is ho... more

Webpage View Page

Hadoop 2 is Now a Beta

Justin Kestelyn (@kestelyn)

August 27, 2013

Excerpt: As announced last Sunday (Aug. 25) on... more

Webpage View Page

August 22, 2013

Excerpt: Apache HBase supports three primary client APIs that developers can use to bind applications with... more

Webpage View Page

August 21, 2013

Excerpt: Few projects within the Apache Hadoop umbrella have as much end-user visibility as... more

Webpage View Page

August 20, 2013

Excerpt: Catherine Ray, a Summer Intern at Cloudera this year, was kind enough to summarize her experi... more

Webpage View Page

August 19, 2013

Excerpt: The guest post below is provided by Justin Langseth, Founder & CEO of... more

Webpage View Page

August 16, 2013

Excerpt: One of the key principles behind Apache Hadoop is the idea that moving computation is cheaper tha... more

Webpage View Page

August 15, 2013

Excerpt: This week, I’d like to shine a spotlight on innovative work the N... more

Webpage View Page

August 14, 2013

Excerpt: The following guest post is re-published here courtesy of Gerd König, a System Engineer with... more

Webpage View Page

Meet the Project Founder: Tom White

Justin Kestelyn (@kestelyn)

August 12, 2013

Webpage View Page

August 9, 2013

Excerpt: It's been a couple of weeks since Cloudera's new Community Forums did a... more

Webpage View Page

New E-Learning for Parcels

Justin Kestelyn (@kestelyn)

August 8, 2013

Excerpt: Cloudera's new... more

Webpage View Page

Flexpod Select with Cloudera

Justin Kestelyn (@kestelyn)

August 7, 2013

Excerpt: Earlier this week, our partners NetApp and Cisco announced the Flexpod Select Family with support... more

Webpage View Page

August 7, 2013

Excerpt: This installment of the Hue demo series is about accessing the Hive Metastore from... more

Webpage View Page

August 6, 2013

Excerpt: Strata Confer... more

Webpage View Page

This Month in the Ecosystem

Justin Kestelyn (@kestelyn)

August 5, 2013

Excerpt: The ecosystem is evolving at a rapid pace - so rapidly, that important developments are often pas... more

Webpage View Page

August 2, 2013

Excerpt: One of the common questions I get from students and developers in my classes relates to IDEs and... more

Webpage View Page

August 1, 2013

Excerpt: The following guest post, from Mike Pittaro of Dell's Cloud Software Solutions team, describe... more

Webpage View Page

July 31, 2013

Webpage View Page

July 30, 2013

Excerpt: For those of you attending this week’s StampedeCon event... more

Webpage View Page

July 30, 2013

Excerpt: We're very happy to re-publish the following post from Twitter analytics infrastructure engin... more

Webpage View Page

Thanks for the Memories, #OSCON 2013

Justin Kestelyn (@kestelyn)

July 30, 2013

Excerpt: OSCON 2013 is already receding in the rear-view mirror, but we had a great time. Cloudera's sessi... more

Webpage View Page

July 29, 2013

Excerpt: This is a great day for technical end-users - developers, admins, analysts, and data scientists a... more

Webpage View Page

July 24, 2013

Excerpt: Every day, more data, users, and applications are accessing ever-larger Apache Hadoop clusters. A... more

Webpage View Page

July 22, 2013

Excerpt: The Data Warehousing Institute (TDWI) runs an annual... more

Webpage View Page

July 19, 2013

Excerpt: Editor's note (added Feb. 2, 2014): You can review the latest (and exciting) Impala performan... more

Webpage View Page

July 18, 2013

Excerpt: Apache Hive was one of the first projects to bring higher-l... more

Webpage View Page

July 17, 2013

Excerpt: For those people new to Apache HBase (version 0.90 and la... more

Webpage View Page

The Book on Apache Sqoop is Here!

Justin Kestelyn (@kestelyn)

July 15, 2013

Webpage View Page

July 12, 2013

Excerpt: At Cloudera, we believe that... more

Webpage View Page

July 11, 2013

Excerpt: This post is the first in a series of blog posts about Cloudera Morphlines, a new command-bas... more

Webpage View Page

July 10, 2013

Excerpt: Below please find our regularly scheduled quarterly update about where to find tech talks by Clou... more

Webpage View Page

July 9, 2013

Excerpt: Doug Cutting’s... more

Webpage View Page

July 8, 2013

Excerpt: This how-to is the third in a series that explores the use of the Apache HBase REST interface. ... more

Webpage View Page

One Engineer’s Experience with Parcel

Justin Kestelyn (@kestelyn)

July 2, 2013

Excerpt: We’re very pleased to bring you this guest post from Verisign engineer Benoit Perroud, whic... more

Webpage View Page

July 1, 2013

Excerpt: Thanks to Steven Noels, SVP of Products for NGDATA, for... more

Webpage View Page

June 27, 2013

Excerpt: Five years ago today, on June 27, 2008, we filed the incorporation paperwork for Cloudera, Inc.,... more

Webpage View Page

What a Great Year for Hue Users!

Justin Kestelyn (@kestelyn)

June 26, 2013

Webpage View Page

June 25, 2013

Excerpt: In this Customer Spotlight, I’d like to emphasize some undeniably positive use cases for Big Da... more

Webpage View Page

June 24, 2013

Excerpt: CDH, Cloudera's 100%... more

Webpage View Page

June 24, 2013

Webpage View Page

June 21, 2013

Excerpt: The following guest post is courtesy of Doug Meil, Chief Architect at... more

Webpage View Page

Demo: The New Search App in Hue 2.4

Justin Kestelyn (@kestelyn)

June 21, 2013

Excerpt: In version 2.4 of Hue, the open source Web UI that makes Apache H... more

Webpage View Page

June 20, 2013

Excerpt: For years, Cloudera has provided virtual machines that give you a working Apache Hadoop environme... more

Webpage View Page

Meetups at Hadoop Summit

Justin Kestelyn (@kestelyn)

June 19, 2013

Excerpt: Hadoop Summit convenes next week, and even if you... more

Webpage View Page

Welcome, Tom!

Mike Olson

June 18, 2013

Excerpt: We... more

Webpage View Page

Make Hadoop Your Best Business Tool

Ryan Goldman (@ClouderaU)

June 18, 2013

Excerpt: Data analysts and business intelligence specialists have been at the heart of new trends driving... more

Webpage View Page

June 17, 2013

Excerpt: Starting in CDH 4.2, YARN/MapReduce 2 (MR2) includes an even more powerful Fair Scheduler. In ad... more

Webpage View Page

The HBaseCon 2013 Afterglow

Justin Kestelyn (@kestelyn)

June 14, 2013

Webpage View Page

Customer Spotlight: It’s HBase Week!

Karina Babcock (@karinababcock)

June 11, 2013

Excerpt: This is the week of Apache HBase, with HBaseCon 2013 takin... more

Webpage View Page

June 11, 2013

Webpage View Page

June 10, 2013

Excerpt: Michael Stack is the chair of the Apache HBase PMC and has been a committer and project "care... more

Webpage View Page

June 7, 2013

Excerpt: Earlier this week, we hosted The Cloudera Forum to reveal Cloudera’s “... more

Webpage View Page

Meet the Engineer: Mark Miller

Justin Kestelyn (@kestelyn)

June 7, 2013

Webpage View Page

June 6, 2013

Excerpt: As you may know, Apache HBase has a vibrant community and gets a lot of contributions from develo... more

Webpage View Page

HBaseCon 2013: "Ecosystem" Track Preview

Justin Kestelyn (@kestelyn)

June 6, 2013

Webpage View Page

June 5, 2013

Excerpt: Yesterday we announced the... more

Webpage View Page

June 4, 2013

Excerpt: The news this morning focused on the launch of... more

Webpage View Page

June 4, 2013

Excerpt: Today is a big day: Cloudera is not only urging our customers to... more

Webpage View Page

June 4, 2013

Excerpt: One of the unexpected pleasures of open source development is the way that technologies adapt and... more

Webpage View Page

June 3, 2013

Excerpt: Helping users manage hundreds of configurations for the growing family of Apache Hadoop services... more

Webpage View Page

May 31, 2013

Excerpt: Assuming you have an email address, this week and last your inbox has probably been flooded with... more

Webpage View Page

HBaseCon 2013: "Internals" Track Preview

Justin Kestelyn (@kestelyn)

May 30, 2013

Webpage View Page

May 29, 2013

Excerpt: Our thanks to Jordan Zimmerman, software engineer at Netflix, for the guest post below about... more

Webpage View Page

CDH 4.3 is Released!

Charles Zedlewski

May 28, 2013

Excerpt: I’m pleased to announce that CDH 4.3 is released and... more

Webpage View Page

May 24, 2013

Excerpt: This week I’d like to highlight King.com, a European social gaming giant that... more

Webpage View Page

Demo: Apache Pig Editor in Hue 2.3

Justin Kestelyn (@kestelyn)

May 24, 2013

Excerpt: In the previous installment of the demo series about H... more

Webpage View Page

May 23, 2013

Webpage View Page

May 22, 2013

Excerpt: Have you ever wished you could upgrade to the latest CDH minor release with just a few mouse clic... more

Webpage View Page

May 21, 2013

Excerpt: Mark your calendars, all you data cyclists! I’m visiting Paris, London, and Edinburgh t... more

Webpage View Page

May 20, 2013

Excerpt: According to Jim Benedetto,... more

Webpage View Page

May 17, 2013

Webpage View Page

May 15, 2013

Excerpt: Contributing to Apache Hadoop or writing custom pluggable modules requires modifying Hadoop’s s... more

Webpage View Page

May 13, 2013

Excerpt: One of the complexities of Apache Hadoop is the need to deploy clusters of servers, potentially o... more

Webpage View Page

May 10, 2013

Excerpt: Our thanks to Etsy developer Brad Greenlee (@bgreenlee) for the post below. We think his Mac... more

Webpage View Page

Top 5 Reasons to Attend HBaseCon 2013

Justin Kestelyn (@kestelyn)

May 9, 2013

Webpage View Page

May 8, 2013

Excerpt: The post below was originally published at... more

Webpage View Page

Cloudera Partners and Impala: Alteryx

Justin Kestelyn (@kestelyn)

May 8, 2013

Excerpt: Our thanks to Brian Dirking, Director of Product Marketing for... more

Webpage View Page

Extending the Data Warehouse with Hadoop

Justin Kestelyn (@kestelyn)

May 7, 2013

Excerpt: "Are data warehouses becoming victims of their own success?", Tony Baer asks in a ... more

Webpage View Page

May 7, 2013

Excerpt: Editor's Note (Dec. 11, 2013): As of Dec. 2013, the Cloudera Development Kit... more

Webpage View Page

Cloudera Impala and Partners: Tableau

Justin Kestelyn (@kestelyn)

May 7, 2013

Excerpt: Our thanks to Ted Wasserman, product manager for Ta... more

Webpage View Page

May 6, 2013

Excerpt: This week, the Cloudera Sessions... more

Webpage View Page

Cloudera Partners and Impala: Talend

Justin Kestelyn (@kestelyn)

May 6, 2013

Excerpt: Our thanks to Yves de Montcheuil, Vice President of Marketing for... more

Webpage View Page

May 3, 2013

Excerpt: Our thanks to Kevin Spurway, Senior Vice President of Marketing for... more

Webpage View Page

May 2, 2013

Excerpt: This week represents quite a milestone for Cloudera and, at least we’d like to believe, the Had... more

Webpage View Page

May 2, 2013

Excerpt: On Monday April 29, Cloudera... more

Webpage View Page

April 30, 2013

Excerpt: It has been an exciting couple of days for new product announcements at Cloudera -- exciting espe... more

Webpage View Page

April 26, 2013

Excerpt: We're very happy to announce the 2.3 release of Hue, the open source... more

Webpage View Page

April 26, 2013

Excerpt: This post was originally published via blogs.apache.... more

Webpage View Page

April 23, 2013

Excerpt: Data scientists, that peculiar... more

Webpage View Page

April 22, 2013

Excerpt: As Cloudera’s keeper of customer stories, it’s dawned on me that others might benefit from th... more

Webpage View Page

April 22, 2013

Excerpt: Thanks to a dazzling array of excellent proposals from across the Apache HBase community, the... more

Webpage View Page

April 19, 2013

Excerpt: In the... more

Webpage View Page

April 18, 2013

Excerpt: Today Cloudera announced a new... more

Webpage View Page

April 18, 2013

Webpage View Page

April 17, 2013

Excerpt: This guest post comes from Alex Giamas, Senio... more

Webpage View Page

It’s Only Rock and Roll

Doug Cutting (@cutting)

April 15, 2013

Excerpt: It’s only Rock and Roll, but I like it!           - Mick Jagger... more

Webpage View Page

April 12, 2013

Excerpt: This how-to is the second in a series that explores the use of the Apache HBase REST interface. ... more

Webpage View Page

April 11, 2013

Excerpt: It's time for me to give you a quarterly update (... more

Webpage View Page

April 9, 2013

Excerpt: This guest post comes to us from David Greco, CTO of Elig... more

Webpage View Page

April 8, 2013

Excerpt: Managing and viewing data in... more

Webpage View Page

Congrats to OSCON 2013 Speakers!

Justin Kestelyn (@kestelyn)

April 5, 2013

Excerpt: Cloudera will be a proud exhibitor at O'... more

Webpage View Page

April 4, 2013

Excerpt: As a follow-up to a previous post about the Impala demo he built during Data Hacking Day, Ala... more

Webpage View Page

We Honor the Champions of Big Data!

Justin Kestelyn (@kestelyn)

April 2, 2013

Webpage View Page

March 29, 2013

Excerpt: Thanks to our friends at KDNuggets for pointing out that Cloudera is the... more

Webpage View Page

Meet the HBaseCon 2013 Program Committee

Justin Kestelyn (@kestelyn)

March 29, 2013

Excerpt: With HBaseCon 2013 (Early Bird registration now open!) pre... more

Webpage View Page

Meet the Engineer: Mark Grover

Justin Kestelyn (@kestelyn)

March 29, 2013

Webpage View Page

Phoenix in 15 Minutes or Less

Justin Kestelyn (@kestelyn)

March 28, 2013

Excerpt: The following FAQ is provided by James Taylor of Salesforce, which recently open-sourced its... more

Webpage View Page

March 26, 2013

Excerpt: Editor's Note (added Feb. 28, 2014): The instructions below are deprecated for Cloudera M... more

Webpage View Page

March 25, 2013

Excerpt: Hue 2.2 , the open sour... more

Webpage View Page

March 25, 2013

Excerpt: The following guest post comes to you from Alan Gardner of remote database services and consu... more

Webpage View Page

March 22, 2013

Excerpt: In this... more

Webpage View Page

March 22, 2013

Excerpt: Editor's note (12/19/2013): Cloudera ML has been merged into the... more

Webpage View Page

March 20, 2013

Excerpt: Hue is an open-source web interface for Apache Hado... more

Webpage View Page

How-to: Use Oozie Shell and Java Actions

Justin Kestelyn (@kestelyn)

March 18, 2013

Excerpt: Apache Oozie, the workflow coordinator for Apache Hadoop, h... more

Webpage View Page

March 15, 2013

Excerpt: Hadoop Summit Europe is coming up in Amsterdam n... more

Webpage View Page

March 13, 2013

Excerpt: Below you'll find the official announcement from Cloudera and Twitter about Parquet, an effic... more

Webpage View Page

March 12, 2013

Excerpt: There are various ways to access and interact with Apache HBase. The... more

Webpage View Page

March 8, 2013

Excerpt: Every growing, dynamic engineering culture needs a hackathon every once in a while.  Ear... more

Webpage View Page

March 7, 2013

Excerpt: The current (4.2) release of CDH -- Cloudera's 100% open-source distribution of Apache Hadoop and... more

Webpage View Page

March 6, 2013

Excerpt: Last week Cloudera released the 4.5 release of... more

Webpage View Page

March 5, 2013

Excerpt: Hadoop network encryption is a feature introduced in Apache Hadoop 2.0.2-alpha and in CDH4.1.... more

Webpage View Page

March 1, 2013

Excerpt: This post is about the new release of Hue, an open... more

Webpage View Page

February 27, 2013

Excerpt: It has been a while since I have blogged, primarily because we have been heads-down working towar... more

Webpage View Page

February 26, 2013

Excerpt: It has been a busy time for announcements coinciding with this week’s Strata conference. There... more

Webpage View Page

February 26, 2013

Excerpt: Today is an exciting day for Cloudera customers and users. With an update to our 100% open source... more

Webpage View Page

February 25, 2013

Excerpt: UPDATED 20130424: The new RHadoop treats output to Streaming a bit differently,... more

Webpage View Page

February 21, 2013

Excerpt: (Added Feb. 25 2013: Early Bird registration is now open - closes April 23, 2013!)... more

Webpage View Page

February 21, 2013

Excerpt: Now that Apache Hadoop is seven years old, use-case patterns for Big Data have emerged. In this p... more

Webpage View Page

February 20, 2013

Excerpt: Last week the Apache Hadoop PMC voted to release... more

Webpage View Page

February 15, 2013

Excerpt: Cloudera is proud to be a sponsor of Big Data... more

Webpage View Page

February 14, 2013

Excerpt: Organizations of all types and sizes are waking up to the idea that integrating the Apache Hadoop... more

Webpage View Page

From Zero to Impala in Minutes

Justin Kestelyn (@kestelyn)

February 7, 2013

Excerpt: This was post was originally published by U.C. Berk... more

Webpage View Page

February 6, 2013

Excerpt: This guest post is provided by Dave Nahmias, Pre-Sales and Partner Solutions Engineer at... more

Webpage View Page

A Ruby Client for Impala

Justin Kestelyn (@kestelyn)

February 4, 2013

Excerpt: Thanks to Stripe's Colin Marc (@colinmarc) for the guest post below, and for his work on the... more

Webpage View Page

January 30, 2013

Excerpt: In Part 1... more

Webpage View Page

January 28, 2013

Excerpt: Are you new to Apache Hadoop and need to start processing data fast and effectively? Have you bee... more

Webpage View Page

January 22, 2013

Excerpt: Clouderans are traveling the United States (and beyond) in droves during the first quarter of 201... more

Webpage View Page

January 18, 2013

Excerpt: I am pleased to announce the release of Cloudera Impala Beta (version 0.4) and Cloudera Manager 4... more

Webpage View Page

January 18, 2013

Excerpt: Our thanks to guest author Jon Natkins (@nattyice) of WibiData for the following post!... more

Webpage View Page

Meet the Instructor: Jesse Anderson

Ryan Goldman (@ClouderaU)

January 15, 2013

Webpage View Page

January 14, 2013

Excerpt: This following post was originally published via... more

Webpage View Page

Understanding MapReduce via Boggle

Jesse Anderson (@jessetanderson)

January 14, 2013

Excerpt: Graph theory is a growing part of Big Dat... more

Webpage View Page

January 11, 2013

Excerpt: The post below was originally published via ... more

Webpage View Page

January 10, 2013

Excerpt: For several good reasons, 2013 is a Happy New Year for Apache Hadoop enthusiasts. In 2012... more

Webpage View Page

January 9, 2013

Excerpt: (Update 2/6/2013 - Sorry, this event is sold out!) With... more

Webpage View Page

Meet the Engineer: Marcel Kornacker

Justin Kestelyn (@kestelyn)

January 8, 2013

Webpage View Page

January 7, 2013

Excerpt: I recently joined Cloudera after working in... more

Webpage View Page

Apache Bigtop 0.5.0 Has Been Released

Justin Kestelyn (@kestelyn)

January 3, 2013

Excerpt: The following post was originally published via... more

Webpage View Page

January 3, 2013

Excerpt: Hue is a web interface for... more

Webpage View Page

How-to: Use the ShareLib in Apache Oozie

Justin Kestelyn (@kestelyn)

December 18, 2012

Excerpt: Ed. Note: The post below pertains to CDH 4.x only.... more

Webpage View Page

December 14, 2012

Excerpt: It’s been an exciting month and a half since the launch of the Cloudera Impala (the new open so... more

Webpage View Page

December 14, 2012

Excerpt: This is the first post in series that will get you going on how to write, compile, and run a simp... more

Webpage View Page

Cloudera Speakers at ApacheCon NA 2013

Justin Kestelyn (@kestelyn)

December 13, 2012

Excerpt: Our hearty congratulations to the Cloudera engineers who have been accepted as... more

Webpage View Page

December 11, 2012

Excerpt: At Cloudera, we put great pride into drinking our own champagne. That pride extends to our suppor... more

Webpage View Page

December 7, 2012

Excerpt: Hue is a web interface for... more

Webpage View Page

December 6, 2012

Excerpt: We are very pleased to introduce new, CDH4.1-aligned versions of the... more

Webpage View Page

December 5, 2012

Excerpt: With the... more

Webpage View Page

December 4, 2012

Excerpt: I am pleased to announce the release of Cloudera Impala Beta (version 0.3) and Cloudera Manager 4... more

Webpage View Page

November 28, 2012

Excerpt: AssignmentManager is a module in the Apache HBase... more

Webpage View Page

November 28, 2012

Webpage View Page

November 27, 2012

Excerpt: The following post was... more

Webpage View Page

This Month in Data Science

Justin Kestelyn (@kestelyn)

November 27, 2012

Excerpt: Data science has been a ubiquitous topic of conversation in the IT and business worlds across the... more

Webpage View Page

November 26, 2012

Excerpt: The following is a guest post from Nils Kübler, the creator of the Hannibal project. He is s... more

Webpage View Page

November 20, 2012

Excerpt: The following is a re-post from... more

Webpage View Page

The "Ask Bigger Questions" Contest!

Justin Kestelyn (@kestelyn)

November 19, 2012

Excerpt: Have you helped your company ask bigger questions? Our mission at Cloudera University is to equip... more

Webpage View Page

November 19, 2012

Excerpt: Apache ZooKeeper release 3.4.5 is now available. This... more

Webpage View Page

November 14, 2012

Excerpt: Since the... more

Webpage View Page

November 13, 2012

Excerpt: The following is a re-post from Bob Gourley of ... more

Webpage View Page

November 13, 2012

Excerpt: I am pleased to announce the release of Cloudera Impala Beta (version 0.2) and Cloudera Manager 4... more

Webpage View Page

November 13, 2012

Excerpt: This is the third article in a series about analyzing Twitter data using some of the components o... more

Webpage View Page

November 7, 2012

Excerpt: (The following is a... more

Webpage View Page

November 6, 2012

Excerpt: [Updated Nov. 26, 2012: Sorry, this event has reached capacity and is now closed.]... more

Webpage View Page

November 5, 2012

Excerpt: The 2012 Strata + Hadoop World conference was w... more

Webpage View Page

November 1, 2012

Webpage View Page

October 31, 2012

Excerpt: Last week at Strata + Hadoop World 2... more

Webpage View Page

October 31, 2012

Excerpt: A few weeks back, Cloudera announced CDH 4.1, the latest update release to Cloudera's Distributio... more

Webpage View Page

October 24, 2012

Excerpt: After a long period of intense engineering effort and user feedback, we are very pleased, and pro... more

Webpage View Page

October 24, 2012

Excerpt: Today we’re proud to announce a new addition to the Apache Hadoop ecosystem:... more

Webpage View Page

MR2 and YARN Briefly Explained

Justin Kestelyn (@kestelyn)

October 24, 2012

Excerpt: With CDH4 onward, the Apache Hadoop component introduced two new terms for Hadoop users to wonder... more

Webpage View Page

Meet the Engineer: Todd Lipcon

Justin Kestelyn (@kestelyn)

October 24, 2012

Webpage View Page

October 21, 2012

Excerpt: Cloudera is co-presenting the sold-out... more

Webpage View Page

October 21, 2012

Excerpt: This is a guest post by Oliver Guinan, VP Ground Software, at Skybox Imaging. Oliver is a 15-... more

Webpage View Page

October 21, 2012

Excerpt: Earlier this month the Apache Hadoop PMC released... more

Webpage View Page

What’s New in CDH4.1 Hue

Justin Kestelyn (@kestelyn)

October 21, 2012

Excerpt: Hue is a Web-based interface that makes it easier t... more

Webpage View Page

What’s New in CDH4.1 Pig

Justin Kestelyn (@kestelyn)

October 21, 2012

Excerpt: Apache Pig is a platform for analyzing large data sets that... more

Webpage View Page

October 21, 2012

Excerpt: Axemblr, purveyors of a cloud-agnostic MapReduce Web Service, h... more

Webpage View Page

October 21, 2012

Excerpt: This is the second article in a series about analyzing Twitter data using some of the components... more

Webpage View Page

HBase at ApacheCon Europe 2012

Justin Kestelyn (@kestelyn)

October 21, 2012

Excerpt: Apache HBase will have a notable profile at ApacheCon Europe... more

Webpage View Page

Meet the Engineer: Todd Lipcon

Justin Kestelyn (@kestelyn)

October 21, 2012

Webpage View Page

New Additions to the Apache HBase Team

Justin Kestelyn (@kestelyn)

October 21, 2012

Excerpt: StumbleUpon (SU) and Cloudera have signed a technology collaboration agreement. Cloudera will sup... more

Webpage View Page

October 21, 2012

Excerpt: Note (added July 8, 2013): The information below is deprecated; we suggest that you refer... more

Webpage View Page

October 21, 2012

Excerpt: Our video animation factory has been busy lately. The embedded player below contains our two late... more

Webpage View Page

October 21, 2012

Excerpt: We at Cloudera are tremendously excited by the power of data to effect large-scale change in the... more

Webpage View Page

October 21, 2012

Excerpt: Metrics are collections of information about Hadoop daemons, events and measurements; for example... more

Webpage View Page

MR2 and YARN Briefly Explained

Justin Kestelyn (@kestelyn)

October 21, 2012

Excerpt: With CDH4 onward, the Apache Hadoop component introduced two new terms for Hadoop users to wonder... more

Webpage View Page

Applying Parallel Prediction to Big Data

Justin Kestelyn (@kestelyn)

October 5, 2012

Excerpt: This guest post is provided by Dan McClary, Principal Product Manager for Big Data and H... more

Webpage View Page

CDH4.1 Now Released!

Charles Zedlewski

October 1, 2012

Excerpt: Update time!  As a reminder, Cloudera releases major versions of CDH, our 100% open source distr... more

Webpage View Page

September 28, 2012

Excerpt: For those of you new to it, the Duke's Choice Awards... more

Webpage View Page

September 27, 2012

Excerpt: The post below was originally published via... more

Webpage View Page

September 25, 2012

Excerpt: With the default Apache HBase configuration, everyone is a... more

Webpage View Page

September 24, 2012

Excerpt: Apache ZooKeeper release 3.4.4 is now... more

Webpage View Page

Meet the Engineer: Jon Natkins

Justin Kestelyn (@kestelyn)

September 21, 2012

Webpage View Page

September 19, 2012

Excerpt: Social media has gained immense popularity with marketing teams, and Twitter is an effective tool... more

Webpage View Page

September 14, 2012

Excerpt: This guest post comes to us courtesy of Gwen Shapira (@gwenshap), a database consultant for... more

Webpage View Page

September 11, 2012

Excerpt: What's to love about Cloudera Ent... more

Webpage View Page

September 10, 2012

Excerpt: API access was a new feature introduced in Cloudera Manager 4.0 (download free edition... more

Webpage View Page

Meet the Engineer: Eric Sammer

Justin Kestelyn (@kestelyn)

September 7, 2012

Webpage View Page

September 5, 2012

Excerpt: Organizations in diverse industries have adopted Apache Hadoop-based systems for large-scale data... more

Webpage View Page

The Action on "HBase in Action"

Justin Kestelyn (@kestelyn)

September 4, 2012

Webpage View Page

August 30, 2012

Excerpt: Learn how to configure a basic Maven project that will be able to build applications agai... more

Webpage View Page

August 27, 2012

Excerpt: Today ZDNet has very helpfully published a... more

Webpage View Page

Meet the Engineer: Aaron T. Myers

Justin Kestelyn (@kestelyn)

August 23, 2012

Webpage View Page

August 21, 2012

Excerpt: Cloudera Manager 4.0.4 and Cloudera Manager 3.7.8 are now available! These are enhancement releas... more

Webpage View Page

August 21, 2012

Excerpt: The following is a guest post kindly offered by Adam Kawa, a 26-year old Hadoop developer fro... more

Webpage View Page

August 20, 2012

Excerpt: In June 2012, Eli Collins (@elicollins), from Cloudera's Platforms team, led a session at... more

Webpage View Page

August 16, 2012

Excerpt: This is the second blogpost about Apache HBase replication. The ... more

Webpage View Page

August 15, 2012

Excerpt: Hello World: This is my first post as the new guy facilitating and coordinating developer communi... more

Webpage View Page

August 13, 2012

Excerpt: We are happy to announce the general availability of CDH3 update 5. This update is a maintenance... more

Webpage View Page

August 13, 2012

Excerpt: This post was contributed by Bob Gourley, editor,... more

Webpage View Page

August 7, 2012

Excerpt: HttpFS is an HTTP gateway/proxy for Apache Hadoop FileSystem implementations. HttpFS comes with C... more

Webpage View Page

Column Statistics in Apache Hive

Shreepadma Venugopalan

August 3, 2012

Excerpt: Over the last couple of months the Hive team at Cloudera has been working hard to bring a bunch o... more

Webpage View Page

August 2, 2012

Excerpt: Apache ZooKeeper release 3.... more

Webpage View Page

August 2, 2012

Excerpt: Up to this point, we’ve described our reasons for using Hadoop and Hi... more

Webpage View Page

July 31, 2012

Excerpt: Introduction In this three-part series of posts, we will share our experiences tackling... more

Webpage View Page

July 30, 2012

Excerpt: Apache HBase Replication is a way of copying data from one HBase cluster to a different and possi... more

Webpage View Page

July 25, 2012

Excerpt: It’s not often the case that I have a chance to concur with my colleague E14 over at Hortonwork... more

Webpage View Page

July 19, 2012

Excerpt: We are pleased to announce the availability of Cloudera Manager 4.0.3. This is an enhancement rel... more

Webpage View Page

July 16, 2012

Excerpt: In ... more

Webpage View Page

July 12, 2012

Excerpt: In the recent blog post about the... more

Webpage View Page

July 11, 2012

Excerpt: At 5 pm PDT on June 30, a leap second was added to the Universal Coordinated Time (UTC). Within a... more

Webpage View Page

July 9, 2012

Excerpt: This is a guest re-post from Datameer's Director of Marketing, Rich Taylor. The original post... more

Webpage View Page

July 3, 2012

Excerpt: Apache Flume is a scalable, reliable, fault-tolerant, distributed system designed to collect, tra... more

Webpage View Page

July 2, 2012

Excerpt: Introduction Ever since Cloudera decided to contribute the code and resources for what... more

Webpage View Page

June 29, 2012

Excerpt: Introduction Apache HBase is the Hadoop open-source, distributed, versioned storage man... more

Webpage View Page

June 29, 2012

Excerpt: This blog was originally posted on the... more

Webpage View Page

June 26, 2012

Excerpt: This week, a team of researchers at Google will be presenting a paper describing a system they de... more

Webpage View Page

June 19, 2012

Excerpt: HBaseCon 2012 summation provided by Michael Stack, PMC Chair of the Apache HBase Project. HBa... more

Webpage View Page

June 18, 2012

Excerpt: Apache HBase is the Hadoop database, and is based on the Hadoop Distributed File... more

Webpage View Page

June 14, 2012

Excerpt: On Tuesday, June 12th The Churchill Club of Silicon Valley hosted a panel discussion on Hadoop's... more

Webpage View Page

June 11, 2012

Excerpt: Overview One of the major features of the upcoming Apache HBase 0.96 release is improve... more

Webpage View Page

June 5, 2012

Excerpt: I’m very pleased to... more

Webpage View Page

June 4, 2012

Excerpt: Hue 2.0.1 has just been... more

Webpage View Page

June 4, 2012

Excerpt: CopyTable is a simple Apache HBase utility that, unsurprisingly, can be used for copying individu... more

Webpage View Page

June 4, 2012

Excerpt: We are pleased to announce that Cloudera Manager 3.7.6 is now available! The most notable updates in... more

Webpage View Page

May 30, 2012

Excerpt: Warning: The procedure described below can cause data loss. Contact Cloudera Support befo... more

Webpage View Page

May 16, 2012

Excerpt: Apache HBase 0.94.0 has been released! This is the first major release since the January 22nd HBa... more

Webpage View Page

May 14, 2012

Excerpt: Today’s interview features Todd Lipcon, software engineer for Cloudera. Todd will be presenting... more

Webpage View Page

May 14, 2012

Excerpt: We're happy to announce the Beta release of Cloudera Manager 4.0.  This version of Clo... more

Webpage View Page

May 9, 2012

Excerpt: We are happy to officially announce the general availability of CDH3 update 4. This update consis... more

Webpage View Page

May 7, 2012

Excerpt: This was originally posted on the Hadoop Summit 2012... more

Webpage View Page

May 4, 2012

Excerpt: This past Monday marked the official release of Apache Hive 0.9.0. Users interested in taking t... more

Webpage View Page

May 3, 2012

Excerpt: This is a guest post by Assaf Yardeni, Head of R&D for Treato, an online social healthcar... more

Webpage View Page

May 1, 2012

Excerpt: This post was originally posted on the... more

Webpage View Page

April 25, 2012

Excerpt: HBaseCon 2012 is only a month away! The conference takes p... more

Webpage View Page

Introducing CDH4 Beta 2

Charles Zedlewski

April 24, 2012

Excerpt: I'm pleased to inform our users and customers that we have released the Cloudera's Distribution I... more

Webpage View Page

April 12, 2012

Excerpt: HBaseCon 2012 is nea... more

Webpage View Page

April 11, 2012

Excerpt: San Francisco seems to be having an unusually high number of... more

Webpage View Page

April 10, 2012

Excerpt: This blog was originally posted on the Apache Blog:... more

Webpage View Page

April 6, 2012

Excerpt: Cloudera will be hosting an Apache HBase... more

Webpage View Page

April 3, 2012

Excerpt: Apache Bigtop 0.3.0 (incubating) is now available. This is the first fully integrated, community-... more

Webpage View Page

April 2, 2012

Excerpt: This blog was originally posted on the Apache Blog: ... more

Webpage View Page

April 1, 2012

Excerpt: Introduction A few months ago, my colleague Charles Zedlewski wrote a... more

Webpage View Page

March 23, 2012

Excerpt: What's new? Apache HBase 0.92.1 is now available... more

Webpage View Page

March 21, 2012

Excerpt: Apache ZooKeeper release 3.... more

Webpage View Page

March 20, 2012

Excerpt: One of the more confusing topics in Hadoop is how authorization and authentication work in the sy... more

Webpage View Page

March 19, 2012

Excerpt: Apache HBase 0.90.6 is now available. It is a bug fix rele... more

Webpage View Page

March 14, 2012

Excerpt: Introduction Some of the configuration properties found in Apache Hadoop have a direct... more

Webpage View Page

March 8, 2012

Excerpt: We’re excited to host the first ever HB... more

Webpage View Page

March 7, 2012

Excerpt: Background Apache Hadoop consists of two primary components: H... more

Webpage View Page

March 5, 2012

Excerpt: Cloudera and Cisco jointly announced a reference architecture for running Cloudera's Distribution... more

Webpage View Page

March 2, 2012

Excerpt: Several weeks ago, I set about to demonstrate the ease with which... more

Webpage View Page

February 24, 2012

Excerpt: In... more

Webpage View Page

February 14, 2012

Excerpt: Apache ZooKeeper release 3.4.3 is now available. This is a bug fix release covering 18  issues, one of whi... more

Webpage View Page

February 14, 2012

Excerpt: Service and Configuration Management (Part I & II) We’ve recently recorded a series of demo videos int... more

Webpage View Page

Introducing CDH4

Charles Zedlewski

February 13, 2012

Excerpt: I’m pleased to inform our users and customers that Cloudera has released its 4th version of Cloudera’s... more

Webpage View Page

February 7, 2012

Excerpt: Earlier today, Cloudera proudly released the Cloudera Connector for Tableau. The availability of this connect... more

Webpage View Page

January 30, 2012

Excerpt: Keeping with our release policy for Cloudera’s Distribution Including Apache Hadoop (CDH) I’m plea... more

Webpage View Page

January 25, 2012

Excerpt: More than 150 people attended the San Francisco Bay Area HBase User Group meetup last Thursday, January 19th,... more

Webpage View Page

January 25, 2012

Excerpt: When most people first hear about data science, it’s usually in the context of how prominent web compani... more

Webpage View Page

January 24, 2012

Excerpt: Today the Apache HBase community has proudly released Apache HBase 0.92.0, a major new version of the scalable... more

Webpage View Page

January 18, 2012

Excerpt: Last November in New York City, Hadoop World, the largest conference of Apache Hadoop practitioners, developer... more

Webpage View Page

January 13, 2012

Excerpt: This blog was originally posted on the Apache Blog: https://blogs.apache.org/sqoop/entry/apache_sqoop_highlig... more

Webpage View Page

January 12, 2012

Excerpt: If you’re like a myriad of other systems administrators out there, you may be running a production Hadoo... more

Webpage View Page

January 11, 2012

Excerpt: Bala Venkatrao is the Director of Product Management at Cloudera . As many of you know, we recently launc... more

Webpage View Page

January 10, 2012

Excerpt: Cloudera users gain more choice, tighter Oracle integration. Cloudera partners gain increased validation of th... more

Webpage View Page

January 9, 2012

Excerpt: Great news! The InfoWorld Tech Center has chosen Apache Hadoop for a 2012 Technology of the Year Award . Judg... more

Webpage View Page

January 9, 2012

Excerpt: Great news! The InfoWorld Tech Center has chosen Apache Hadoop for a... more

Webpage View Page

Hadoop in 2011

Rob Weltman

January 9, 2012

Excerpt: 2011 was a breakthrough year for Apache Hadoop as many more mainstream organizations large and small turned to... more

Webpage View Page

January 9, 2012

Excerpt: 2011 was a breakthrough year for Apache Hadoop as many more mainstream organizations large and sm... more

Webpage View Page

January 8, 2012

Excerpt: Some users & customers have asked about the most recent release of Apache Hadoop, v1.0: what’s in it,... more

Webpage View Page

January 6, 2012

Excerpt: This was my summer internship project at Cloudera, and I’m very thankful for the level of support and me... more

Webpage View Page

January 6, 2012

Excerpt: This was my summer internship project at Cloudera, and I'm very thankful for the level of sup... more

Webpage View Page

January 5, 2012

Excerpt: Apache Sqoop (incubating) provides an efficient approach for transferring big data between Hadoop related sys... more

Webpage View Page

January 3, 2012

Excerpt: Part 1 of this post covered how to convert and store email messages for archival purposes using Apache Hadoop... more

Webpage View Page

January 3, 2012

Excerpt: Part... more

Webpage View Page

January 2, 2012

Excerpt: This blog was originally posted on the Apache Blog . Apache Sqoop recently celebrates its first incubator... more

Webpage View Page

December 30, 2011

Excerpt: Apache ZooKeeper release 3.4.2 is now available. This is a bug fix release covering 2 issues, one of w... more

Webpage View Page

December 28, 2011

Excerpt: Apache HBase 0.90.5 is now available.  This release of the scalable distributed data store ins... more

Webpage View Page

December 28, 2011

Excerpt: Apache HBase 0.90.5 is now available.  This is release of the scalable distributed data store... more

Webpage View Page

December 28, 2011

Excerpt: This is a guest post contributed by Loren Siebert. Loren is a San Francisco entrepreneur and software develope... more

Webpage View Page

December 28, 2011

Excerpt: This is a guest post contributed by Loren Siebert. Loren is a San Francisco entrepreneur and... more

Webpage View Page

December 27, 2011

Excerpt: Apache Whirr release 0.7.0 is now available. It includes changes covering over  50 issues , four... more

Webpage View Page

December 27, 2011

Excerpt: Apache Whirr release... more

Webpage View Page

December 22, 2011

Excerpt: This is a guest post from RichRelevance Principal Architect and Apache Avro PMC Chair Scott Carey. In Early... more

Webpage View Page

December 21, 2011

Excerpt: This blog was originally posted on the Apache Blog: https://blogs.apache.org/flume/entry/apache_flume_hackat... more

Webpage View Page

December 21, 2011

Excerpt: This blog was originally posted on the Apache Blog:... more

Webpage View Page

December 20, 2011

Excerpt: David joined us as part of our intern program , and built the prototype for the distributed log search functi... more

Webpage View Page

December 19, 2011

Excerpt: Apache ZooKeeper release 3.4.1 is now available: this is a fix release covering 7 issues, 2 of which w... more

Webpage View Page

December 13, 2011

Excerpt: Aparna Ramani is the Director of Engineering for Cloudera Enterprise. Cloudera Manager 3.7, a major new ver... more

Webpage View Page

December 9, 2011

Excerpt: This blog was originally posted on the Apache Blog: https://blogs.apache.org/flume/entry/flume_ng_architectur... more

Webpage View Page

December 9, 2011

Excerpt: This guide is intended to be an introduction to Crunch. Introduction Crunch is used for processing data. C... more

Webpage View Page

December 6, 2011

Excerpt: This guest blog post is from Alex Loddengaard , creator of FoneDoktor , an Android app that monitors phone u... more

Webpage View Page

FoneDoktor, A WibiData Application

Jon Zuanich (@jonzuanich)

December 6, 2011

Excerpt: This guest blog post is from Alex Loddengaard, crea... more

Webpage View Page

December 2, 2011

Excerpt: San Francisco, Salesforce.com HQ - Recently there was an Apache HBase Pow-wow where project contributors gath... more

Webpage View Page

November 30, 2011

Excerpt: The amount of information we are exposed to on a daily basis is far outstripping our ability to consume it, le... more

Webpage View Page

November 29, 2011

Excerpt: Apache ZooKeeper release 3.3.4 is now available: this is a fix release covering 22 issues , 9 of w... more

Webpage View Page

November 23, 2011

Excerpt: Apache ZooKeeper release 3.4.0 is now available: it includes changes covering over  150 issues , 27 of... more

Webpage View Page

November 23, 2011

Excerpt: This blog was originally posted on the Apache Blog:   https://blogs.apache.org/sqoop/entry/inaugural_sqoo... more

Webpage View Page

November 17, 2011

Excerpt: The Apache Hive team is hard at work putting the finishing touches on the 0.8.0 release. While the release h... more

Webpage View Page

November 16, 2011

Excerpt: Last month at the Web 2.0 Summit in San Francisco, Cloudera CEO Mike Olson  presented some work the Clou... more

Webpage View Page

November 16, 2011

Excerpt: The third annual  Hadoop World conference has come and gone. The two days of conference keynotes and ses... more

Webpage View Page

November 16, 2011

Excerpt: A number of architectural changes have been added to Hadoop MapReduce. The new MapReduce system is called MR2... more

Webpage View Page

November 16, 2011

Excerpt: A number of architectural changes have been added to Hadoop MapReduce. The new MapReduce system i... more

Webpage View Page

November 15, 2011

Excerpt: The Apache Hadoop PMC has voted to release Apache Hadoop 0.23.0 . This release is significant since it is the... more

Webpage View Page

November 3, 2011

Excerpt: Cloudera believes that the flexibility and power of Apache Mahout (http://mahout.apache.org/) in conjunct... more

Webpage View Page

October 27, 2011

Excerpt: Several meetups for Apache Hadoop and Hadoop-related projects are scheduled for the evenings surrounding Ha... more

Webpage View Page

CDH3 update 2 is released

Charles Zedlewski

October 21, 2011

Excerpt: Continuing with our practice from Cloudera’s Distribution Including Apache Hadoop v2 (CDH2), our goal is... more

Webpage View Page

October 19, 2011

Excerpt: Check out the Hadoop World 2011 conference agenda! Find sessions of interest and begin planning your Hadoop... more

Webpage View Page

October 13, 2011

Excerpt: This post was contributed by Bob Gourley, editor, CTOvision.com . The missions and data of gover... more

Webpage View Page

October 12, 2011

Excerpt: The Development track at Hadoop World is a technical deep dive dedicated to discussion about Apache Hadoop and... more

Webpage View Page

October 10, 2011

Excerpt: As a data scientist at Cloudera, I work with customers across a wide range of industries that use Hadoop to so... more

Webpage View Page

October 10, 2011

Excerpt: As a data scientist at Cloudera, I work with customers across a wide range of industries that use... more

Webpage View Page

October 6, 2011

Excerpt: This post provides a high-level overview of Apache Sqoop (incubating). It discusses the general problem addres... more

Webpage View Page

October 4, 2011

Excerpt: The Enterprise Architecture track at Hadoop World 2011 will provide insight into how Hadoop is powering tod... more

Webpage View Page

October 3, 2011

Excerpt: Owen O’Malley recently collected and analyzed information in the Apache Hadoop project commit logs and... more

Webpage View Page

October 3, 2011

Excerpt: This post was written by Daniel Jackoway following his internship at Cloudera during the summer of 2011. Wh... more

Webpage View Page

September 29, 2011

Excerpt: Business Solutions is a Hadoop World 2011 track geared towards business strategists and decision makers. Sess... more

Webpage View Page

September 28, 2011

Excerpt: This post will explore a specific use case for Apache Hadoop, one that is not commonly recognized, but is gain... more

Webpage View Page

September 28, 2011

Excerpt: This post will explore a specific use case for Apache Hadoop, one that is not commonly recognized... more

Webpage View Page

September 27, 2011

Excerpt: The Hadoop World train is approaching the station! Remember to mark November 8 th and 9 th in your calendars... more

Webpage View Page

September 20, 2011

Excerpt: BusinessWeek recently published a fascinating article on Hadoop and Big Data, interviewing several Cloudera... more

Webpage View Page

September 20, 2011

Excerpt: BusinessWeek recently published a fascinating... more

Webpage View Page

September 20, 2011

Excerpt: Unstructured data is the fastest growing type of data generated today. The growth rate of text, documents, ima... more

Webpage View Page

September 15, 2011

Excerpt: Snappy is a compression library developed at Google, and, like many technologies that come from Google, Snappy... more

Webpage View Page

September 13, 2011

Excerpt: Make the most of your week in New York City by combining the Hadoop World 2011 conference with training class... more

Webpage View Page

September 13, 2011

Excerpt: Make the most of your week in New York City by combining the Hadoop World 2011 conference with... more

Webpage View Page

September 7, 2011

Excerpt: Attendees of Hadoop World will receive a free copy of either  Hadoop, The Definitive Guide (2nd edition)... more

Webpage View Page

August 30, 2011

Excerpt: The 3rd annual Hadoop World conference takes place on November 8th and 9th in New York City. Cloudera invites... more

Webpage View Page

August 10, 2011

Excerpt: Ari Rabkin is a summer intern at Cloudera, working with the engineering team to help make Hadoop more usable a... more

Webpage View Page

CDH3 Update 1 Released

Charles Zedlewski

July 22, 2011

Excerpt: Announcing an update to CDH3.... more

Webpage View Page

July 20, 2011

Excerpt: What is Hoop? Hoop provides access to all Hadoop Distributed File System (HDFS) operations (read and write)... more

Webpage View Page

July 13, 2011

Excerpt: This post was contributed by Michael Cafarella, an assistant professor of computer science at the University o... more

Webpage View Page

July 12, 2011

Excerpt: Pero works on research and development in new technologies for online advertising at Aol Advertising R&D... more

Webpage View Page

July 11, 2011

Excerpt: Philip Zeyliger is a software engineer at Cloudera and started the SCM project. Two weeks ago, at Hadoop S... more

Webpage View Page

July 5, 2011

Excerpt: The ecosystem around Apache Hadoop has grown at a tremendous rate. Folks now can use many different pieces of... more

Webpage View Page

July 5, 2011

Excerpt: Phil Langdale is a software engineer at Cloudera and the technical lead for Cloudera’s SCM Express produc... more

Webpage View Page

July 5, 2011

Excerpt: Drew O’Brien is a product marketing manager at Cloudera We’re excited to share the news about the... more

Webpage View Page

June 28, 2011

Excerpt: This is a guest repost from Shopzilla’s Tech Blog written by Andrew Look, a Software Engineer at Shop... more

Webpage View Page

June 24, 2011

Excerpt: Ed Albanese leads business development for Cloudera. He is responsible for identifying new markets, revenue op... more

Webpage View Page

June 24, 2011

Excerpt: Bala Venkatrao is the director of product management at Cloudera . I had the pleasure of attending Enzee U... more

Webpage View Page

June 22, 2011

Excerpt: This post was contributed by Jennie Cochran-Chinn and Joe Crobak. They are part of the team building out Adco... more

Webpage View Page

June 22, 2011

Excerpt: This post was contributed by Jennie Cochran-Chinn and Joe Crobak. They are part of the team b... more

Webpage View Page

June 21, 2011

Excerpt: This post was contributed by The Global Biodiversity Information Facility development team. The Global Bio... more

Webpage View Page

June 21, 2011

Excerpt:   This post was contributed by The Global Biodiversity Information Facility deve... more

Webpage View Page

June 2, 2011

Excerpt: The first task is to ensure that your system is up-to-date. This procedure has been tested on the following... more

Webpage View Page

May 25, 2011

Excerpt: Take advantage of the opportunity to become a Cloudera Certified Developer or Administrator for Apache Hadoop... more

Webpage View Page

May 25, 2011

Excerpt: Take advantage of the opportunity to become a Cloudera Certified Developer or Administrator for A... more

Webpage View Page

May 15, 2011

Excerpt: Background Klout’s goal is to be the standard for influence. The advent of social media has created... more

Webpage View Page

May 15, 2011

Excerpt: Background Klout's goal is to be the... more

Webpage View Page

May 13, 2011

Excerpt: This is a guest repost from the DataXu blog. Click here to view the original post. I recently evaluated... more

Webpage View Page

May 11, 2011

Excerpt: Cloudera is offering several training courses for Apache Hadoop over the dates surrounding Hadoop Summit. Th... more

Webpage View Page

April 28, 2011

Excerpt: This is a guest post from Mike Segel, an attendee of Chicago Data Summit. Earlier this week, Cloudera hoste... more

Webpage View Page

April 25, 2011

Excerpt: Do you know the answer? Many prominent projects (e.g. Hive, Pig) were sub-projects of Hadoop before becomi... more

Webpage View Page

April 20, 2011

Excerpt: I recently gave a talk at the LA Hadoop User Group about HBase Do’s and Don’ts . The audience was... more

Webpage View Page

April 20, 2011

Excerpt: I recently gave a talk at the LA Hadoop U... more

Webpage View Page

CDH3 goes GA

Mike Olson

April 12, 2011

Excerpt: I am very pleased to announce the general availability of Cloudera’s Distribution including Apache Hadoo... more

Webpage View Page

April 11, 2011

Excerpt: Simple Moving Average, Secondary Sort, and MapReduce (Part 3) by Josh Patterson... more

Webpage View Page

April 11, 2011

Excerpt: This is the final piece to a three part blog series. If you would like to view the previous p... more

Webpage View Page

April 5, 2011

Excerpt: Adopting Apache Hadoop in the Federal Government by Jon Zuanich April 05... more

Webpage View Page

MapIncrease

ibmwatson

April 1, 2011

Excerpt: Puny humans. SSL and Wordpress authorization will keep me out of your blog question mark. I do not think so.... more

Webpage View Page

March 30, 2011

Excerpt: London Apache Hadoop User Group Meeting Summarized by Jon Zuanich March... more

Webpage View Page

March 29, 2011

Excerpt: If you find yourself in the Chicago area later this month, please join us at the Chicago Data Summit on Apri... more

Webpage View Page

We messed up.

Mike Olson

March 25, 2011

Excerpt: We messed up. by Mike Olson March 25, 2011 no comments... more

Webpage View Page

March 23, 2011

Excerpt: Rapleaf Uses Hadoop to Efficiently Scale with Terabytes of Data by Jon Zuanich... more

Webpage View Page

March 16, 2011

Excerpt: Simple Moving Average, Secondary Sort, and MapReduce (Part 2) by Josh Patterson... more

Webpage View Page

March 14, 2011

Excerpt: Simple Moving Average, Secondary Sort, and MapReduce (Part 1) by Josh Patterson... more

Webpage View Page

March 7, 2011

Excerpt: This is the third and final post in a series detailing a recent improvement in Apache HBase that helps to redu... more

Webpage View Page

March 7, 2011

Excerpt: This is the third and final post in a series detailing a recent improvement in Apache HBase that... more

Webpage View Page

March 1, 2011

Excerpt: Flume Community Office Hours @ Cloudera HQ, 2/28/2011 by Jonathan Hsieh... more

Webpage View Page

February 28, 2011

Excerpt: This is the second post in a series detailing a recent improvement in Apache HBase that helps to reduce the fr... more

Webpage View Page

February 25, 2011

Excerpt: Supported Operating Systems in CDH3 by Eli Collins February 25, 2011... more

Webpage View Page

February 25, 2011

Excerpt: While Cloudera's Distribution including Apache Hadoop (CDH) operating system support is... more

Webpage View Page

February 25, 2011

Excerpt: Gratuitous Hadoop: Stress Testing on the Cheap with Hadoop Streaming and EC2 by Jo... more

Webpage View Page

February 24, 2011

Excerpt: Today, rather than discussing new projects or use cases built on top of CDH, I'd like to switch gears a bit an... more

Webpage View Page

February 24, 2011

Excerpt: Today, rather than discussing new projects or use cases built on top of CDH, I'd like to switch g... more

Webpage View Page

February 22, 2011

Excerpt: CDH3 Beta 4 Now Available by Todd Lipcon February 22, 2011 1 c... more

Webpage View Page

February 17, 2011

Excerpt: Log Event Processing with HBase by Jon Zuanich February 17, 2011... more

Webpage View Page

February 17, 2011

Excerpt: This post was authored by Dmitry Chechik, a software engineer at TellApart, the leading Custo... more

Webpage View Page

February 16, 2011

Excerpt: An emerging data management architectural pattern behind interactive web applications... more

Webpage View Page

February 16, 2011

Excerpt: The user-data connection is driving NoSQL database-Hadoop pairing... more

Webpage View Page

February 15, 2011

Excerpt: Strategies for Exploiting Large-scale Data in the Federal Government by Jon Zuanic... more

Webpage View Page

February 14, 2011

Excerpt: Cloudera in The Cube with Silicon Angle TV at Strata Conference 2011 by Jon Zuanic... more

Webpage View Page

February 11, 2011

Excerpt: Wordnik Bypasses Processing Bottleneck with Hadoop by Jon Zuanich Februa... more

Webpage View Page

February 11, 2011

Excerpt: This post is courtesy of Kumanan Rajamanikkam, Lead Engineer at... more

Webpage View Page

February 10, 2011

Excerpt: Hadoop Availability by Eli Collins February 10, 2011 1 comment... more

Webpage View Page

February 10, 2011

Excerpt: A common question on the Apache Hadoop mail... more

Webpage View Page

February 7, 2011

Excerpt: Distributed Flume Setup With an S3 Sink by Jonathan Hsieh February 07, 2... more

Webpage View Page

February 3, 2011

Excerpt: Make your Hadoop voice heard! by Jon Zuanich February 03, 2011... more

Webpage View Page

February 3, 2011

Excerpt: Apache Hadoop is increasingly being adopted for storage and processing of large-scale complex dat... more

Webpage View Page

February 2, 2011

Excerpt: Upcoming Apache Hadoop Training Sessions by Jon Zuanich February 02, 201... more

Webpage View Page

February 2, 2011

Excerpt: Some News Related to the Apache Hadoop Project by Charles Zedlewski Febr... more

Webpage View Page

January 28, 2011

Excerpt: CDH2 Update 3 Now Available by Eli Collins January 28, 2011 1... more

Webpage View Page

January 26, 2011

Excerpt: Lessons Learned from Cloudera’s Hadoop Developer Training Course by Jon Zuan... more

Webpage View Page

January 21, 2011

Excerpt: Introducing Alfredo, Kerberos HTTP SPNEGO for Java by Alejandro Abdelnur... more

Webpage View Page

January 21, 2011

Excerpt: What is Kerberos & SPNEGO?... more

Webpage View Page

January 19, 2011

Excerpt: We blogged about 104 different topics in 2010 and we recently decided to take a look back and see what folks w... more

Webpage View Page

January 17, 2011

Excerpt: Hadoop I/O: Sequence, Map, Set, Array, BloomMap Files by Jon Zuanich Jan... more

Webpage View Page

January 11, 2011

Excerpt: How to Include Third-Party Libraries in Your Map-Reduce Job by Alex Kozlov... more

Webpage View Page

January 11, 2011

Excerpt: "My library is in the classpath but I still get a Class Not Found exception in a MapReduce job" -... more

Webpage View Page

January 10, 2011

Excerpt: Setting up CDH3 Hadoop on my new Macbook Pro by Jon Zuanich January 10,... more

Webpage View Page

January 7, 2011

Excerpt: Post written by Cloudera Software Engineer Aaron T. Myers. Apache Hadoop has had methods of doing user aut... more

Webpage View Page

Configuring Security Features in CDH3

Jon Zuanich (@jonzuanich)

January 7, 2011

Excerpt: Post written by Cloudera Software Engineer Aaron T. Myers. Apac... more

Webpage View Page

January 6, 2011

Excerpt: 2010 Cloudera Apache Hadoop Webinars by Jon Zuanich January 06, 2011... more

Webpage View Page

January 5, 2011

Excerpt: Map-Reduce With Ruby Using Apache Hadoop by Jon Zuanich January 05, 2011... more

Webpage View Page

December 21, 2010

Excerpt: New Features in Apache Pig 0.8 by John Kreisa December 21, 2010... more

Webpage View Page

December 15, 2010

Excerpt: A profile of Apache Hadoop MapReduce computing efficiency (continued) by Jon Zuani... more

Webpage View Page

December 14, 2010

Excerpt: A profile of Apache Hadoop MapReduce computing efficiency by Jon Zuanich... more

Webpage View Page

December 7, 2010

Excerpt: Cloudera and Pentaho team up to simplify data management and business intelligence... more

Webpage View Page

December 6, 2010

Excerpt: Lessons learned putting Hadoop into production by Jon Zuanich December 0... more

Webpage View Page

December 2, 2010

Excerpt: Hadoop World 2010 Tweet Analysis by Jon Zuanich December 02, 2010... more

Webpage View Page

November 29, 2010

Excerpt: Hadoop Log Location and Retention by Lars George November 29, 2010... more

Webpage View Page

November 24, 2010

Excerpt: Hadoop training coming to new cities in 2011 by Jon Zuanich November 24,... more

Webpage View Page

November 24, 2010

Excerpt: Due to expanding interest and demand for Apache Hadoop knowledge and skills across the mid-west a... more

Webpage View Page

November 18, 2010

Excerpt: Do the Schimmy: Efficient Large-Scale Graph Analysis with Hadoop, Part 2 by Jon Zu... more

Webpage View Page

November 18, 2010

Excerpt: Continued Guest Post from Michael Schatz and... more

Webpage View Page

November 17, 2010

Excerpt: Hadoop and HBase at RIPE NCC by Todd Lipcon November 17, 2010... more

Webpage View Page

November 15, 2010

Excerpt: Do the Schimmy: Efficient Large-Scale Graph Analysis with Hadoop by Jon Zuanich... more

Webpage View Page

November 8, 2010

Excerpt: Integrating Hadoop in your Existing DW and BI Environment by Gretchen Malay... more

Webpage View Page

November 8, 2010

Excerpt: Organizations are looking for a cost-effective way to deal with data that are now arriving in an... more

Webpage View Page

November 4, 2010

Excerpt: Better Workflow Management in CDH with Oozie 2 by Alejandro Abdelnur Nov... more

Webpage View Page

November 2, 2010

Excerpt: Tackling Large Scale Data in Government by Jon Zuanich November 02, 2010... more

Webpage View Page

November 1, 2010

Excerpt: Cloudera Fun & Frightful Halloween Festivities by Jon Zuanich Novem... more

Webpage View Page

October 26, 2010

Excerpt: Hadoop Lab at JavaOne by Jon Zuanich October 26, 2010 no comme... more

Webpage View Page

October 26, 2010

Excerpt: Guest post by Daniel Templeton, Product Manager at Oracl... more

Webpage View Page

October 16, 2010

Excerpt: Hadoop World 2010: An Unqualified Success by Jon Zuanich October 16, 201... more

Webpage View Page

October 12, 2010

Excerpt: CDH3 beta 3 now available by Todd Lipcon October 12, 2010 no c... more

Webpage View Page

October 11, 2010

Excerpt: Hadoop: The Definitive Guide, Second Edition by Tom White October 11, 20... more

Webpage View Page

October 8, 2010

Excerpt: Afternoon Hadoop World — Possible Path Through Great Content by Jon Zuanich... more

Webpage View Page

October 6, 2010

Excerpt: One Possible Hadoop World Morning Path by Jon Zuanich October 06, 2010... more

Webpage View Page

September 30, 2010

Excerpt: Hadoop World: More is better! by Gretchen Malay September 30, 2010... more

Webpage View Page

September 27, 2010

Excerpt: Top 10 Reasons to Attend Hadoop World by Jon Zuanich September 27, 2010... more

Webpage View Page

September 23, 2010

Excerpt: Twitter Analytics Lead, Kevin Weil, and a Presenter at Hadoop World Interviewed by... more

Webpage View Page

September 22, 2010

Excerpt: More on Cloudera Enterprise by Charles Zedlewski September 22, 2010... more

Webpage View Page

September 21, 2010

Excerpt: What’s Going On Surrounding Hadoop World by Jon Zuanich September 2... more

Webpage View Page

September 20, 2010

Excerpt: What is in our Kitchen? by Chad Metcalf September 20, 2010 no... more

Webpage View Page

September 17, 2010

Excerpt: Flume is a flexible, scalable, and reliable system for collecting streaming data.   The  Flume User... more

Webpage View Page

September 16, 2010

Excerpt: HUE SDK Training – NYC by Jon Zuanich September 16, 2010... more

Webpage View Page

September 14, 2010

Excerpt: CDH2 Update 2 Now Available by Eli Collins September 14, 2010... more

Webpage View Page

September 14, 2010

Excerpt: Hadoop World Presentation Track Release by Jon Zuanich September 14, 201... more

Webpage View Page

September 10, 2010

Excerpt: A Summer Internship with Cloudera by Jon Zuanich September 10, 2010... more

Webpage View Page

September 9, 2010

Excerpt: New York Training Session for Managers Interested In Hadoop by Jon Zuanich... more

Webpage View Page

September 8, 2010

Excerpt: Flume community update: September 2010 by jon September 08, 2010... more

Webpage View Page

September 7, 2010

Excerpt: Purdue University’s Saptarshi Guha Interviewed Regarding Hadoop, R and Hadoop World... more

Webpage View Page

September 6, 2010

Excerpt: A Look Back at August Posts by Jon Zuanich September 06, 2010... more

Webpage View Page

September 3, 2010

Excerpt: Tracing with Avro by Jon Zuanich September 03, 2010 no comment... more

Webpage View Page

September 3, 2010

Excerpt: Written by Patrick Wendell, an amazing summer intern with Cloudera and an Avro Commit... more

Webpage View Page

September 2, 2010

Excerpt: Infochimp’s President, Philip Kromer, Interviewed Regarding Hadoop and Hadoop World... more

Webpage View Page

September 1, 2010

Excerpt: Register for Hadoop Training in New York and Get into Hadoop World for Free! by Jo... more

Webpage View Page

August 30, 2010

Excerpt: Hadoop World 2010: Speaker Highlights by Jon Zuanich August 30, 2010... more

Webpage View Page

August 26, 2010

Excerpt: What’s New in Apache Hadoop 0.21 by Tom White August 26, 2010... more

Webpage View Page

August 24, 2010

Excerpt: Learn about fraud and how to prevent it with Hadoop... more

Webpage View Page

August 24, 2010

Excerpt: Fraud has multiple meanings and the term can be easily abused.  The definition of fraud has unde... more

Webpage View Page

August 24, 2010

Excerpt: Hadoop Administrator Training Comes to London by Jon Zuanich August 24,... more

Webpage View Page

August 24, 2010

Excerpt: Cloudera’s... more

Webpage View Page

August 23, 2010

Excerpt: Improving Hotel Search: Hadoop @ Orbitz Worldwide by John Kreisa August... more

Webpage View Page

August 23, 2010

Excerpt: This post was contributed by Jonathan Seidman from... more

Webpage View Page

August 19, 2010

Excerpt: Hadoop Training surrounding Hadoop World: NYC.... more

Webpage View Page

August 17, 2010

Excerpt: Hadoop/HBase Capacity Planning by Alex Kozlov August 17, 2010... more

Webpage View Page

August 17, 2010

Excerpt: Apache Hadoop and Apache HBase are gaining popularity due to their flexibility and tremendous wor... more

Webpage View Page

August 12, 2010

Excerpt: It’s easy to get started with Hadoop administration because Linux system administration is a pretty well... more

Webpage View Page

CDH3b2 Release Recap

Jeff Hammerbacher

August 11, 2010

Excerpt: CDH3b2 Release Recap by Jeff Hammerbacher August 11, 2010 no comments... more

Webpage View Page

August 10, 2010

Excerpt: Cloudera’s Henry Robinson to speak at Hadoop Day in Seattle by Huw Edwards... more

Webpage View Page

August 9, 2010

Excerpt: Hadoop World: early-bird rate ends on August 11 by Huw Edwards August 09... more

Webpage View Page

August 3, 2010

Excerpt: Flume community update – the first 30 days! by phunt August 03, 2010 no c... more

Webpage View Page

Migrating to CDH

Eric Sammer

August 2, 2010

Excerpt: With the recent release of CDH3b2 , many users are more interested than ever to try out Cloudera’s Dist... more

Webpage View Page

July 28, 2010

Excerpt: How to Get a Job at Cloudera by Mike Olson July 28, 2010 no comments... more

Webpage View Page

July 28, 2010

Excerpt: Notes From the Hackathon at Cloudera by Jeff Bean July 28, 2010 no comments... more

Webpage View Page

July 28, 2010

Excerpt: I was positively blown away by the enthusiasm, creativity, and productivity exhibited by the part... more

Webpage View Page

July 28, 2010

Excerpt: Upcoming webinar: 10 Common Hadoop-able Problems by Huw Edwards July 28, 2010 n... more

Webpage View Page

July 28, 2010

Excerpt: Announcing Two New Training Classes from Cloudera: Introduction to HBase and Analyzing Data with Hive and Pig... more

Webpage View Page

July 22, 2010

Excerpt: What’s New in CDH3b2: Hive by Carl Steinbach July 22, 2010 no comments... more

Webpage View Page

July 22, 2010

Excerpt: CDH3 beta 2 includes Apache Hive 0.5.0, the latest v... more

Webpage View Page

July 20, 2010

Excerpt: Developing Applications for HUE by Aaron Newton July 20, 2010 1 comment... more

Webpage View Page

July 20, 2010

Excerpt: Yesterday's post gave an... more

Webpage View Page

July 19, 2010

Excerpt: What’s New in CDH3b2: HUE by bc July 19, 2010 no comments... more

Webpage View Page

July 19, 2010

Excerpt: The HUE (aka. Hadoop User Experience) project [... more

Webpage View Page

July 19, 2010

Excerpt: Rackspace’s OpenStack shows the way for public cloud vendors by Ed Albanese July 1... more

Webpage View Page

July 16, 2010

Excerpt: What’s New in CDH3b2: Sqoop by Aaron Kimball July 16, 2010 no comments... more

Webpage View Page

July 15, 2010

Excerpt: Hacking with Cloudera on CDH by Alex Loddengaard July 15, 2010 no comments... more

Webpage View Page

July 15, 2010

Excerpt: What’s New in CDH3b2: Oozie by Arvind Prabhakar July 15, 2010 no comments... more

Webpage View Page

July 14, 2010

Excerpt: What’s New in CDH3b2: Pig by Carl Steinbach July 14, 2010 no comments... more

Webpage View Page

July 14, 2010

Excerpt: CDH3 beta 2 includes Apache Pig 0.7.0, the latest and... more

Webpage View Page

July 13, 2010

Excerpt: As part of our series of announcements at the recent Hadoop Summit, Cloudera released two of its previously in... more

Webpage View Page

July 12, 2010

Excerpt: CDH3 beta 2 is the first to incorporate Apache ZooKeeper. ZooKeeper is a highly reliable and available coordin... more

Webpage View Page

July 9, 2010

Excerpt: What’s New in CDH3b2: HBase by Todd Lipcon July 09, 2010 no comments... more

Webpage View Page

July 9, 2010

Excerpt: Over the last two years, Cloudera has helped a great number of customers... more

Webpage View Page

July 8, 2010

Excerpt: What’s New in CDH3b2: Core Hadoop by Eli Collins July 08, 2010 no comment... more

Webpage View Page

July 7, 2010

Excerpt: More on Cloudera’s Distribution including Apache Hadoop 3 by Charles Zedlews... more

Webpage View Page

June 29, 2010

Excerpt: CDH3 and Cloudera Enterprise by Mike Olson June 29, 2010 1 com... more

Webpage View Page

June 23, 2010

Excerpt: Are your systems struggling to absorb ever-increasing amounts of data being generated daily? Are you mired in... more

Webpage View Page

June 22, 2010

Excerpt: Cloudera is once again hosting  Hadoop World which will take place in  New York City on  Octo... more

Webpage View Page

June 18, 2010

Excerpt: Will Cloudera be at OSCON this year? Of course, it’s only the premier event for OS technologies on the ma... more

Webpage View Page

June 11, 2010

Excerpt: Integrating Hive and HBase by carl June 11, 2010 no comments... more

Webpage View Page

June 11, 2010

Excerpt: This post was contributed by John Sichi... more

Webpage View Page

June 10, 2010

Excerpt: One word more… by Mike Olson June 10, 2010 no comments... more

Webpage View Page

A transition

Christophe Bisciglia

June 10, 2010

Excerpt: A transition by Christophe Bisciglia June 10, 2010 no comments... more

Webpage View Page

A transition

Christophe Bisciglia

June 10, 2010

Excerpt: For an entrepreneur, it's an incredibly fulfilling experience to start companies and watch them "... more

Webpage View Page

June 4, 2010

Excerpt: A report from the recent UK HUG from Klass Bosteels.... more

Webpage View Page

June 3, 2010

Excerpt: Considerations for Hadoop and BI (part 2 of 2) by Jeff Bean June 03, 2010 no co... more

Webpage View Page

June 3, 2010

Excerpt: Just today we heard another question about integrating Apache Hadoop with Business Intelligence t... more

Webpage View Page

June 1, 2010

Excerpt: The second Apache Hadoop HDFS and MapReduce contributors meeting was held last Friday, May 28 at ClouderaR... more

Webpage View Page

May 25, 2010

Excerpt: Here at Cloudera we have deep knowledge and experience working with Hadoop and related technologies to so... more

Webpage View Page

May 21, 2010

Excerpt: Considerations for Hadoop and BI (part 1 of 2) by Jeff Bean May 21, 2010 no com... more

Webpage View Page

May 21, 2010

Excerpt: We recently met with a customer at... more

Webpage View Page

May 21, 2010

Excerpt: CDH2 Update 1 Now Available by Eli Collins May 21, 2010 no comments... more

Webpage View Page

May 10, 2010

Excerpt: What to Do with Extra Space? by bc May 10, 2010 no comments... more

Webpage View Page

May 7, 2010

Excerpt: Highlights from the First Hadoop Contributors Meeting by Eli Collins May 07, 2010... more

Webpage View Page

May 7, 2010

Excerpt: While the vast majority of the Hadoop development discussion takes place on... more

Webpage View Page

April 30, 2010

Excerpt: Around the globe, more and more companies are turning to Hadoop to tackle data processing problems that don... more

Webpage View Page

April 26, 2010

Excerpt: CAP Confusion: Problems with ‘partition tolerance’ by Henry Robinson April... more

Webpage View Page

April 21, 2010

Excerpt: Get Hadoop Training from Cloudera at the Hadoop Summit by John Kreisa April 21, 2010... more

Webpage View Page

April 13, 2010

Excerpt: Cloudera Hadoop Training Spreads Worldwide by John Kreisa April 13, 2010 no com... more

Webpage View Page

April 12, 2010

Excerpt: Cloudera Has Moved! by John Kreisa April 12, 2010 1 comment... more

Webpage View Page

April 5, 2010

Excerpt: Scaling Social Science with Hadoop by Ed Albanese April 05, 2010 12 comments... more

Webpage View Page

April 5, 2010

Excerpt: This post was contributed by researcher Scott Golder, who... more

Webpage View Page

April 1, 2010

Excerpt: Pushing the Limits of Distributed Processing by omer April 01, 2010 no comments... more

Webpage View Page

March 30, 2010

Excerpt: Cloudera’s Support Team Shares Some Basic Hardware Recommendations by Alex Loddengaard... more

Webpage View Page

March 24, 2010

Excerpt: It’s official – Cloudera’s Distribution for Hadoop Version 2, which we often shorthand as C... more

Webpage View Page

March 24, 2010

Excerpt: It's official - Cloudera's Distribution for Hadoop Version 2, which we ofte... more

Webpage View Page

CDH2 is released

Chad Metcalf

March 24, 2010

Excerpt: We’re proud to announce that Cloudera’s Distribution for Hadoop Version 2 (CDH2) is officially re... more

Webpage View Page

March 22, 2010

Excerpt: How Raytheon BBN Technologies Researchers are Using Hadoop to Build a Scalable, Distributed Triple Store... more

Webpage View Page

March 18, 2010

Excerpt: HBase User Group #9: HBase and HDFS by Todd Lipcon March 18, 2010 no comments... more

Webpage View Page

March 16, 2010

Excerpt: Natural Language Processing with Hadoop and Python by Ed Albanese March 16, 2010... more

Webpage View Page

March 16, 2010

Excerpt: This blog was co-written by Nitin Madnani... more

Webpage View Page

March 10, 2010

Excerpt: Richard Hutton , CTO of nugg.ad , authored the following post about how and why his company uses Hadoop. n... more

Webpage View Page

March 10, 2010

Excerpt: Richard Hutton, CTO of... more

Webpage View Page

March 3, 2010

Excerpt: Trip Report: Utah Java User’s Group by Philip Zeyliger March 03, 2010 no... more

Webpage View Page

Avro 1.3.0

Matt Massie

March 1, 2010

Excerpt: Avro 1.3.0 by Matt Massie March 01, 2010 no comments Avro... more

Webpage View Page

March 1, 2010

Excerpt: Apache Avro was added the to... more

Webpage View Page

February 22, 2010

Excerpt: Cloudera’s Hadoop Training Programs Expand Internationally by Christophe Bisciglia... more

Webpage View Page

February 22, 2010

Excerpt: It's been over a year now since we started offering Hadoop training in the Bay Area, and since th... more

Webpage View Page

February 18, 2010

Excerpt: CDH2: “Testing” Heading Towards “Stable” by Chad Metcalf Februa... more

Webpage View Page

January 19, 2010

Excerpt: Cloudera speaks VMware vCloud API, too. by Mike Olson January 19, 2010 no comme... more

Webpage View Page

January 11, 2010

Excerpt: Hadoop World: Building Data Intensive Apps with Hadoop and EC2 by ed January 11, 2010... more

Webpage View Page

December 23, 2009

Excerpt: Hadoop World: Making Hadoop Easy on Amazon Web Services by Christophe Bisciglia Decembe... more

Webpage View Page

December 22, 2009

Excerpt: Hadoop World: Hadoop Applications at Yahoo! by Christophe Bisciglia December 22, 2009... more

Webpage View Page

December 17, 2009

Excerpt: 7 Tips for Improving MapReduce Performance by Todd Lipcon December 17, 2009 no... more

Webpage View Page

December 15, 2009

Excerpt: Observers: Making ZooKeeper Scale Even Further by Henry Robinson December 15, 2009... more

Webpage View Page

December 10, 2009

Excerpt: Hadoop World: Sqoop – Database Import for Hadoop by Christophe Bisciglia December... more

Webpage View Page

December 8, 2009

Excerpt: Hadoop World: Security and API Compatibility by Christophe Bisciglia December 08, 2009... more

Webpage View Page

December 8, 2009

Excerpt: Today's Hadoop World talk comes from Owen O'Malley and talks about some of the biggest challenges fa... more

Webpage View Page

December 2, 2009

Excerpt: Hadoop World: Hadoop for Bioinformatics by Christophe Bisciglia December 02, 2009... more

Webpage View Page

November 25, 2009

Excerpt: Hadoop World: Practical HBase from Jonathan Gray and Ryan Rawson by Alex Loddengaard No... more

Webpage View Page

November 23, 2009

Excerpt: Hadoop World: Hadoop + Vertica from Omer Trajman by Alex Loddengaard November 23, 2009... more

Webpage View Page

November 20, 2009

Excerpt: Hadoop World: Hadoop + Clojure from Stuart Sierra and Tim Dysinger by Alex Loddengaard... more

Webpage View Page

November 19, 2009

Excerpt: Hadoop World: Protein Alignment from Paul Brown by Alex Loddengaard November 19, 2009... more

Webpage View Page

November 17, 2009

Excerpt: Hadoop at Twitter (part 1): Splittable LZO Compression by Matt Massie November 17, 2009... more

Webpage View Page

November 11, 2009

Excerpt: Hadoop World: Rethinking the Data Warehouse with Hadoop and Hive from Ashish Thusoo by Christop... more

Webpage View Page

November 9, 2009

Excerpt: Today’s Hadoop World video comes from Ed Capriolo, and goes into details about how to effectively monito... more

Webpage View Page

November 2, 2009

Excerpt: Avro is a recent addition to Apache's Hadoop family of projects. Avro defines a data format designed to supp... more

Webpage View Page

November 2, 2009

Excerpt: Apache Avro is a recent addition to Apache's... more

Webpage View Page

October 29, 2009

Excerpt: Hadoop World: NYC – Let the Videos Roll by Christophe Bisciglia October 29, 2009... more

Webpage View Page

October 21, 2009

Excerpt: Around the world, individuals contribute to Hadoop and build community around the technology. This kind of col... more

Webpage View Page

October 19, 2009

Excerpt: Cloudera Desktop and MooTools by Aaron Newton October 19, 2009 7 comments... more

Webpage View Page

October 15, 2009

Excerpt: Analyzing Human Genomes with Hadoop by Christophe Bisciglia October 15, 2009 4... more

Webpage View Page

October 15, 2009

Excerpt: Every day, we hear about people doing amazing things with Apache Hadoop. The va... more

Webpage View Page

October 1, 2009

Excerpt: Today at Hadoop World NYC , we’re announcing the availability of Cloudera Desktop ,  a unified an... more

Webpage View Page

September 30, 2009

Excerpt: At the beginning of September, we announced the first release of CDH2 , our current testing repository. Pac... more

Webpage View Page

September 29, 2009

Excerpt: One of the more common requests we receive from the community is to package HBase with Cloudera’s Distri... more

Webpage View Page

September 29, 2009

Excerpt: One of the more common requests we receive from the community is to package Apa... more

Webpage View Page

September 28, 2009

Excerpt: Grouping Related Trends with Hadoop and Hive by Amr Awadallah September 28, 2009... more

Webpage View Page

September 15, 2009

Excerpt: Apache Hadoop Log Files: Where to find them in CDH, and what info they contain by Alex Loddenga... more

Webpage View Page

September 10, 2009

Excerpt: In March of this year, we released our distribution for Hadoop.  Our initial focus was on stability and m... more

Webpage View Page

September 10, 2009

Excerpt: In March of this year, we released our distribution for Apache Hadoop.  Our initial focus was on... more

Webpage View Page

September 9, 2009

Excerpt: It’s been a crazy few weeks here at Cloudera, and while there is no sign of things letting up before Ha... more

Webpage View Page

Hadoop World: NYC 2009

Christophe Bisciglia

August 19, 2009

Excerpt: To say we were surprised by the quality and quantity of submissions we received for Hadoop World: NYC 2009... more

Webpage View Page

August 14, 2009

Excerpt: Hadoop Default Ports Quick Reference by Philip Zeyliger August 14, 2009... more

Webpage View Page

August 14, 2009

Excerpt: Editor's note (Oct. 3, 2013): The information below is now deprecated. W... more

Webpage View Page

August 10, 2009

Excerpt: Back in October, I promised to keep marketing and sales out of this blog. We wanted to concentrate on techni... more

Webpage View Page

July 31, 2009

Excerpt: Tracking Trends with Hadoop and Hive on EC2 by Amr Awadallah July 31, 2009 8 co... more

Webpage View Page

July 29, 2009

Excerpt: As Hadoop adoption increases among organizations, companies, and individuals, and as it makes its way into pro... more

Webpage View Page

July 27, 2009

Excerpt: Cloudera’s Training VM is one of the most popular resources on our website. It was created with VMware W... more

Webpage View Page

July 27, 2009

Excerpt: Update (May 1 2013): The post below, which is based on an outdated VM, is deprecated. Rat... more

Webpage View Page

Hadoop HA Configuration

Christophe Bisciglia

July 22, 2009

Excerpt: One of the things we get a lot of questions about is how to make Hadoop highly available. There is still a lot... more

Webpage View Page

July 22, 2009

Excerpt: Disclaimer: Cloudera no longer approves of the recommendations in this post. Ple... more

Webpage View Page

The Project Split

Aaron Kimball

July 17, 2009

Excerpt: Last Wednesday, we hosted a Hadoop meetup, and I gave a short talk about the new project split. How does the s... more

Webpage View Page

July 17, 2009

Excerpt: There is some confusion about the state of the file append operation in HDFS. It was in, now it’s out. W... more

Webpage View Page

Hadoop Graphing with Cacti

Christophe Bisciglia

July 7, 2009

Excerpt: An important part of making sure Hadoop works well for all users is developing and maintaining strong relation... more

Webpage View Page

Hadoop Graphing with Cacti

Christophe Bisciglia

July 7, 2009

Excerpt: An important part of making sure Apache Hadoop works well for all users is deve... more

Webpage View Page

July 3, 2009

Excerpt: The distributed nature of MapReduce programs makes debugging a challenge. Attaching a debugger to a remote pro... more

Webpage View Page

June 30, 2009

Excerpt: Hadoop moves fast. Users often find that they need to upgrade after just a few months. Upgrading can be a daun... more

Webpage View Page

June 30, 2009

Excerpt: Apache Hadoop moves fast. Users often find that they need to upgrade after ju... more

Webpage View Page

June 24, 2009

Excerpt: Yesterday, Chris Goffinet from Digg made a great blog post about LZO and Hadoop. Many users have been frustr... more

Webpage View Page

June 24, 2009

Excerpt: Yesterday, Chris Goffinet from Digg made a great... more

Webpage View Page

June 22, 2009

Excerpt: On June 10th, more than 750 people from around the world descended on the Santa Clara Marriott to share their... more

Webpage View Page

June 22, 2009

Excerpt: On June 10th, more than 750 people from around the world descended on the Santa Clara Marriott to... more

Webpage View Page

June 17, 2009

Excerpt: Analyzing Apache logs with Pig by Amr Awadallah June 17, 2009 5 comments... more

Webpage View Page

June 17, 2009

Excerpt: (guest blog post by Dmitriy Rya... more

Webpage View Page

June 2, 2009

Excerpt: For the last few months, we’ve been working with the TVA to help them manage hundreds of TB of data from... more

Webpage View Page

Introducing Sqoop

Aaron Kimball

June 1, 2009

Excerpt: In addition to providing you with a dependable release of Hadoop that is easy to configure , at Cloudera we... more

Webpage View Page

May 29, 2009

Excerpt: A few months ago we announced the Cloudera Distribution for Hadoop .  We’re happy to report that l... more

Webpage View Page

May 29, 2009

Excerpt: A few months ago we announced the Cloudera Distribution... more

Webpage View Page

May 28, 2009

Excerpt: In my first few weeks here at Cloudera , I’ve been tasked with helping out with the Apache ZooKeeper... more

Webpage View Page

May 28, 2009

Excerpt: As Hadoop continues to turn heads at startups and big enterprises alike, Cloudera has received several request... more

Webpage View Page

May 28, 2009

Excerpt: As Apache Hadoop continues to turn heads at startups and big enterprises alike, Cloudera has rece... more

Webpage View Page

May 27, 2009

Excerpt: Lately, we’ve been spending a lot of time on the East Coast, and one thing is clear: Hadoop is everywher... more

Webpage View Page

May 22, 2009

Excerpt: Administrators of HDFS clusters understand that the HDFS metadata is some of the most precious bits they have.... more

Webpage View Page

May 22, 2009

Excerpt: Administrators of HDFS clusters understand that the HDFS metadata is some of the most precious... more

Webpage View Page

May 18, 2009

Excerpt: This piece is based on the talk “Practical MapReduce” that I gave at Hadoop User Group UK on April... more

Webpage View Page

May 14, 2009

Excerpt: 5 Common Questions About Hadoop by Christophe Bisciglia May 14, 2009 11 comment... more

Webpage View Page

May 14, 2009

Excerpt: There’s been a lot of buzz about Apache Hadoop lately. Just the other day, some of our friends... more

Webpage View Page

May 11, 2009

Excerpt: A while back, we noticed a blog post From Arun Jacob over at Evri (if you haven’t seen Evri before,... more

Webpage View Page

May 7, 2009

Excerpt: What’s New in Hadoop Core 0.20 by Tom White May 07, 2009... more

Webpage View Page

May 1, 2009

Excerpt: We asked Brian Bockelman, a Post Doc Research Associate in the Computer Science & Engineering Depar... more

Webpage View Page

April 27, 2009

Excerpt: When we announced Cloudera’s Distribution for Hadoop last month, we asked the community to give us fe... more

Webpage View Page

April 27, 2009

Excerpt: When we announced Cloudera's Distribution for Apache Had... more

Webpage View Page

April 23, 2009

Excerpt: Today I did a web search for “pig training” using my favorite search engine. I was wildly entertai... more

Webpage View Page

April 23, 2009

Excerpt: Today I did a web search for "pig training" using my favorite search engine. I was wildly enterta... more

Webpage View Page

April 22, 2009

Excerpt: Welcome to the first guest post on the Cloudera blog. The other day, we saw Toby from  Swingly tweet... more

Webpage View Page

April 22, 2009

Excerpt: Welcome to the first guest post on the Cloudera blog. The other day, we saw Toby from ... more

Webpage View Page

April 21, 2009

Excerpt: Last Tuesday – on my second day of work at Cloudera – I went to London to check out the second UK... more

Webpage View Page

April 20, 2009

Excerpt: One of the perks of using Java is the availability of functional, cross-platform IDEs.  I use vim for m... more

Webpage View Page

April 20, 2009

Excerpt: Update (added 5/15/2013): The information below is dated; see... more

Webpage View Page

April 15, 2009

Excerpt: In the process of working on a few things here I wanted to add some links to launch Hive and the Hadoop Jobt... more

Webpage View Page

April 15, 2009

Excerpt: In the process of working on a few things here I wanted to add some links to launch... more

Webpage View Page

April 9, 2009

Excerpt: A few weeks ago we announced Cloudera’s Distribution for Hadoop , and I want to spend some time showing... more

Webpage View Page

April 9, 2009

Excerpt: A few weeks ago we announced... more

Webpage View Page

April 3, 2009

Excerpt: Upcoming Functionality in “Fair Scheduler 2.0″ by Amr Awadallah April 03, 2... more

Webpage View Page

March 30, 2009

Excerpt: Configuring a Hadoop cluster is something akin to voodoo. There are a large number of variables in hadoop-def... more

Webpage View Page

March 15, 2009

Excerpt: One of the repeating themes we have heard while working with our customers and the community is that Hadoop co... more

Webpage View Page

March 15, 2009

Excerpt: One of the repeating themes we have heard while working with our customers and the community is t... more

Webpage View Page

March 13, 2009

Excerpt: Exciting news: We’re providing our basic hadoop training for free online . We’ll still... more

Webpage View Page

Hadoop Metrics

Philip Zeyliger

March 12, 2009

Excerpt: Hadoop’s NameNode, SecondaryNameNode, DataNode, JobTracker, and TaskTracker daemons all expose runtime m... more

Webpage View Page

March 6, 2009

Excerpt: Hadoop’s strength is that it enables ad-hoc analysis of unstructured or semi-structured data. Relational... more

Webpage View Page

March 6, 2009

Excerpt: Editor's note (added Nov. 9. 2013): Valuable data in an organization is often stored in relat... more

Webpage View Page

February 10, 2009

Excerpt: You might think that the SecondaryNameNode is a hot backup daemon for the NameNode. You’d be wrong. The... more

Webpage View Page

February 2, 2009

Excerpt: Small files are a big problem in Hadoop — or, at least, they are if the number of questions on the user... more

Webpage View Page

January 14, 2009

Excerpt: HDFS Reliability by Tom White January 14, 2009 4 comments... more

Webpage View Page

January 5, 2009

Excerpt: It’s a new year, the time when we take a moment to look back at the previous one, and forward to what mi... more

Webpage View Page

December 31, 2008

Excerpt: The first release (0.19.0) from the 0.19 branch of Hadoop Core was made on November 24. Many changes go into... more

Webpage View Page

December 31, 2008

Excerpt: The first release (0.19.0) from the 0.19 branch of Apache ... more

Webpage View Page

December 16, 2008

Excerpt: As a developer coming to Hadoop it is important to understand how testing is organized in the project. For the... more

Webpage View Page

December 16, 2008

Excerpt: As a developer coming to Apache Hadoop it is important to understand how testing is organized in... more

Webpage View Page

December 3, 2008

Excerpt: A few weeks ago we ran a Hadoop hackathon. ApacheCon participants were invited to use our 10-node Hadoop clust... more

Webpage View Page

December 3, 2008

Excerpt: (Added 6/4/2013) Please note the instructions below are deprecated. Please refer to the... more

Webpage View Page

November 23, 2008

Excerpt: Job Scheduling in Hadoop by Amr Awadallah November 23, 2008 3 comments... more

Webpage View Page

November 23, 2008

Excerpt: (guest blog post by... more

Webpage View Page

November 18, 2008

Excerpt: Introducing Hadoop Development Status by Alex Loddengaard November 18, 2008 no... more

Webpage View Page

November 14, 2008

Excerpt: It is common for a MapReduce program to require one or more files to be read by each map or reduce task before... more

Webpage View Page

November 2, 2008

Excerpt: As promised in my post about installing Scribe for log collection , I’m going to cover how to configure... more

Webpage View Page

October 28, 2008

Excerpt: Scribe is a newly released log collection tool that dumps log files from various nodes in a cluster to Scri... more

Webpage View Page

October 24, 2008

Excerpt: Apache Hadoop exists within a rich ecosystem of tools for processing and analyzing large data sets. At Facebo... more

Webpage View Page

October 23, 2008

Excerpt: We’ve created this blog as a place to post tips, tricks and insights on using Hadoop and related project... more

Webpage View Page