Tag Archives: impala

See You at Data Science Day (Nov. 29, New York)!

Categories: Data Science Impala

[Updated Nov. 26, 2012: Sorry, this event has reached capacity and is now closed.]

Please join us in New York on Nov. 29, 2012, for a unique opportunity to hear from industry icons Jeff Hammerbacher (@hackingdata), Amr Awadallah (@awadallah) and Josh Wills (@josh_wills) as they discuss their approach to Data Science and how it transformed business for companies like Facebook, Yahoo! and Google. You will also hear more about Cloudera Enterprise: The Platform for Big Data powered by Cloudera Impala,

Read more

How to Get Rich on Big Data

Categories: General

The 2012 Strata + Hadoop World conference was week before last in New York City. Cloudera co-presented the conference with O’Reilly Media this year, and we were really pleased with how the event turned out. Of course we launched Cloudera Impala, but there was a ton of news from companies across the Apache Hadoop ecosystem. Andrew Brust over at ZDNet wins the prize for comprehensive coverage of all the announcements.

Read more

The New "Hadoop in Practice" Book: A Chat with The Author

Categories: Books Hadoop Hive MapReduce Pig

Today we bring you a brief interview with Alex Holmes, author of the new book, Hadoop in Practice (Manning). You can learn more about the book and download a free sample chapter here.

There are a few good Hadoop books on the market right now. Why did you decide to write this book, and how is it complementary to them?
When I started working with Hadoop I leaned heavily on Tom White’s excellent book,

Read more

Cloudera Manager 4.1 Now Available; Supports Impala Beta Release

Categories: CDH Cloudera Manager Impala Ops and DevOps

I am very pleased to announce the availability of Cloudera Manager 4.1. This release adds support for the Cloudera Impala beta release, and management and monitoring of key CDH features.

Here are the highlights of Cloudera Manager 4.1:

  • Support for Quorum-based Storage HDFS High Availability
  • Cloudera Impala management and monitoring
  • Flume NG management and monitoring
  • ZooKeeper monitoring
  • Directory disk-space monitoring
  • Host decommissioning
  • Reduced monitoring latency
  • Maintenance mode
  • Several usability,

Read more

Cloudera Impala: Real-Time Queries in Apache Hadoop, For Real

Categories: CDH HBase Hive Impala

After a long period of intense engineering effort and user feedback, we are very pleased, and proud, to announce the Cloudera Impala project. This technology is a revolutionary one for Hadoop users, and we do not take that claim lightly.

When Google published its Dremel paper in 2010, we were as inspired as the rest of the community by the technical vision to bring real-time, ad hoc query capability to Apache Hadoop,

Read more