Category Archives: Impala

Cloudera Impala Beta (version 0.2) and Cloudera Manager 4.1.1 Now Available

Categories: Cloudera Manager Impala

I am pleased to announce the release of Cloudera Impala Beta (version 0.2) and Cloudera Manager 4.1.1. These are both enhancement releases to make bug fixes available quickly. Key enhancements in each release are:

Cloudera Impala Beta (version 0.2)

  • Bug fixes
  • Impala Beta will be regularly updated with features, bug fixes, and performance enhancements. We will typically release such updates every 2 weeks. Please check the release notes to find out what’s new with each update.

Read More

See You at Data Science Day (Nov. 29, New York)!

Categories: Data Science Impala

[Updated Nov. 26, 2012: Sorry, this event has reached capacity and is now closed.]

Please join us in New York on Nov. 29, 2012, for a unique opportunity to hear from industry icons Jeff Hammerbacher (@hackingdata), Amr Awadallah (@awadallah) and Josh Wills (@josh_wills) as they discuss their approach to Data Science and how it transformed business for companies like Facebook, Yahoo! and Google. You will also hear more about Cloudera Enterprise: The Platform for Big Data powered by Cloudera Impala,

Read More

Cloudera Manager 4.1 Now Available; Supports Impala Beta Release

Categories: CDH Cloudera Manager Impala Ops and DevOps

I am very pleased to announce the availability of Cloudera Manager 4.1. This release adds support for the Cloudera Impala beta release, and management and monitoring of key CDH features.

Here are the highlights of Cloudera Manager 4.1:

  • Support for Quorum-based Storage HDFS High Availability
  • Cloudera Impala management and monitoring
  • Flume NG management and monitoring
  • ZooKeeper monitoring
  • Directory disk-space monitoring
  • Host decommissioning
  • Reduced monitoring latency
  • Maintenance mode
  • Several usability,

Read More

Cloudera Impala: Real-Time Queries in Apache Hadoop, For Real

Categories: CDH HBase Hive Impala

After a long period of intense engineering effort and user feedback, we are very pleased, and proud, to announce the Cloudera Impala project. This technology is a revolutionary one for Hadoop users, and we do not take that claim lightly.

When Google published its Dremel paper in 2010, we were as inspired as the rest of the community by the technical vision to bring real-time, ad hoc query capability to Apache Hadoop,

Read More

Cloudera, The Platform for Big Data

Categories: CDH Hadoop Impala

Today we’re proud to announce a new addition to the Apache Hadoop ecosystem: Cloudera Impala, a parallel SQL engine that runs natively on Hadoop storage. The salient points are:

  • Hive compatible
  • 10x the performance of Hive/MapReduce, on average
  • 100% open source, under the Apache License v2 – just like Hadoop
  • Tested to run on CDH4.1 or higher

There’s a blog post that follows mine that provides more details about Impala and how it works.

Read More