Category Archives: Impala

Meet the Engineer: Marcel Kornacker

Categories: Impala, Meet the Engineer

In this installment of “Meet the Engineer”, meet Marcel Kornacker, the architect of the Cloudera Impala open-source real-time query engine for Apache Hadoop.

What do you do at Cloudera?

I’m a tech lead at Cloudera, working on the Cloudera Impala team. And although it’s not in my formal title, I’m also the architect of Impala. What that means in practice is that I have the very enviable but demanding job of not only creating Impala requirements,

Read more

External Hands-on Experiences with Cloudera Impala

Categories: Impala

The beta release of Cloudera Impala, the first (and open-source) real-time query engine for Apache Hadoop, has been out in the wild (in both binary and VM forms) for over a month now, and users have had time to get up close and hands-on. Consequently, we’re beginning to see some fascinating self-published observations and guides.

Here are just a few examples; you may know of more that we’ve missed:

Read more

Cloudera Impala: Real-Time Queries in Apache Hadoop, For Real

Categories: CDH, HBase, Hive, Impala

After a long period of intense engineering effort and user feedback, we are very pleased, and proud, to announce the Cloudera Impala project. This technology is a revolutionary one for Hadoop users, and we do not take that claim lightly.

When Google published its Dremel paper in 2010, we were as inspired as the rest of the community by the technical vision to bring real-time, ad hoc query capability to Apache Hadoop,

Read more