Category Archives: Hadoop

What’s Next for Impala: Focus on Advanced SQL Functionality

Categories: Hadoop Impala

Impala 2.0 will add much more complete SQL functionality to what is already the fastest SQL-on-Hadoop solution available.

In September 2013, we provided a roadmap for Impala — the open source MPP SQL query engine for Apache Hadoop, which was on release 1.1 at the time — that documented planned functionality through release 2.0 and beyond.

Impala is now on release 1.4, with many major features delivered since our previous roadmap update,

Read More

Big Data Benchmarks: Toward Real-Life Use Cases

Categories: Guest Hadoop Ops and DevOps Performance

The Transaction Processing Council (TPC), working with Cloudera, recently announced the new TPCx-HS benchmark, a good first step toward providing a Big Data benchmark.

In this interview by Roberto Zicari with Francois Raab, the original author of the TPC-C Benchmark, and Yanpei Chen, a Performance Engineer at Cloudera, the interviewees share their thoughts on the next step for benchmarks that reflect real-world use cases.

This interview was originally published at ODBMS.org;

Read More

New in CDH 5.1: HDFS Read Caching

Categories: CDH Hadoop HDFS Impala Performance

Applications using HDFS, such as Impala, will be able to read data up to 59x faster thanks to this new feature.

Server memory capacity and bandwidth have increased dramatically over the last few years. Beefier servers make in-memory computation quite attractive, since a lot of interesting data sets can fit into cluster memory, and memory is orders of magnitude faster than disk.

For the latest release of CDH 5.1,

Read More

Progress Report: Cloudera Community Forums After One Year

Categories: Community Hadoop

Cloudera Community forums are proving their value as an important contributor to a rich user experience.

It’s been almost exactly one year since the debut of the Cloudera Community forums. In addition to doing the birthday shout-out, I thought it would be interesting to bring you up to date about adoption and usage patterns.

Launched in response to candid feedback from our customers, use of these forums has been steadily growing,

Read More