Author Archives: Shreepadma Venugopalan

Column Statistics in Apache Hive

Categories: Hive

Over the last couple of months the Hive team at Cloudera has been working hard to bring a bunch of exciting new features to Apache Hive. In this blog post, I’m going to talk about one such feature – Column Statistics in Hive – and how Hive’s query processing engine can benefit from it. The feature is currently a work in progress but we expect it to be available for review imminently.


While there are many possible execution plans for a query,

Read more