Apache Impala and Apache Kudu make a great combination for real-time analytics on streaming data for time series and real-time data warehousing use cases. More than 200 Cloudera customers have implemented Apache Kudu with Apache Spark for ingestion and Apache Impala for real-time BI use cases successfully over the last decade, with thousands of nodes […]
I recently presented a How-tos for Gurus series session on data modeling for big data systems. During the presentation, a number of attendees asked some very interesting questions. As many of you know, big data systems are known to have less formality around the need for structure, yet for data warehouses to continue to serve […]