Cloudera at Strata + Hadoop World 2013

Categories: Community Events Hadoop

Strata Conference + Hadoop World 2013 is looming on the horizon and pacing to be the largest gathering of Big Data professionals on the globe. As co-hosts with O’Reilly, we have seen the conference thrive, grow, and are excited about the upcoming Oct. 28 – 30 event!


Below you will find a listing of all the ways you can engage with Cloudera throughout the conference (all speakers are Cloudera employees unless otherwise indicated):


  • Cloudera co-founder and Chief Strategy Officer Mike Olson will explain how Apache Hadoop and Cloudera are transforming the way organizations think about their data.


  • How to Build a Hadoop Data Application
    Mon., Oct. 28, 9am – Grand Ballroom West
    Tom White, Eric Sammer, Joey Echeverria
    In this tutorial, you’ll use the Cloudera Development Kit (CDK) to build a Java web app that logs application events to Hadoop, and then run ad hoc and scheduled queries against the collected data. 

Breakout Sessions

  • From Promise to a Platform: Next Steps in Bringing Workload Diversity to Hadoop
    Tues., Oct. 29, 11:50am – Grand Ballroom East
    Henry Robinson 
    In this session, Henry presents Cloudera’s vision and implementation for generalized resource management on Hadoop, suitable for all uses. 
  • Parquet: An Open Columnar Storage for Hadoop
    Tues., Oct. 29, 1:45pm – Gramercy Suite
    Julien Le Dem (Twitter), Nong Li 
    Parquet is a columnar file format for Hadoop that brings performance and storage benefits. Here, two of its architects discuss Parquet’s design and share performance numbers.
  • What’s Next for Apache HBase: Multi-tenancy, Predictability, and Extensions.
    Tues., Oct. 29, 2:35pm – Grand Ballroom East
    Jonathan Hsieh
    This talk will describe themes emerging from recent features slated for the upcoming post-0.96 release. These include improvements for multi-tenant deployments; a focus on predictable latencies; and the proliferation of new extensions for features traditionally from databases. 
  • Securing the Apache Hadoop Ecosystem
    Tues., Oct. 29, 4:15pm – Grand Ballroom East
    Aaron Myers, Shreepadma Venugopalan
    This session covers how various parts of the Hadoop ecosystem can interact in a secure way to address enterprise security requirements. It will focus on the advanced authorization features enabled by the Apache Sentry (incubating) project. 
  • Working with Geospatial Data Using Hadoop and HBase and How Monsanto Used It to Help Farmers Increase Their Yield
    Tues., Oct. 29, 5:05pm – Sutton Center/Sutton South
    Erich Hochmuth (Monsanto), Amandeep Khurana
    Monsanto is building new technology driven products for its customers that will leverage big data. This talk describes how Monsanto is building these scalable applications with geospatial data, using Hadoop and HBase as the backend systems. 
  • Unifying Your Data Management Platform with Hadoop: Batch and Real-time Machine Data Ingest, Alerts, and Analytics
    Weds., Oct. 30, 11:50am – Grand Ballroom East
    Jayant Shekhar 
    Today Hadoop serves as a unified platform for near-real-time (NRT) and batch workflows. In this session, dive into the details of using SolrCloud and Cloudera Impala together to serve search queries, by integrating Flume to stream events into Solr, Impala and HBase. 
  • Practical Performance Analysis and Tuning for Cloudera Impala
    Weds., Oct. 30, 2:35pm – Murray Hill Suite
    Greg Rahn 
    Impala brings SQL to Hadoop, but it also brings SQL performance tuning to those using the platform. This session will cover Impala performance analysis topics to aid in answering the question, “Why is my query slow?”, as well as practical tips and techniques to get the best performance from Impala. 
  • Trickery and Tooling for Distributed System Diagnosis and Debugging
    Weds., Oct. 30, 5pm – Murray Hill Suite
    Philip Zeyliger
    All is quiet on the log file front, but yet the system is down. What next? This talk covers the tricks of the trade for debugging distributed systems. Motivated by experience gained diagnosing Hadoop, we’ll dig into the JVM, Linux esoterica, and outlier visualization. 

Meetups During the Week

  • Evening meetups are planned for Impala, Cloudera Manager, Apache Hive, Apache Sqoop, Apache HBase, and Apache Flume, and Cloudera will help support the NYC Hadoop Users for its traditional pre-Hadoop World meeting. Stay tuned for more details about dates, times, and locations.

Cloudera Exhibition

  • Visit Booth 403 in the sponsor pavilion to learn more about how our Platform for Big Data empowers enterprises to Ask Bigger Questions of all their data!
  • You can also see demos of Cloudera Manager and of Hue, the open source Web UI that makes Hadoop easier to use.
  • Don’t forget your Cloudera Tshirt!

We look forward to seeing you at Strata Conference + Hadoop World 2013! Register with the code CLOUDERA and receive 20% off admission.