This summer I sent the following tweet: “Had lunch today at Twitter HQ. Thanks for the invite, @kevinweil! Great lunch conversation. Smart, friendly and fun team.” Kevin Weil leads the analytics team at Twitter and is an active member of the Hadoop community, and his colleague Eric Maland leads Operations. Needless to say, Twitter is doing amazing things with Hadoop. This guest blog from Kevin and Eric covers one of Twitter’s open-source projects, which provides splittable LZO compression for Hadoop.
Today’s Hadoop World talk comes from Ashish Thusoo at Facebook and goes into detail about how Facebook uses Hadoop and Hive to expose massive volumes of data to internal users who are familiar with traditional data warehousing tools. Thanks Ashish, and stay tuned for more!
Today’s Hadoop World video comes from Ed Capriolo and goes into detail about how to effectively monitor Hadoop in production environments. Thanks Ed, and stay tuned for more!
Avro is a recent addition to Apache’s Hadoop family of projects. Avro defines a data format designed to support data-intensive applications and provides support for this format in a variety of programming languages.
It has been almost a month since Hadoop World: NYC, and things are just starting to get back to normal here at Cloudera HQ. We were thrilled to see over 500 Apache Hadoop enthusiasts descend upon New York City for the first major Hadoop event on the East Coast. The variety of applications, and the number of companies involved, were mind-boggling. For those of you who weren’t able to join us, we hope to see you at another event soon!