Author Archives: John Kreisa

New Features in Apache Pig 0.8

Categories: Pig

This is a guest post contributed by Dmitriy Ryaboy (@squarecog) and was originally published on his blog on December 19th. We thought the information was valuable enough to repost and spread the word even further.

The Pig 0.8 release includes a large number of bug fixes and optimizations, but at the core it is a feature release. It’s been in the works for almost a full year (most of the work on 0.7 was completed by January of 2009,

Read More

Improving Hotel Search: Apache Hadoop @ Orbitz Worldwide

Categories: General Guest Hadoop Hive

This post was contributed by Jonathan Seidman from Orbitz. Jonathan is a Lead Engineer on the Intelligent Marketplace/Machine Learning team at Orbitz Worldwide. You can hear more from Jonathan at Hadoop World on October 12th in NYC.

Orbitz Worldwide (NYSE:OWW) is composed of a global portfolio of online consumer travel brands including Orbitz, Cheaptickets, The Away Network, ebookers and HotelClub. Additionally, the company operates a business-to-business service: Orbitz Worldwide Distribution provides third parties such as Amtrak,

Read More

Announcing Two New Training Classes from Cloudera: Introduction to HBase and Analyzing Data with Hive and Pig

Categories: General Hadoop HBase Hive Pig

Cloudera is pleased to announce two new training courses: a one-day Introduction to HBase and a two-day session on Analyzing Data with Hive and Pig. These join a recently expanded two-day Hadoop for Administrators course and our popular three-day Hadoop for Developers offering, any of which can be combined to provide extensive, customized training for your organization. Please contact us for more information regarding on-site training, or visit our website to view our public course schedule.

Read More

Upcoming Webinars From Cloudera

Categories: General Hadoop

Here at Cloudera we have deep knowledge and experience working with Hadoop and related technologies to solve a wide range of data challenges that organizations are facing today. Lately we’ve seen booming interest in tapping into that knowledge to gain a better understanding of the technology and to improve Hadoop deployments.

We want to nurture the broader Hadoop community, so we are offering two free Hadoop-focused webinars to share some of our insights.

The first webinar is designed for technical audiences and is called “

Read More