Tag Archives: apache whirr

Meet the Engineer: Andrei Savu

Categories: Cloud Cloudera Manager Meet the Engineer

In this installment of “Meet the Engineer,” our subject is Andrei Savu!

What do you do at Cloudera?

At Cloudera I work on cloud deployment automation and general platform improvements to make sure everything runs smoothly on elastic infrastructure when using various managed services. My team builds on top of Cloudera Manager and we integrate with different cloud provider APIs to provision production Cloudera Enterprise Data Hub Edition clusters on-demand,

Read More

The Hadoop FAQ for Oracle DBAs

Categories: Hadoop

Oracle DBAs, get answers to many of your most common questions about getting started with Hadoop.

As a former Oracle DBA, I get a lot of questions (most welcome!) from current DBAs in the Oracle ecosystem who are interested in Apache Hadoop. Here are a few of the more frequently asked questions, along with my most common replies.

How much does the IT industry value Oracle DBA professionals who have switched to Hadoop administration,

Read More

Meet the Project Founder: Josh Wills

Categories: Data Science Hadoop MapReduce Meet the Engineer

In this installment of “Meet the Project Founder,” we speak with Josh Wills (@josh_wills), Cloudera’s Senior Director of Data Science and founder of Apache Crunch and Cloudera ML.

What led you to your project idea(s)?

When I first started at Cloudera in 2011, I had a fairly vague job description, no real responsibilities, and wasn’t all that familiar with the Apache Hadoop stack, so I started working on various pet projects in order to learn more about the tools and the use cases in domains like healthcare and energy.

Read More

Meet the Project Founder: Tom White

Categories: Cloud Meet the Engineer

In this new installment of our “Meet the Project Founder” series, meet Tom White, founder of Apache Whirr, PMC Member for multiple other projects (Apache Hadoop, Apache Avro, Apache Bigtop, Apache Sqoop), and author of O’Reilly Media’s best-selling book, Hadoop: The Definitive Guide.

What led you to your project idea(s)?

Whirr grew out of some scripts I had written in 2006 for spinning up Hadoop clusters on Amazon EC2.

Read More

Apache Bigtop: The "Fedora of Hadoop" is Now Built on Hadoop 2.x

Categories: Bigtop CDH Hadoop

Just in time for Hadoop Summit 2013, the Apache Bigtop team is very pleased to announce the release of Bigtop 0.6.0: the very first release of a fully integrated Big Data management distribution built on the most advanced Hadoop 2.x release currently available, Hadoop 2.0.5-alpha.

Bigtop, as many of you might already know, is a project aimed at creating a 100% open source and community-driven Big Data management distribution based on Apache Hadoop.

Read More