Category Archives: Cloud

Meet the Engineer: Andrei Savu

Categories: Cloud Cloudera Manager Meet the Engineer

In this installment of “Meet the Engineer,” our subject is Andrei Savu!

What do you do at Cloudera?

At Cloudera I work on cloud deployment automation and general platform improvements to make sure everything runs smoothly on elastic infrastructure when using various managed services. My team builds on top of Cloudera Manager and we integrate with different cloud provider APIs to provision production Cloudera Enterprise Data Hub Edition clusters on-demand,

Read more

Best Practices for Deploying Cloudera Enterprise on Amazon Web Services

Categories: CDH Cloud Ops and DevOps

This FAQ contains answers to the most frequently asked questions about the architecture and configuration choices involved.

In December 2013, Cloudera and Amazon Web Services (AWS) announced a partnership to support Cloudera Enterprise on AWS infrastructure. Along with this announcement, we released a Deployment Reference Architecture Whitepaper. In this post, you’ll get answers to the most frequently asked questions about the architecture and the configuration choices that have been highlighted in that whitepaper.

Read more

How-to: Use Impala on Amazon EMR

Categories: Cloud How-to Impala

Developers, rejoice: Impala is now available on EMR for testing and evaluation.

Very recently, Amazon Web Services announced support for running Cloudera Impala queries on its Elastic MapReduce (EMR) service. This is very good news for EMR users — as well as for users of other platforms interested in kicking Impala’s tires in a friction-free way. It’s also yet another sign that Impala is rapidly being adopted across the ecosystem as the gold standard for interactive SQL and BI queries on Apache Hadoop.

Read more

Meet the Project Founder: Tom White

Categories: Cloud Meet the Engineer

In this new installment of our “Meet the Project Founder” series, meet Tom White, founder of Apache Whirr, PMC Member for multiple other projects (Apache Hadoop, Apache Avro, Apache Bigtop, Apache Sqoop), and author of O’Reilly Media’s best-selling book, Hadoop: The Definitive Guide.

What led you to your project idea(s)?

Whirr grew out of some scripts I had written in 2006 for spinning up Hadoop clusters on Amazon EC2.

Read more

How-to: Create a CDH Cluster on Amazon EC2 via Cloudera Manager

Categories: CDH Cloud Cloudera Manager How-to Impala Ops and DevOps

Editor’s Note (added Feb. 25, 2015): For releases beyond 4.5, Cloudera recommends the use of Cloudera Director for deploying CDH in cloud environments. 

Cloudera Manager includes a new express installation wizard for Amazon Web Services (AWS) EC2. Its goal is to enable Cloudera Manager users to provision CDH clusters and Cloudera Impala (the open source distributed query engine for Apache Hadoop) on EC2 as easily as possible (for testing and development purposes only,

Read more