Tag Archives: students

New Cloudera Search Training: Learn Powerful Techniques for Full-Text Search on an EDH

Categories: Search Training

Cloudera Search combines the speed of Apache Solr with the scalability of CDH. Our newest training course covers this exciting technology in depth, from indexing to user interfaces, and is ideal for developers, analysts, and engineers who want to learn how to effectively search both structured and unstructured data at scale.

Despite being nearly 10 years old, Apache Hadoop already has an interesting history. Some of you may know that it was inspired by the Google File System and MapReduce papers,

Read more

5 Pitfalls of Benchmarking Big Data Systems

Categories: Hadoop Performance

Benchmarking Big Data systems is nontrivial. Avoid these traps!

Here at Cloudera, we know how hard it is to get reliable performance benchmarking results. Benchmarking matters because one of the defining characteristics of Big Data systems is the ability to process large datasets faster. “How large” and “how fast” drive technology choices, purchasing decisions, and cluster operations. Even with the best intentions, performance benchmarking is fraught with pitfalls—easy to get numbers,

Read more

Meet the Instructor: Bruce Martin

Categories: Meet the Engineer Training

In this installment of “Meet the Instructor”, our interview subject is Bruce Martin.

What is your role at Cloudera?

I am a Senior Instructor at Cloudera. I teach all of our courses. I most often teach our Data Science, Developer, and Data Analyst courses, all of which make up the Developer Learning Path.

What do you enjoy most about training and/or curriculum development?

Read more

NYU, Analytics, and Cloudera’s QuickStart VM

Categories: Hadoop QuickStart VM Training

The Cloudera QuickStart VM is an important platform for learning any Hadoop-related curriculum.

In the Fall 2013 semester, more than 30 NYU graduate students completed the Real-time and Big Data Analytics course at the NYU Courant Institute of Mathematical Sciences, for which I served as instructor.

In this introductory analytics course, students learn the architectures of the Apache Hadoop storage and compute systems (HDFS and MapReduce respectively).

Read more

This Month (and Year) in the Ecosystem (December 2013)

Categories: Community Hadoop HBase Impala Spark

Welcome to our sixth edition of “This Month in the Ecosystem,” a digest of highlights from December 2013 (never intended to be comprehensive; for completeness, see the excellent Hadoop Weekly).

With the close of 2013, we also thought it appropriate to include some high points from across the year (not listed in any particular order):

Read more