New Cloudera Search Training: Learn Powerful Techniques for Full-Text Search on an EDH

Categories: Search Training

Cloudera Search combines the speed of Apache Solr with the scalability of CDH. Our newest training course covers this exciting technology in depth, from indexing to user interfaces, and is ideal for developers, analysts, and engineers who want to learn how to effectively search both structured and unstructured data at scale.

Despite being nearly 10 years old, Apache Hadoop already has an interesting history. Some of you may know that it was inspired by the Google File System and MapReduce papers, which detailed how the search giant was able to store and process vast amounts of data. Search was the original Big Data application, and, in fact, Hadoop itself was a spinoff of a project designed to create a reliable, scalable system to index data using one of Doug Cutting’s other creations: Apache Lucene.

Fortunately, many of Hadoop’s early developers and users had the vision to see that a system that offers scalable, cost-effective data storage and processing would become increasingly important in a world that generates data at ever-increasing rates. Hadoop ultimately became the center of an entire ecosystem of tools that made it easier to ingest, process, and analyze a variety of data. Concurrently, Yonik Seeley was creating what eventually became Apache Solr, which is also based on Apache Lucene. Cloudera Search brings these technologies together to enable full-text search at scale.

What is search, exactly? Many people picture Google’s home page, simply offering a text box for you to enter search terms and a button that submits your query. That’s certainly one application of search — and one that helps to illustrate how Cloudera Search can help users to find relevant information regardless of their technical ability — but it’s not the only one. In fact, search also encompasses analytical capabilities that allow you to explore your data interactively, as you can see below:

Economics and technology have converged to usher in the era of the enterprise data hub. As organizations become more data-driven, it’s more important than ever for decision makers to identify important trends. Cloudera Search is an ideal tool for this, and it’s often used for data discovery and exploration. Yet the ability to quickly drill down and find a specific result also makes it a good choice for detecting anomalies. Rather than finding a Web site as one might with internet search, an enterprise user would use Cloudera Search to analyze customer satisfaction through social media posts, identify a failing device using network sensor data, or uncover intrusion attempts hidden away in firewall logs.

Although our Designing and Building Big Data Applications course does include a section on Cloudera Search, many students wanted to cover Search in greater depth after they discovered how they could apply it in their own organizations. That’s why I’m pleased to announce the availability of Cloudera Search Training, a three-day course dedicated to showing you how to ingest, index, and query data at scale using this versatile technology.

If you’d like to learn more about this course, I invite you to join me for a webinar on Tuesday, April 28th. We’ll cover:

  • What Cloudera Search is and how it enables data discovery and analytics
  • How to perform indexing of data from various sources and in various formats
  • Who is best suited to attend the course and what prior knowledge you should have
  • The benefits Search delivers as part of an enterprise data hub

Register now for the webinar.

Tom Wheeler is a Senior Curriculum Developer for Cloudera University.