Category Archives: Hue

How-to: Make Hadoop Accessible via LDAP

Categories: Hadoop How-to Hue Security

Integrating Hue with LDAP can help make your secure Hadoop apps as widely consumed as possible.

Hue, the open source Web UI that makes Apache Hadoop easier to use, easily integrates with your corporation’s existing identity management systems and provides authentication mechanisms for SSO providers. So, by changing a few configuration parameters, your employees can start analyzing Big Data in their own browsers under an existing security policy.

Read More

A New Web UI for Spark

Categories: Hue Spark

The team behind Hue, the open source Web UI that makes Apache Hadoop easier to use, strikes again with a new Spark app.

Editor’s note: This post was recently published on the Hue blog. We republish it here for your convenience.

Hi Spark Makers!

Hue application for Apache Spark (incubating) was recently created. It lets users execute and monitor Spark jobs directly from their browser and be more productive.

Read More

How-to: Index and Search Data with Hue’s Search App

Categories: How-to Hue Search

You can use Hue and Cloudera Search to build your own integrated Big Data search app.

In a previous post, you learned how to analyze data using Apache Hive via Hue’s Beeswax and Catalog apps. This time, you’ll see how to make Yelp Dataset Challenge data searchable by indexing it and building a customizable UI with the Hue Search app.

Indexing Data in Cloudera Search

Indexing data in Cloudera Search involves :

  • Setting up SolrCloud to partition your dataset into multiple indexes and processes
  • Configuring SolrCloud collections to hold indexes
  • Specifying the schema by which indexes will be created
  • Feeding relevant data into the SolrCloud

First,

Read More

Sqooping Data with Hue

Categories: Hue Sqoop

Hue, the open source Web UI that makes Apache Hadoop easier to use, has a brand-new application that enables transferring data between relational databases and Hadoop. This new application is driven by Apache Sqoop 2 and has several user experience improvements, to boot.

Sqoop is a batch data migration tool for transferring data between traditional databases and Hadoop. The first version of Sqoop is a heavy client that drives and oversees data transfer via MapReduce.

Read More