New in CDH 5.7: Improved Performance, Security, and SQL Experience in Hue

Categories: CDH Hue

CDH 5.7 includes a lot of changes (more than 1,500) to Hue, the Web UI that makes Apache Hadoop easier to use.

In this new release, the emphasis on performance and security carries over from 5.5. The overall improvement in the SQL user experience is also considerable.

In this post, we’ll cover some highlights.

New Hive Metastore Interface

This app is now on a single page, offering more speed and easier access to statistics and data previews. Read about more of the improvements below; or, watch a demo of all of them in action.

New UI

The front end has been rewritten to be slicker and more user-friendly. More information is displayed, and navigating across tabs is seamless as it no longer entails a page refresh.

hue-57-f2

Table-specific page before

hue-57-f1

Table-specific page after

Improved Speed

Single-page app: The initial page loads very quickly and asynchronously fetches the list of tables, table statistics, data sample, and partition list. Subsequent navigation clicks will trigger only 1 or 2 calls to the server, instead of reloading all the page resources again. As an added bonus, the browser history now works on all the pages.

hue-57-f3

hue-57-f4

Caching: Because all pages containing tables and databases (as well as the autocomplete editor) now point to the same cache, the fetching of that data will only happen once. On the Apache Hive side, these calls have also been optimized to take seconds instead of minutes (HIVE-7575).

Don’t freeze my browser: On top of doing caching, Hue is now much smarter about displaying only elements visible on the screen. For example, if the user has a list of 5,000 tables, only tens of them will actually be rendered (the rendering is the costly part).

Read more here.

SQL Editor Improvements

The most used app in Hue got:

  • A format queries button
  • A delete history button
  • Fixed columns and rows headers for a smoother scrolling
  • Hive on Spark support

hue-57-f7

Security

  • A timeout now logs out inactive user after idle_session_timeout seconds.
  • Add an optional custom security splash screen at login with login_splash_html.
  • TLS certificate chain support
  • SAML
    • Password for the key_file was introduced with key_file_password.
    • Customize your xmlsec1 binary by changing xmlsec_binary.
    • Customize your SAML username mapping. It also supports syncing groups on login.

hue-57-f8

Search Suggest

Result in the Grid Widget can be plotted as in the SQL editor. This is ideal for clicking visualizing the rows returned by the search query.

hue-57-f9

Hue supports Solr Suggesters and makes your data easier to search! Suggester assists the user by proposing an auto-completable list of queries. (Read more here).

HDFS Summary

Right-click on a file or directory to access disk-space consumed, quotas, and number of directories and files.

hue-57-f10

Apache Oozie Improvements

  • External workflow graph: This feature lets you see the graph for workflows submitted from the file browser as well as the ones submitted from CLI.
  • Dry-run Oozie job: The dry-run option tests running a workflow/coordinator/bundle job with given properties without the need to create the job.
  • Timezone improvements: All the times on the dashboard are now defaulted to the browser timezone; submitting a coordinator/bundle no longer need UTC times.
  • Emailing automatically on failure: Each kill node now embeds an optional email action. Edit a kill node to insert a custom message in case it gets called.

(Read more here).

hue-57-f11

Next Up!

The next CDH version (5.8) of Hue will be rebased on Hue 3.10. On top of hundreds of improvements it brings a revamp of the SQL editor and the home page!

facebooktwittergoogle_pluslinkedinmailfacebooktwittergoogle_pluslinkedinmail

4 responses on “New in CDH 5.7: Improved Performance, Security, and SQL Experience in Hue

    1. Romain

      CDH 5.7 comes with an hybrid of Hue version, think 3.10 minus the new SQL Editor and Home page which are huge changes. The main improvement in 5.7 was the revamp into a single page app of the Metastore Browser.

      Will have a blog post on gethue.com and after the CDH5.8 release about these additional large improvements.

  1. Naveen

    I am upgrading from CDH version 5.4 to 5.7.
    Can some one post the improvements that I can expect in BDR (backup and data recovery) from CDH 5.4 tp CDH 5.7?
    Even though I set 40 mappers for a bdr job it runs on 40 for a while and then it will reduce the mappers to say 20 or 15 and runs for several hours. Because of this we are not able to keep up with the replication. Is this fixed in CDH 5.7?
    Need to know what improvements are done for BDR on 5.7.