Cloudera Enterprise 5.2 is Released

Categories: CDH Cloud Hadoop

Cloudera Enterprise 5.2 contains new functionality for security, cloud deployments, and real-time architectures, and support for the latest open source component releases and partner technologies.

We’re pleased to announce the release of Cloudera Enterprise 5.2 (comprising CDH 5.2, Cloudera Manager 5.2, Cloudera Director 1.0, and Cloudera Navigator 2.1).

This release reflects our continuing investments in Cloudera Enterprise’s main focus areas, including security, integration with the partner ecosystem, and support for the latest innovations in the open source platform (including Impala 2.0, its most significant release yet, and Apache Hive 0.13.1). It also includes a new product, Cloudera Director, that streamlines deployment and management of enterprise-grade Hadoop clusters in cloud environments; new component releases for building real-time applications; and new support for significant partner technologies like EMC Isilon. Furthermore, this release ships the first results of joint engineering with Intel, including WITH GRANT OPTION for Hive and Impala and performance optimizations for MapReduce.

Here are some of the highlights (incomplete; see the respective Release Notes for CDH, Cloudera Manager, and Cloudera Navigator for full lists of features and fixes):

Security

  • Via Apache Sentry (incubating) 1.4, GRANT and REVOKE statements in Impala and Hive can now include WITH GRANT OPTION, for delegation of granting and revoking privileges (joint work with Intel under Project Rhino). (Learn more.)
  • Hue has a new Sentry UI that supports policy management for visually creating/editing roles in Sentry and permissions on files in HDFS.
  • Kerberos authentication is now supported in Apache Accumulo.
  • In Impala, authentication can now be done through a combination of Kerberos and LDAP. (Learn more.)
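
As a rough illustration of the WITH GRANT OPTION delegation mentioned above, the sketch below only builds the statement text; it does not connect to Hive or Impala. The helper names, the role `analyst_lead`, and the table `sales.orders` are hypothetical, and the exact grammar accepted by your Sentry version should be checked against its documentation.

```python
def grant_statement(privilege, object_type, object_name, role,
                    with_grant_option=False):
    """Build a Sentry-style GRANT statement as text (nothing is executed)."""
    stmt = f"GRANT {privilege} ON {object_type} {object_name} TO ROLE {role}"
    if with_grant_option:
        # Lets members of the role grant or revoke this privilege themselves.
        stmt += " WITH GRANT OPTION"
    return stmt + ";"


def revoke_statement(privilege, object_type, object_name, role):
    """Build the matching REVOKE statement as text (nothing is executed)."""
    return f"REVOKE {privilege} ON {object_type} {object_name} FROM ROLE {role};"


if __name__ == "__main__":
    # Hypothetical role and table names, for illustration only.
    print(grant_statement("SELECT", "TABLE", "sales.orders", "analyst_lead",
                          with_grant_option=True))
    print(revoke_statement("SELECT", "TABLE", "sales.orders", "analyst_lead"))
```

The resulting text could then be pasted into a Hive or Impala session by a user who already holds the privilege with the grant option.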

Data Management and Governance

  • Cloudera Navigator 2.1 features a brand-new auditing UI that is unified with lineage and discovery, so you now have access to all Navigator functionality from a single interface.
  • Navigator 2.1 includes role-based access control so you can restrict access to auditing, metadata and policy management capabilities.
  • We’re also shipping a beta policy engine in Navigator 2.1. Targeted to GA by year-end, the policy engine allows you to set up rules and notifications so you can classify data as it arrives and integrate with data preparation and profiling tools. Try it out and let us know what you think!
  • And we’ve added lots of top-requested enhancements, such as Sentry auditing for Impala and integration with Hue.

Cloud Deployment

  • Cloudera Director is a simple and reliable way to deploy, scale, and manage Hadoop in the cloud (initially for AWS) in an enterprise-grade fashion. It’s free to download and use, and supported by default for Cloudera Enterprise customers. See the User Guide for more details.

Real-Time Architecture

  • Re-base on Apache HBase 0.98.6
  • Re-base on Apache Spark/Streaming 1.1
  • Re-base on Impala 2.0
  • Apache Sqoop now supports import into Apache Parquet (incubating) file format
  • Apache Kafka integration with CDH is now incubating in Cloudera Labs; a Kafka-Cloudera Labs parcel (unsupported) is available for installation. Integration with Flume via a special Source and Sink has also been provided.

Impala 2.0

  • Disk-based query processing: enables large queries to “spill to disk” if their in-memory structures are larger than the currently available memory. (Note that this feature only uses disk for the portion that doesn’t fit in the available memory.)
  • Greater SQL compatibility: SQL 2003 analytic window functions, support for legacy data types (such as CHAR and VARCHAR), better compliance with SQL standards (WHERE, EXISTS, IN), and additional vendor-specific SQL extensions. (Learn more.)
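
To give a feel for the SQL 2003 analytic window functions mentioned above, here is a minimal sketch using LAG() with an OVER clause. It runs against an in-memory SQLite database (version 3.25 or later) purely so the example is self-contained; it is not Impala, and the `page_views` table and its data are made up for illustration. The same query shape is the kind of statement the new analytic function support is meant to cover.

```python
import sqlite3

# Hypothetical sample data: daily page views per site.
ROWS = [
    ("a.example", 1, 100),
    ("a.example", 2, 130),
    ("a.example", 3, 120),
    ("b.example", 1, 50),
    ("b.example", 2, 80),
]

# LAG() looks at the previous row within each partition, so "delta" is the
# day-over-day change in views for each site (NULL for a site's first day).
QUERY = """
SELECT site, day, views,
       views - LAG(views) OVER (PARTITION BY site ORDER BY day) AS delta
FROM page_views
ORDER BY site, day
"""


def run_demo():
    """Load the sample rows into SQLite and return the query result."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "CREATE TABLE page_views (site TEXT, day INTEGER, views INTEGER)"
        )
        conn.executemany("INSERT INTO page_views VALUES (?, ?, ?)", ROWS)
        return conn.execute(QUERY).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    for row in run_demo():
        print(row)
```

Without a window function, the same day-over-day comparison would typically need a self-join on the table.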

New Open Source Releases and Certifications

Cloudera Enterprise 5.2 includes multiple new component releases:

  • Apache Avro 1.7.6
  • Apache Crunch 0.11
  • Apache Hadoop 2.5
  • Apache HBase 0.98.6
  • Apache Hive 0.13.1
  • Apache Parquet (incubating) 1.5 / Parquet-format 2.1.0
  • Apache Sentry (incubating) 1.4
  • Apache Spark 1.1
  • Apache Sqoop 1.4.5
  • Impala 2.0
  • Kite SDK 0.15.0

…with new certifications on:

  • Filesystems: EMC Isilon
  • OSs: Ubuntu 14.04 (Trusty)
  • Java: Oracle JDK1.7.0_67

Over the next few weeks, we’ll publish blog posts that cover some of these and other new features in detail.

As always, we value your feedback; please provide any comments and suggestions through our community forums. You can also file bugs via issues.cloudera.org.

7 responses on “Cloudera Enterprise 5.2 is Released”

  1. Sree Vaddi

    The installer still seems to be looking for the 5.1.2 files?

    # more /var/log/cloudera-manager-installer/3.install-cloudera-manager-server.log
    Loaded plugins: product-id, refresh-packagekit, subscription-manager
    Updating Red Hat repositories.
    Setting up Install Process
    Resolving Dependencies
    --> Running transaction check
    ---> Package cloudera-manager-server.x86_64 0:5.1.2-1.cm512.p0.116.el6 will be installed
    --> Processing Dependency: cloudera-manager-daemons = 5.1.2 for package: cloudera-manager-server-5.1.2-1.cm512.p0.116.el6.x86_64
    --> Running transaction check
    ---> Package cloudera-manager-daemons.x86_64 0:5.1.2-1.cm512.p0.116.el6 will be installed
    --> Finished Dependency Resolution

    Dependencies Resolved

    ================================================================================
    Package Arch Version Repository Size
    ================================================================================
    Installing:
    cloudera-manager-server x86_64 5.1.2-1.cm512.p0.116.el6 cloudera-manager 8.0 k
    Installing for dependencies:
    cloudera-manager-daemons
    x86_64 5.1.2-1.cm512.p0.116.el6 cloudera-manager 377 M

    Transaction Summary
    ================================================================================
    Install 2 Package(s)

    Total download size: 377 M
    Installed size: 508 M
    Downloading Packages:
    http://archive.cloudera.com/cm5/redhat/6/x86_64/cm/5/RPMS/x86_64/cloudera-manager-daemons-5.1.2-1.cm512.p0.116.el6.x86_64.rpm: [Errno 14] PYCURL ERROR 22 - "The requested URL returned error: 404 Not Found"
    Trying other mirror.
    http://archive.cloudera.com/cm5/redhat/6/x86_64/cm/5/RPMS/x86_64/cloudera-manager-server-5.1.2-1.cm512.p0.116.el6.x86_64.rpm: [Errno 14] PYCURL ERROR 22 - "The requested URL returned error: 404 Not Found"
    Trying other mirror.

    Error Downloading Packages:
    cloudera-manager-daemons-5.1.2-1.cm512.p0.116.el6.x86_64: failure: RPMS/x86_64/cloudera-manager-daemons-5.1.2-1.cm512.p0.116.el6.x86_64.rpm from cloudera-manager: [Errno 256] No more mirrors to try.
    cloudera-manager-server-5.1.2-1.cm512.p0.116.el6.x86_64: failure: RPMS/x86_64/cloudera-manager-server-5.1.2-1.cm512.p0.116.el6.x86_64.rpm from cloudera-manager: [Errno 256] No more mirrors to try.

    1. Justin Kestelyn (@kestelyn) Post author

      This could be due to a stale package list; running “yum clean all” should take care of it.

  2. Hugh McBride

    I could only get Spark to deploy on a single-node instance.
    Shouldn’t the executors run on all the data nodes? They did by default in CDH 5.1.

  3. Brian Baillod

    Sree – I had that problem and YUM CLEAN ALL didn’t fix it. I had to go to /var/cache/yum/x86_64/6/cloudera-manager and remove the offending Cloudera Manager RPM files manually.

  4. Prabhanjana

    Please let me know whether Cascading is certified with CDH 5.2. I don’t see it in the above list.

    Thanks,
    Prabhanjana