The Apache ZooKeeper team has announced that Apache ZooKeeper release 3.5.3-beta is now available! This release is the first beta release of the 3.5 series, which cover 77 issues, thirteen of which were considered blockers. Here are some highlights:
- ZOOKEEPER-2719 Enable creation of TTL nodes, which are znode that’s not tied to a session and will get cleaned up automatically once expired.
- ZOOKEEPER-2014 Only admin roles should be allowed to reconfigure a cluster
- ZOOKEEPER-2693 Prevent DOS attack on wchp/wchc four letter words (4lw)
Critical Bug Fixes
- ZOOKEEPER-2383 Solve startup race in ZooKeeperServer
- ZOOKEEPER-2172 Cluster crashes when reconfig a new node as a participant
- ZOOKEEPER-2737 NettyServerCnxFactory leaks connection if exception happens while writing to a channel
- ZOOKEEPER-2247 Zookeeper service becomes unavailable when leader fails to write transaction log
- ZOOKEEPER-2080 Fix deadlock in dynamic reconfiguration
- ZOOKEEPER-2687 Deadlock while shutting down the Leader server
Today, Cloudera announced the availability of an Apache Spark 2.0 Beta release for users of the Cloudera platform.
Apache Spark 2.0 is tremendously exciting (read this post for more background) because (among other things):
- The Dataset API further enhances Spark’s claim as the best tool for data engineering by providing compile-time type safety along with the benefits of a query-optimization engine.
- The Structured Streaming API enables the modeling of streaming data as a continuous DataFrame and expresses operations on that data with a SQL-like API.
With this new beta release, column-level privileges set via Apache Sentry (incubating) are now enforced on Spark/MapReduce jobs.
Cloudera is excited to announce the availability of the second beta release for RecordService. This release is based on CDH 5.5 and provides some new features, including:
- Support for Sentry column-level security. Previously, column-level access control required the use of views; now,
Contributors from Intel, Cloudera, and the rest of the community have been making strong progress on the Hive-on-Spark initiative. This post provides an update.
[Editor’s note (April 20, 2016): Hive-on-Spark is now GA/shipping starting in CDH 5.7.]
Since its inception about one year ago, the community initiative to make Apache Spark a data processing engine for Apache Hive (HIVE-7292) has attracted widespread interest from developers around the world and gone through phases of rapid development,
Now there’s an even quicker “QuickStart” option for getting hands-on with the Apache Hadoop ecosystem and Cloudera’s platform: a new Docker image.
You might already be familiar with Cloudera’s popular QuickStart VM, a virtual image containing our distributed data processing platform. Originally intended as a demo environment, the QuickStart VM quickly evolved over time into quite a useful general-purpose environment for developers, customers,