As announced last Sunday (Aug. 25) on the project mailing list, Apache Hadoop 2.1.0 is the first beta release for Hadoop 2. (See the Release Notes for full list of new features and fixes.) Our congratulations to the Hadoop community for reaching this important milestone in the ongoing adoption of the core Hadoop platform!
With the release of this new beta, and the follow-on GA release on the horizon, we expect to see more customers exploring Hadoop 2 for production use cases. In fact, the upcoming CDH5 beta will be based on the Hadoop 2 GA release, delivering features that we’ve thoroughly tested against enterprise requirements, including (but not limited to):
- YARN, for generalized resource management across MR and other applications
- YARN Client APIs, to make building YARN-based apps easier
- Improved NFS support in HDFS, for better integration with external storage systems
- HBase and HDFS snapshots, for improved recoverability
We would also be remiss to ignore that the integration testing for Hadoop 2.1.0 beta was made possible by the Apache Bigtop project, which by virtue of its mission to ensure interoperability across the Hadoop ecosystem, provides the framework for this testing to continually occur. Thanks to that work, you can be assured of Hadoop 2.x interoperability with a wide range of ecosystem projects.
In a future blog post, we will explain how and why compatibility between most applications written on 1.x and 2.x is assured. In the meantime, the entire community of users and developers deserves to bask in the glow of Hadoop 2.1.0 beta!
Justin Kestelyn is Cloudera’s developer outreach director.