Accumulo Comes to CDH
- by Sean Busbey, Bill Havanki, and Mike Drob
- December 20, 2013
- 8 comments
Apache Accumulo is now generally available on CDH 4.
Cloudera is pleased to announce the immediate availability of its first release of Accumulo packaged to run under CDH, our open source distribution of Apache Hadoop and related projects and the foundational infrastructure for Enterprise Data Hubs.
Accumulo is an open source project that provides the ability to store data in massive tables (billions of rows, millions of columns) for fast, random access. Accumulo was created and contributed to the Apache Software Foundation by the National Security Agency (NSA), and it has quickly gained adoption as a Hadoop-based key/value store for applications that require access to sensitive data sets. Cloudera provides enterprise support with the RTD Accumulo add-on subscription for Cloudera Enterprise.
This release provides Accumulo 1.4.3 tested for use under CDH 4.3.0. The release includes a significant number of backports and fixes to allow use with CDH 4’s highly available, production-ready packaging of HDFS. As a part of our commitment to the open source community, these changes have been submitted back upstream.
To get started now, you can follow these instructions to handle installation using the Cloudera Manager 5.0.0 open beta release with our new extensibility framework. The instructions will also walk you through downloading a single distribution file and installing Accumulo on the current Cloudera QuickStart VM.
Once you get going, we would love to hear your feedback:
- You can ask questions, get help, and share your growing expertise on our community forum for questions about CDH storage components.
- You can file a bug through our public Jira instances.
For more information, please visit Cloudera’s Accumulo page.
Sean Busbey and Bill Havanki are Solutions Architects at Cloudera. Mike Drob is a Software Engineer at Cloudera and an Accumulo Committer.