Debian packages for Apache Hadoop
When we announced Cloudera’s Distribution for Apache Hadoop last month, we asked the community to give us feedback on what features they liked best and what new development was most important to them. Almost immediately, Debian and Ubuntu packages for Hadoop emerged as the most popular request. A lot of customers prefer Debian derivatives over Red Hat, and installing RPMs on top of Debian, while possible with tools like alien, is a pain to say the least.
After some weeks of development and testing, we are happy to announce the Cloudera APT Repository. APT is the standard package distribution mechanism for Ubuntu and Debian, and by simply pointing your machines at our repository, you can have Hadoop installed within minutes.
Our Debian packages are comprised of the same components as our RPM based distribution, including:
- Standard Linux service management – we package scripts in
/etc/init.dfor all of the major components of the Hadoop system
- Native libraries on supported platforms – there are separate architecture-dependent packages for Hadoop Pipes, libhdfs, and native-code compression acceleration.
- Extra Hadoop-based tools – along with core Hadoop, we have packages available for Pig and Hive
To get you started with Cloudera’s Distribution for Hadoop on Debian and Ubuntu, we’ve written up a short tutorial. Check it out, and remember to let us know what you think!