Open Source

Open source is a central part of Cloudera’s business. We recognize that Apache Hadoop’s success comes in large part from being an open platform. In keeping with that, Cloudera’s Distribution Including Apache Hadoop is 100% open source and Apache licensed.

Cloudera does not just package and distribute open source software; we actively contribute to it. More than 50% of Cloudera’s engineering investment goes back to Apache-licensed open source projects. Projects Cloudera contributes to include:

Name                        Project URL
Adverse Drug Event System   https://github.com/cloudera/ades
Apache Avro                 http://avro.apache.org
Apache Bigtop (incubating)  http://incubator.apache.org/bigtop/
Apache Flume                http://incubator.apache.org/flume
Apache Hadoop Common        http://hadoop.apache.org/common
Apache HBase                http://hbase.apache.org
Apache HDFS                 http://hadoop.apache.org/hdfs
Apache Hive                 http://hive.apache.org
Apache Mahout               http://mahout.apache.org/
Apache MapReduce            http://hadoop.apache.org/mapreduce
Apache Oozie (incubating)   https://github.com/yahoo/oozie
Apache Pig                  http://pig.apache.org
Apache Sqoop                http://sqoop.apache.org/
Apache Whirr                http://whirr.apache.org
Apache ZooKeeper            http://zookeeper.apache.org
Crepo                       http://github.com/cloudera/crepo
Crunch                      https://github.com/cloudera/crunch
Hadoop LZO                  http://github.com/cloudera/hadoop-lzo
Hue                         http://github.com/cloudera/hue
JCarder                     http://github.com/toddlipcon/jcarder
Jenkins                     http://jenkins-ci.org/
MooTools                    http://github.com/mootools/mootools-core
Record Breaker              https://github.com/cloudera/RecordBreaker
Seismic Hadoop              https://github.com/cloudera/seismichadoop