Updated 11/22/16 – Important: All features below are working on CDH 5.9.0 and CM 5.9.0 and above. This tool makes Oozie migrations off Apache Derby (or any other supported database) easy, in addition to streamlining upgrades. The Apache Oozie server is a stateless web application by design, with all information about running and completed workflows, […]
The internals of Oozie’s ShareLib have changed recently (reflected in CDH 5.0.0). Here’s what you need to know. In a previous blog post about one year ago, I explained how to use the Apache Oozie ShareLib in CDH 4. Since that time, things have changed about the ShareLib in CDH 5 (particularly directory structure), so some […]
Introduction Some of the configuration properties found in Apache Hadoop have a direct effect on clients, such as Apache HBase. One of those properties is called “dfs.datanode.max.xcievers”, and belongs to the HDFS subproject. It defines the number of server side threads and – to some extent – sockets used for data connections. Setting this number too […]