This Month in the Ecosystem (November 2013)

Categories: Community Hadoop

Welcome to our fifth edition of “This Month in the Ecosystem,” a digest of highlights from November 2013 (never intended to be comprehensive; for completeness, see the excellent Hadoop Weekly).

With the holidays upon us, the news in November was sparse. Even so, the ecosystem never stops churning!

  • Continuuity Weave was Proposed as an Apache Incubator Project
    Weave, an effort to make building new apps on top of YARN much easier for mainstream developers, has been proposed as an incubator project under a new monicker, “Twill.” We think anything that makes developers’ lives easier is a Very Good Thing, and we applaud Continuuity for this proposal. Cloudera’s Tom White and Patrick Hunt were nominated as mentors.
    Read the Incubator Proposal for Twill
  • Apache Hadoop Got Massively More Accessible
    Cloudera and Udacity, purveyor of massively open online courses (MOOC) for tech education, have partnered up to bring the first Hadoop-related MOOCs to the world. Co-developed by Cloudera University faculty and Udacity’s online instructional designers, the new courses help technologists acquire fundamental skills for working with the foundational technology of Enterprise Data Hubs.
    Enroll in the first new course: “Introduction to Hadoop MapReduce” 
  • The Data Products Toolbox Got Deeper with Oryx and Gertrude
    Cloudera’s data science department announced two new open source projects to help facilitate the creation of new “data products”:  Oryx (infrastructure for building and deploying machine-learning models) and Gertrude (a framework for testing/experimenting with data products). Both projects are in alpha state so there is still much work to be done.
    Learn more about/see source code for OryxLearn more about/see source code for Gertrude
  • Facebook Open-Sourced Presto
    Facebook contributed its distributed SQL-on-Hadoop engine, Presto, to open source. Like Cloudera Impala, Presto dispenses with MapReduce for performance reasons, and Apache Hive data is accessed via CDH. Unlike Impala, no commercial support is yet available. Let the benchmarks fly!
    Learn more about Presto

The next installment of “This Month in the Ecosystem” will publish in early January.

Justin Kestelyn is Cloudera’s developer outreach director.