Anand Iyer, Author at Cloudera Blog

February 8, 2017 | Technical

Accelerating Apache Spark MLlib with Intel® Math Kernel Library (Intel® MKL)

There are two clear trends in the big-data ecosystem: the growth of machine learning use cases that leverage large distributed data sets, and the growth of Spark’s Machine Learning libraries (often referred to as MLlib) for these use cases. In fact, Spark’s MLlib library is arguably the leading solution for machine learning on large distributed […]

by Anand Iyer , Vikram Saletore (Intel) 3 min read

Apache Spark Data Science

July 1, 2015 | Technical

Deploying Apache Kafka: A Practical FAQ

This post contains answers to common questions about deploying and configuring Apache Kafka as part of a Cloudera-powered enterprise data hub. Cloudera added support for Apache Kafka, the open standard for streaming data, in February 2015 after its brief incubation period in Cloudera Labs. Apache Kafka now is an integrated part of CDH, manageable via […]

by Anand Iyer , Gwen Shapira 5 min read

Apache Kafka Cloudera Enterprise

More by this author:

Accelerating Apache Spark MLlib with Intel® Math Kernel Library (Intel® MKL)

Deploying Apache Kafka: A Practical FAQ