Author Archives: Ed Albanese

Oracle selects CDH and Cloudera Manager as the Apache Hadoop Platform for the Oracle Big Data Appliance

Categories: CDH Community General

Cloudera users gain more choice, tighter Oracle integration. Cloudera partners gain increased validation of their platform choice.

Ed Albanese
Ed leads business development for Cloudera. He is responsible for identifying new markets, revenue opportunities and strategic alliances for the company.

Summary: Oracle has selected Cloudera’s Distribution Including Apache Hadoop (CDH) and Cloudera Manager software as core technologies on the Oracle Big Data Appliance,

Read more

Scaling Social Science with Apache Hadoop

Categories: General

This post was contributed by researcher Scott Golder, who studies social networks at Cornell University. Scott was previously a research scientists at HP Labs and the MIT Media Laboratory.

The methods of social science are dear in time and money and getting dearer every day.
— George C. Homans, Social Behavior: Its Elementary Forms, 1974.

When Homans — one of my favorite 20th century social scientists — wrote the above,

Read more

Natural Language Processing with Apache Hadoop and Python

Categories: General

This blog was co-written by Nitin Madnani andJimmy Lin, both researchers at the University of Maryland, who are sharing their thoughts and experiences with Apache Hadoop and Python for improving Natural Language Processing techniques.

If you listen to analysts talk about complex data, they all agree, it’s growing, and faster than anything else before. Complex data can mean a lot of things,

Read more

Why Europe’s Largest Ad Targeting Platform Uses Apache Hadoop

Categories: General

Richard Hutton, CTO of nugg.ad, authored the following post about how and why his company uses Apache Hadoop.

nugg.ad operates Europe’s largest targeting platform. The company’s core business is to derive targeting recommendations from clicks and surveys. We measure these, store them in log files and later make sense of them all. In 2007 up until mid 2009 we used a classical data warehouse solution.

Read more