Cloudera Videos Hadoop World 2011: Building a Model of Organic Link Traffic
At bit.ly, we study behaviour on the internet by capturing clicks on shortened URLs. This link traffic comes in many forms, yet when studying human behaviour, we are only interested in 'organic' traffic: the traffic patterns caused by actual humans clicking on links that have been shared on the social web. This session will look at a model to extract and analyze these patterns by employing Python/NumPy, Streaming Hadoop, and machine learning. This model lets us separate the traffic we're interested in from the variety of patterns generated by inorganic entities following bit.ly links.