Presentations
July 3, 2012
Excerpt: Cloudera COO, Kirk Dunn, speaks to Cloudera's role in Hadoop coming of age.... more
The Origins of Cloudera
Dr. Amr Awahdallah
July 3, 2012
Excerpt: Cloudera CTO, Dr. Amr Awahdallah, explains the value he saw in Hadoop and how Cloudera came to be formed.... more
The Origins and Evolution of Hadoop
Doug Cutting
July 3, 2012
Excerpt: Doug Cutting, the creator of Apache Hadoop, explains the reasoning behind the creation of Hadoop and why Hadoo... more
The Standard for Hadoop in the Enterprise
Charles Zedlewski
July 3, 2012
Excerpt: Cloudera VP Product Charles Zedlewski explains Cloudera's products and services including CDH4, Cloudera Manag... more
July 3, 2012
Excerpt: A panel of Hadoop users and Hadoop ecosystem partners share their thoughts around Hadoop and share use cases.... more
Bringing Big Data Down to Size with Hadoop
Dr. Amr Awadallah
July 2, 2012
Excerpt: Dr. Amr Awahdallah, Cloudera Co-founder & CTO Apache Hadoop, an open-source platform, is increasingly gaining... more
Hadoop Summit 2012 | HDFS High Availability
Suresh Srinivas and Aaron Myers
June 19, 2012
Excerpt: The HDFS NameNode is a robust and reliable service as seen in practice in production at Yahoo and other custom... more
Hadoop Summit 2012, June 18, 2012
Excerpt: Branch-and-bound is a widely used technique for efficiently searching for solutions to combinatorial optimizat... more
Hadoop Summit 2012, June 18, 2012
Excerpt: Optimizing MapReduce job performance is often seen as something of a black art. In order to maximize performan... more
Hadoop Summit 2012 | HBase Consistency and Performance Improvements
Esteban Gutierrez and Gregory Chanan
Hadoop Summit 2012, June 18, 2012
Excerpt: The latest Apache HBase releases, 0.92 and 0.94, contain many improvements over prior releases in terms of cor... more
Hadoop Summit 2012, June 18, 2012
Excerpt: Processing of large data requires new approaches to data mining: low, close to linear, complexity and stream p... more
Hadoop Summit 2012 | A New Generation of Data Transfer Tools for Hadoop: Sqoop 2
Bilung Lee and Kathleen Ting
Hadoop Summit 2012, June 18, 2012
Excerpt: Apache Sqoop (incubating) was created to efficiently transfer big data between Hadoop related systems (such as... more
Hadoop Summit 2012 | Integrating Hadoop into the Enterprise
Jonathan Siedman
Hadoop Summit 2012, June 18, 2012
Excerpt: The power of Hadoop lies in its ability to help users cost effectively analyze all kinds of data. We are now s... more
Hadoop Summit 2012 | Improving HBase Availability and Repair
Jonathan Hsieh, Jeff Bean
Hadoop Summit 2012, June 18, 2012
Excerpt: Apache HBase is a rapidly-evolving random-access distributed data store built on top of Apache Hadoop’s HDFS... more
March 6, 2012
Excerpt: Differences between creating and running machine learning models in industry rather than academia and the curr... more
Avro Data | Washington DC HUG
Doug Cutting
Washington DC HUG, January 31, 2012
Excerpt: Avro is a common data format that's expressive, efficient and dynamic for the Apache Hadoop ecosystem.... more
Washington DC Hadoop User Group, January 31, 2012
Excerpt: HBase and Accumulo are both open-source, Apache 2.0 licensed implementations of Google's BigTable infrastructu... more
Mahout, CDH3, and Recommendation
Josh Patterson
Los Angeles, December 9, 2011
Excerpt: Presentation slides by Cloudera Sr Solution Architect, Josh Patterson, from the Los Angeles Hadoop User Group.... more
Hadoop Troubleshooting 101 – Japanese Version
Sho Shimauchi
Japan, December 1, 2011
Excerpt: This is a presentation given by Cloudera's Sho Shimauchi in Japan.... more
Boston, November 22, 2011
Excerpt: Omer Trajman highlights several use cases of advanced analytics and data processing with Hadoop. Hear Hadoop u... more
Starta Conference 2011, September 28, 2011
Excerpt: Cloudera, CTO, Dr. Amr Awadallah explains how Apache Hadoop is revolutionizing the business intelligence and d... more
Apache Sqoop: A Data Transfer Tool for Hadoop
Arvind Prabhakar
September 19, 2011
Excerpt: Apache Sqoop is a tool designed for efficiently transferring bulk data between Hadoop and structured datastore... more
NoSQL MeetUp: Apache Hadoop and HBase
Todd Lipcon
NoSQL MeetUp, August 10, 2011
Excerpt: What is Apache Hadoop, what does it do? How does Hadoop Work? What is Apache HBase? See examples of Hadoop and... more
Hadoop Security: Overview
Aaron T. Myers
Los Angeles Hadoop User Group, June 24, 2011
Excerpt: Cloudera Software Engineer, Aaron T. Myers, presented an overview of Apache Hadoop security at the Los Angeles... more
Apache Hadoop and HBase in the Real World
Joey Echeverria
Hadoop Consortia, June 24, 2011
Excerpt: Cloudera Solutions Architect, Joey Echeverria, explains Hadoop and HBases architecture and roles in the real w... more
Chicago Data Summit: Extending the Enterprise Data Warehouse with Hadoop
Jonathan Seidman & Robert Lancaster
Chicago Data Summit, April 26, 2011
Excerpt: Hadoop provides the ability to extract business intelligence from extremely large, heterogeneous data sets tha... more
Cloudera’s Distribution including Apache Hadoop & Cloudera Enterprise, Charles Zedlewski, VP Product, Cloudera
Charles Zedlewski
Chicago Data Summit, April 26, 2011
Excerpt: Charles Zedlewski explains the flexibility, scalability, and affordability of Cloudera's Distribution includin... more
Geo-based Content Processing Using HBase, Ravi Veeramachaneni, NAVTEQ
Ravi Veeramachaneni
Chicago Data Summit, April 26, 2011
Excerpt: Ravi Veeramachaneni explains the use of Cloudera's Distribution including Apache Hadoop at NAVTEQ, primarily h... more
Flume: An Introduction, Jonathan Hsieh
Jonathan Hsieh
Chicago Data Summit, April 26, 2011
Excerpt: Jonathan Hsieh Introduces Flume to the Apache Hadoop community. Flume is used to collect data, usually log fil... more
Extending the Enterprise Data Warehouse with Hadoop, Jonathan Seidman and Robert Lancaster, Orbitz
Jonathan Seidman, Robert Lancaster
Chicago Data Summit, April 26, 2011
Excerpt: Jonathan Seidman and Robert Lancaster share the use of Cloudera's Distribution including Apache Hadoop at Orbi... more
Apache HBase: An Introduction, Todd Lipcon
Todd Lipcon
Chicago Data Summit, April 26, 2011
Excerpt: Todd Lipcon explains what HBase is, the architecture of HBase, compares HBase with other technologies, explain... more
Chicago Data Summit, April 26, 2011
Excerpt: Apache HBase is an open source distributed data-store capable of managing billions of rows of semi-structured... more
Chicago Data Summit: Flume – An Introduction
Jonathan Hsieh
Chicago Data Summit, April 26, 2011
Excerpt: Flume is an open-source, distributed, streaming log collection system designed for ingesting large quantities... more
Chicago Data Summit: Cloudera’s Distribution including Apache Hadoop & Cloudera Enterprise
Charles Zedlewski
Chicago Data Summit, April 26, 2011
Excerpt: This presentation explains what's new in the recently released CDH3 and Enterprise 3.5 products. We'll review... more
Geo-based Content Processing Using HBase
Ravi Veeramachaneni
Chicago Data Summit, April 26, 2011
Excerpt: NAVTEQ uses Cloudera Distribution including Apache Hadoop (CDH) and HBase with Cloudera Enterprise support to... more
Chicago Data Summit, April 26, 2011
Excerpt: Hadoop is a new paradigm for data processing that scales near linearly to petabytes of data. Commodity hardwa... more
Data Processing with Hadoop: Scalable and Cost Effective, Doug Cutting, Apache Hadoop Co-founder
Doug Cutting
Chicago Data Summit, April 26, 2011
Excerpt: This is the keynote presentation from Chicago Data Summit. Doug Cutting takes us through the creation of Apach... more
April 5, 2011
Excerpt: Vantage Partners Leadership Expert, Steve O'Deegan, recently met with Michael Olson, CEO of Cloudera, to discu... more
March 25, 2011
Excerpt: Mike Olson, chief executive officer of Cloudera Inc., talks about the outlook for corporate use of data. Olson... more
March 24, 2011
Excerpt: Talk given at Large Scale Production Engineering meetup at Yahoo! on Thursday, March 24th 2011.... more
Santa Clara, March 24, 2011
Excerpt: This is the keynote presentation at EclipseCon 2011 featuring Todd Lipcon and Apache Hadoop. Todd explains the... more
Austin, February 24, 2011
Excerpt: Jon Hsieh covers multiple topics relating to Flume in this presentation (originally created for the Austin HUG... more
Avoiding Full GCs with MemStore-Local Allocation Buffers (Originally Presented at the HBase HUG)
Todd Lipcon
StumbleUpon, February 22, 2011
Excerpt: Cloudera's Todd Lipcon's presentation slides for the HBase HUG, "Avoiding Full GCs with MemStore-Local Allocat... more
Berlin, Germany, February 16, 2011
Excerpt: In this talk, Josh outlines some of the ways in which Nokia is using Apache Hadoop. He starts by having a quic... more
Apache Hadoop: Code Injection and Distributed Fault Injection
Konstantin Boudnik
EMC / Greenplum, February 11, 2011
Excerpt: Cloudera Software Engineer, Hadoop committer and Pig contributor, Konstantin Boudnik, presented at EMC / Green... more
Apache Hadoop: Code Injection and Distributed Fault Injection
Konstantin Boudnik
EMC / Greenplum, February 11, 2011
Excerpt: Cloudera Software Engineer, Hadoop committer and Pig contributor, Konstantin Boudnik, presented at EMC / Green... more
Las Vegas, Nevada, January 27, 2011
Excerpt: Cloudera founder, Dr. Amr Awadallah, gave a presentation titled, Apache Hadoop in the Enterprise at MicroStrat... more
Orbitz Ideas: Jeff Hammerbacher on Evolving A New Analytical Platform
Jeff Hammerbacher
Chicago, January 12, 2011
Excerpt: Handling massive amounts of data as is done at places like Facebook, eBay, and our very own Orbitz is no simpl... more
New York City, October 12, 2010
Excerpt: Fuzzy Table: Distributed Fuzzy Matching Database Presenter: Lalit Kapoor, Booz Allen Hamilton This presentati... more
Hadoop World 2010: Cloudera Roadmap & Release Plan
Charles Zedlewski, Cloudera
New York City, October 12, 2010
Excerpt: Cloudera: Roadmap & Release Plan Charles Zedlewski Cloudera... more
New York City, October 12, 2010
Excerpt: Apache Hadoop in the Enterprise slide deck from Yahoo!'s Arun Murthy at Hadoop World 2010.... more
Hadoop World 2010: Flume: Reliable Distributed Streaming Log Collection – Jon Hsieh, Cloudera
Jon Hsieh
New York City, October 12, 2010
Excerpt: Cloudera's Jon Hsieh on Flume: Reliable Distributed Streaming Log Collection at Hadoop World 2010... more
Hadoop World 2010: Membase, ShareThis, AOL – Better Ad, Offer and Content Targeting
James Phillips, Manu Mukerji, Ben Jackson
New York City, October 12, 2010
Excerpt: James Phillips, Manu Mukerji, Ben Jackson from Membase, ShareThis, AOL on Better Ad, Offer and Content Targeti... more
Hadoop World 2010: Quest – Exchanging Data with the Government, Guy Harrison
Guy Harrison, Quest
New York City, October 12, 2010
Excerpt: Quest's Guy Harrison on Exchanging data with the Elephant: Connecting Hadoop and an RDBMS using Sqoop from Had... more
Hadoop World 2010: The Explorys Network, Doug Meil
Doug Mail, Explorys
New York City, October 12, 2010
Excerpt: Explorys' Doug Meil explains HBase and Hadoop in the Explorys Network.... more
New York City, October 12, 2010
Excerpt: Hadoop World 2010 keynote presentation from Mike Olson, CEO of Cloudera. Title: "Hadoop: What's Next?" Quot... more
Hadoop World 2010: Business Analyst Tools & Applications for Hadoop
Amr Awadallah, Cloudera, Inc.
New York City, October 12, 2010
Excerpt: Amr Awadallah from Cloudera, Inc. presents at Hadoop World 2010... more
Hadoop World 2010: StumbleUpon Advertising Platform using HBase and Hadoop
Jean-Daniel Cryans, StumbleUpon
New York City, October 12, 2010
Excerpt: StumbleUpon's Jean-Daniel Cryans on HBase usage for advertising at Hadoop World 2010... more
Hadoop World 2010: Information Systems in an Entity-Centric World
Tim Estes, Digital Reasoning
New York City, October 12, 2010
Excerpt: Digital Reasoning's Tim Estes on Information Systems in an Entity-Centric World at Hadoop World 2010... more
Hadoop World 2010: Hadoop at eBay
Anil Madan, eBay
New York City, October 12, 2010
Excerpt: EBay's Anil Madan on Hadoop usage at eBay from Hadoop World 2010... more
Hadoop World 2010: HBase/Hadoop in the Explorys Network
Doug Meil, Explorys
New York City, October 12, 2010
Excerpt: Explorys's Doug Meil on HBase/Hadoop in the Explorys Network from Hadoop World 2010... more
Hadoop World 2010: Managing Derivatives Data with Hadoop
Joshua Bennett, CME
New York City, October 12, 2010
Excerpt: CME's Joshua Bennett on managing derivatives data with Hadoop at Hadoop World 2010... more
Hadoop World 2010: Flume: Reliable Distributed Streaming Log Collection
Jon Hsieh, Cloudera
New York City, October 12, 2010
Excerpt: Cloudera's Jon Hsieh on Flume: Reliable Distributed Streaming Log Collection at Hadoop World 2010... more
Hadoop World 2010: Large Scale Web Analytics utilizing Hadoop and Aster Data
Duckworth, ComScore
New York City, October 12, 2010
Excerpt: ComScore's Duckworth on Large Scale Web Analytics utilizing Hadoop and Aster Data at Hadoop World 2010... more
Hadoop World 2010: Hadoop and Hive at Orbitz
Jonathan Seidman, Orbitz
New York City, October 12, 2010
Excerpt: Orbitz's Jonathan Seidman on Hadoop and Hive at Orbitz from Hadoop World 2010... more
Hadoop World 2010: Scaling from 5 to 500 nodes: Best Practices and Real World Experience
Phil Day, HP
New York City, October 12, 2010
Excerpt: HP's Phil Day on Scaling from 5 to 500 nodes: Best Practices and Real World Experience from Hadoop World 2010... more
Hadoop World 2010: GE Sentiment Analysis: Powering Innovative Marketing Analysis Via Hadoop
Linden Hillenbrand, GE
New York City, October 12, 2010
Excerpt: GE's Linden Hillenbrand on GE Sentiment Analysis: Powering Innovative Marketing Analysis Via Hadoop from Hadoo... more
Hadoop World 2010: Hadoop Analytics: More Methods, Less Madness
Shevek, Karmasphere
New York City, October 12, 2010
Excerpt: Karmasphere's Shevek on Hadoop Analytics: More Methods, Less Madness from Hadoop World 2010... more
Hadoop World 2010: Migrating to CDH and Streaming Data Warehouse Loading
Christopher Gillett, Visible Measures
New York City, October 12, 2010
Excerpt: Visible Measures' Christopher Gillett on Migrating to CDH and Streaming Data Warehouse Loading from Hadoop Wor... more
Hadoop World 2010: Productionizing Hadoop: Lessons Learned
Eric Sammer, Cloudera
New York City, October 12, 2010
Excerpt: Cloudera's Eric Sammer on Productionizing Hadoop: Lessons Learned from Hadoop World 2010... more
Hadoop World 2010: Top 10 Lessons Learned from Deploying Hadoop and HBase
Rod Cope, OpenLogic
New York City, October 12, 2010
Excerpt: OpenLogic's Rod Cope on Top 10 Lessons Learned from Deploying Hadoop and HBase from Hadoop World 2010... more
Hadoop World 2010: Exchanging data with the Elephant: Connecting Hadoop and an RDBMS using Sqoop
Guy Harrison, Quest
New York City, October 12, 2010
Excerpt: Quest's Guy Harrison on Exchanging data with the Elephant: Connecting Hadoop and an RDBMS using Sqoop from Had... more
Hadoop World 2010: Voice over IP: Studying Traffice Characteristics for Quality of Service using R and Hadoop
Saptarshi Guha, Purdue University
New York City, October 12, 2010
Excerpt: Purdue University's Saptarshi Guha on Voice over IP: Studying Traffic Characteristics for Quality of Service u... more
Hadoop World 2010: Search Analytics with Flume & HBase
Otis Gospodnetic, Sematext
New York City, October 12, 2010
Excerpt: Sematext's Otis Gospodnetic on Search Analytics with Flume & HBase from Hadoop World 2010... more
Hadoop World 2010: SIFTing Clouds
Burkhardt, SRA
New York City, October 12, 2010
Excerpt: SRA's Burkhardt on SIFTing Clouds from Hadoop World 2010... more
Hadoop World 2010: MapReduce and Parallel Database Systems: Complementary or Competitive Technology?
Daniel Abadi, Yale University
New York City, October 12, 2010
Excerpt: Yale University's Daniel Abadi on MapReduce and Parallel Database Systems: Complementary or Competitive Techno... more
Hadoop World 2010: Apache ZooKeeper at Yahoo!
Mahadev Konar, Yahoo!
New York City, October 12, 2010
Excerpt: Yahoo!'s Mahadev Konar on Apache ZooKeeper at Yahoo! from Hadoop World 2010... more
Hadoop World 2010: HBase in Production at Facebook
Jonathan Gray, Facebook
New York City, October 12, 2010
Excerpt: Facebook's Jonathan Gray on HBase in Production at Facebook from Hadoop World 2010... more
Hadoop World 2010: Hadoop Based Intelligent Text Processing System
Rao & Uppuluri, AOL
New York City, October 12, 2010
Excerpt: AOL's Rao & Uppuluri on Hadoop Based Intelligent Text Processing System from Hadoop World 2010... more
Hadoop World 2010: Hadoop at Yahoo!: Ready for Business
Arun Murthy, Yahoo!
New York City, October 12, 2010
Excerpt: Yahoo!'s Arun Murthy on Hadoop at Yahoo!: Ready for Business from Hadoop World 2010... more
Hadoop World 2010: Multi Channel Behavioral Analytics
Stephen Groschupf, Datameer
New York City, October 12, 2010
Excerpt: Datameer's Stephen Groschupf on Multi Channel Behavioral Analytics from Hadoop World 2010... more
Hadoop World 2010: Open Cloud Consortium Image Processing for Disaster Relief
Andrew Levine, TexelTek
New York City, October 12, 2010
Excerpt: TexelTek's Andrew Levine on Open Cloud Consortium Image Processing for Disaster Relief from Hadoop World 2010... more
Hadoop World 2010: Techniques to use Hadoop with Scientific Data
Jerry Rolia, HP
New York City, October 12, 2010
Excerpt: HP's Jerry Rolia on Techniques to use Hadoop with Scientific Data from Hadoop World 2010... more
Hadoop World 2010: The Hadoop Ecosystem at Twitter
Kevin Weil, Twitter
New York City, October 12, 2010
Excerpt: Twitter 's Kevin Weil on The Hadoop Ecosystem at Twitter from Hadoop World 2010... more
Hadoop World 2010: AOL’s Data Layer
Ian Holsman, AOL
New York City, October 12, 2010
Excerpt: AOL's Ian Holsman on AOL's Data Layer from Hadoop World 2010... more
Hadoop World 2010: ScaleIn Collecting and Querying log data in near real-time
Anurag Phadke, Mozilla
New York City, October 12, 2010
Excerpt: Mozilla's Anurag Phadke on ScaleIn Collecting and Querying log data in near real-time from Hadoop World 2010... more
Hadoop World 2010: SHARD Triple-Store: Tools for Web-Scale SemWeb
Kurt Rohloff, BBN Technologies
New York City, October 12, 2010
Excerpt: BBN Technologies's Kurt Rohloff on SHARD Triple-Store: Tools for Web-Scale SemWeb from Hadoop World 2010... more
Hadoop World 2010: Optimizing Hadoop Workloads
Nurcan Coskan, Intel
New York City, October 12, 2010
Excerpt: Intel's Nurcan Coskan on Optimizing Hadoop Workloads from Hadoop World 2010... more
Hadoop World 2010: RDBMS and Hadoop: A Powerful Combination
Jacque Istok, GreenPlum
New York City, October 12, 2010
Excerpt: GreenPlum's Jacque Istok on RDBMS and Hadoop: A Powerful Combination from Hadoop World 2010... more
Hadoop World 2010: Putting Analytics in Big Data Analytics
Jake Cornelius
New York City, October 12, 2010
Excerpt: Pentaho's Jake Cornelius on Putting Analytics in Big Data Analytics from Hadoop World 2010... more
Hadoop World 2010: Hadoop: What’s Next?
Mike Olson, Cloudera
New York City, October 12, 2010
Excerpt: Cloudera's Mike Olson on Hadoop: What's Next? Keynote Presentation from Hadoop World 2010... more
Hadoop World 2010: Hadoop – Lessons Learned from Deploying Enterprise Clusters
Shinishi Yamada, NTT Data
New York City, October 12, 2010
Excerpt: NTT Data's Shinishi Yamada on Hadoop - Lessons Learned from Deploying Enterprise Clusters from Hadoop World 20... more
New York City, October 12, 2010
Excerpt: HP Labs's Jerome Rolia on Techniques to Use Hadoop with Scientific Data at Hadoop World 2010... more
Hadoop World 2010: HBase in Production at Facebook
Jonathan Gray
New York City, October 12, 2010
Excerpt: Facebook's Jonathan Gray on HBase in Production at Facebook at Hadoop World 2010... more
Hadoop World 2010: AOL’s Data Layer
Ian Holsman
New York City, October 12, 2010
Excerpt: AOL's Ian Holsman on AOLs Data Layer at Hadoop World 2010... more
Hadoop World 2010: Putting Analytics in Big Data Analysis
Jake Cornelius
New York City, October 12, 2010
Excerpt: Dir. Of Product Management's Jake Cornelius on Putting Analytics in Big Data Analysis at Hadoop World 2010... more
Hadoop World 2010: Making Hadoop Security Work in Your IT Environment
Aaron T. Myers Todd Lipcon
New York City, October 12, 2010
Excerpt: Cloudera's Aaron T. Myers Todd Lipcon on Making Hadoop Security Work in Your IT Environment at Hadoop World 20... more
New York City, October 12, 2010
Excerpt: TexelTek's Andrew Levine on Hadoop Image Processing for Disaster Relief at Hadoop World 2010... more
Hadoop World 2010: Migrating to CDH and Streaming Data Warehouse Loading
Christopher Gillett
New York City, October 12, 2010
Excerpt: Visible Measure's Christopher Gillett on Migrating to CDH and Streaming Data Warehouse Loading at Hadoop World... more
Hadoop World 2010: Sentiment Analysis Powered by Hadoop
Linden Hillenbrand and Li Chen
New York City, October 12, 2010
Excerpt: GE's Linden Hillenbrand and Li Chen on Sentiment Analysis Powered by Hadoop at Hadoop World 2010... more
Hadoop World 2010: Optimizing Hadoop Workloads
Nurcan Coskun
New York City, October 12, 2010
Excerpt: Intel Software and Services Group's Nurcan Coskun on Optimizing Hadoop Workloads at Hadoop World 2010... more
New York City, October 12, 2010
Excerpt: BBN's Kurt Rohloff on SHARD: Storing and Querying Large-Scale at Hadoop World 2010... more
Hadoop World 2010: SIFTing Clouds
Paul Burkhardt
New York City, October 12, 2010
Excerpt: SRA International's Paul Burkhardt on SIFTing Clouds at Hadoop World 2010... more
Hadoop World 2010: Intelligent Text Information Processing System
Vaijanath Rao and Rohini Uppuluri
New York City, October 12, 2010
Excerpt: AOL's Vaijanath Rao and Rohini Uppuluri on Intelligent Text Information Processing System at Hadoop World 2010... more
New York City, October 12, 2010
Excerpt: Twitter's Kevin Weil on The Hadoop Ecosystem at Twitter at Hadoop World 2010... more
Hadoop World 2010: Hadoop and Hive at Orbitz
Jonathan Seidman
New York City, October 12, 2010
Excerpt: Orbitz's Jonathan Seidman on Hadoop and Hive at Orbitz at Hadoop World 2010... more
New York City, October 12, 2010
Excerpt: HP's Phil Day on Hadoop: Best Practices and Real Experience Going from 5 to 500 Nodes at Hadoop World 2010... more
Hadoop World 2010: Hadoop at eBay
Anil Madan
New York City, October 12, 2010
Excerpt: EBay's Anil Madan on Hadoop at eBay at Hadoop World 2010... more
Hadoop World 2010: A Fireside Chat: Using Hadoop to Tackle Big Data at comScore
Will Duckworth and Martin Hall
New York City, October 12, 2010
Excerpt: ComScore's Will Duckworth and Karmasphere's Martin Hall on A Fireside Chat: Using Hadoop to Tackle Big Data at... more
New York City, October 12, 2010
Excerpt: Karmasphere's Shevek Mankin on Hadoop Analytics: More Methods, Less Madness at Hadoop World 2010... more
Hadoop World 2010: Mixing Real-Time Needs and Batch Processing: How StumbleUpon Built an Advertising Platform using HBase and Ha
Jean-Daniel Cryans
New York City, October 12, 2010
Excerpt: StumbleUpon's Jean-Daniel Cryans on Mixing Real-Time Needs and Batch Processing: How StumbleUpon Built an Adve... more
Hadoop World 2010: RDBMS and Hadoop: A Powerful Coexistence, Jacque Istok, Greenplum, now part of EMC Corp.
Jacque Istok
New York City, October 12, 2010
Excerpt: Greenplum's Jacque Istok on RDBMS and Hadoop: A Powerful Coexistence at Hadoop World 2010... more
Hadoop World 2010: Hadoop Security
Todd Lipcon, Aaron Myers, Cloudera
New York City, October 12, 2010
Excerpt: Cloudera's Aaron Myers and Todd Lipcon on Hadoop Security from Hadoop World 2010... more
New York City, October 12, 2010
Excerpt: Purdue University's Saptarshi Guha on Using R and Hadoop to Analyze VoIP Network Data for QoS at Hadoop World... more
Hadoop World 2010: Multi-Channel Behavioral Analytics
Stefan Groschupf
New York City, October 12, 2010
Excerpt: Datameer's Stefan Groschupf on Multi-Channel Behavioral Analytics at Hadoop World 2010... more
Hadoop World 2010: Apache ZooKeeper at Yahoo!
Mahadev Konar
New York City, October 12, 2010
Excerpt: Yahoo's Mahadev Konar on Apache ZooKeeper at Yahoo! at Hadoop World 2010... more
Hadoop World 2010: Business Analyst Tools for Hadoop
Amr Awadallah
New York City, October 12, 2010
Excerpt: Cloudera's Amr Awadallah on Business Analyst Tools for Hadoop at Hadoop World 2010... more
Hadoop World 2010: Search Analytics with Flume and HBase
Otis Gospodnetic
New York City, October 12, 2010
Excerpt: Sematext's Otis Gospodnetic on Search Analytics with Flume and HBase at Hadoop World 2010... more
New York City, October 12, 2010
Excerpt: OpenLogic's Rod Cope on Top 10 Lessons Learned from Deploying Hadoop and HBase at Hadoop World 2010... more
Hadoop World 2010: Hadoop at Yahoo! Ready for Business
Arun C. Murthy
New York City, October 12, 2010
Excerpt: Yahoo!'s Arun C. Murthy on Hadoop at Yahoo! Ready for Business at Hadoop World 2010... more
Hadoop World 2010: Closing Remarks
Mike Olson
New York City, October 12, 2010
Excerpt: Cloudera's Mike Olson on Closing Remarks at Hadoop World 2010... more
New York City, October 12, 2010
Excerpt: NTT Data's Shinichi Yamada on Hadoop - Lessons Learned from Enterprise Clusters at Hadoop World 2010... more
New York City, October 12, 2010
Excerpt: Tim O'Reilly's Keynote Address at Hadoop World 2010... more
New York City, October 12, 2010
Excerpt: Mozilla Corporation's Anurag Phadke on ScaleIn Collecting and Querying Log Data in Near Real-time at Hadoop Wo... more
Hadoop World 2010: Better Ad, Offer and Content Targeting with Membase and Hadoop
James Phillips, Manu Mukerji, Ben Jackson from Membase, ShareThis, AOL
New York City, October 12, 2010
Excerpt: James Phillips, Manu Mukerji, Ben Jackson from Membase, ShareThis, AOL on Better Ad, Offer and Content Targeti... more
New York City, October 12, 2010
Excerpt: Cloudera's Mike Olson on Hadoop World 2010 Keynote at Hadoop World 2010... more
New York City, October 12, 2010
Excerpt: Infochimps's Philip Kromer on Millionfold Mashups at Hadoop World 2010... more
New York City, October 12, 2010
Excerpt: Cloudera's Eric Sammer on Productionizing Hadoop: Lessons Learned at Hadoop World 2010... more
New York City, October 12, 2010
Excerpt: Fuzzy Table: Distributed Fuzzy Matching Database presented by Lalit Kapoor of Booz Allen Hamilton at Hadoop Wo... more
August 30, 2010
Excerpt: Cloudera's Josh Paterson presented how Hadoop is used as the platform for smartgrid technologies at the Tennes... more
Twitter, August 18, 2010
Excerpt: Dmitriy Ryaboy, a Twitter Analytics Engineer and a former Cloudera Intern, briefly explains to the Hadoop comm... more
Digg, August 17, 2010
Excerpt: Doug Cutting, a hadoop founder and part of Cloudera's management team, speaks at Digg. This video is about the... more
Flume: Reliable, Distributed Streaming Log Collection
Jonathan Hsieh, Henry Robinson, Patrick Hunt
July 18, 2010
Excerpt: The presentation gives an overview of the open source project called Flume. Flume provides a way to manage the... more
June 17, 2010
Excerpt: Jeff Hammerbacher shares his thoughts in a session entitled "Experiences Evolving a New Analytical Platform: W... more
June 6, 2010
Excerpt: Todd Lipcon gives an overview of Apache Hadoop for the Glue Conference in May 2010... more
Top 10 Tips & Tricks for Hadoop Success
Omer Trajman & Alex Loddengaard
Palo Alto, June 2, 2010
Excerpt: Many of you may be just starting out with Hadoop and are looking to avoid the mistakes made by others while so... more
Hadoop World 2009: Hadoop Applications at Yahoo! – Eric Baldeschwieler
Eric Baldeschwieler, VP Hadoop Software Development at Yahoo!
New York City, October 2, 2009
Hadoop World 2009: Protein Alignment – Paul Brown
Paul Brown, Booz Allen
New York City, October 2, 2009
Hadoop World 2009: Hadoop + Clojure – Tim Dysinger and Stuart Sierra
Tim Dysinger, VP of Engineering, Sonian Networks and Stuart Sierra, Asst. Director, Program on Law & Technology at Columbia Univ
New York City, October 2, 2009
Hadoop World 2009: Welcome to Hadoop World – Christophe Bisciglia
Christophe Bisciglia, Cloudera, Inc.
New York City, October 2, 2009
Hadoop World 2009: Making Hadoop Easy on Amazon Web Services – Peter Sirota
Peter Sirota, GM, Elastic MapReduce, Amazon.com
New York City, October 2, 2009
Hadoop World 2009: Enabling ad-hoc Analytics at Web Scale – Rod Smith
Rod Smith, VP Internet Emerging Technology, IBM
New York City, October 2, 2009
Hadoop World 2009: Rethinking the Data Warehouse with Hadoop and Hive – Ashish Thusoo
Ashish Thusoo, Facebook
New York City, October 2, 2009
Hadoop World 2009 Counting and Clustering and other Data Tricks – Derek Gottfrid
Derek Gottfrid, Senior Software Architect and Product Technologist, The New York Times
New York City, October 2, 2009
Hadoop World 2009: Closing Address from Cloudera CEO, Mike Olson
Mike Olson, CEO, Cloudera, Inc.
New York City, October 2, 2009
Hadoop World 2009: Analytics and Reporting – Neel Sundaresan
Neel Sundaresan, Sr. Director and Head, eBay Research Labs, eBay
New York City, October 2, 2009
Hadoop World 2009: Real-Time Business Intelligence – Bradford Stephens
Bradford Stephens, Lead Software Engineer, Visible Technologies
New York City, October 2, 2009
Hadoop World 2009: Cross Data Center Logs Processing – Stu Hood
Stu Hood, Rackspace
New York City, October 2, 2009
Hadoop World 2009: Matchmaking in the Cloud – Ben Hardy
Ben Hardy, Senior Software Engineer, eHarmony.com
New York City, October 2, 2009
Hadoop World 2009: Understanding Natural Language – Charles Ward and Karthik Balaji
Charles Ward and Karthik Balaji, General Sentiment
New York City, October 2, 2009
Hadoop World 2009: Hadoop for Bioinformatics – Deepak Singh
Deepak Singh, Business Development Manager - Amazon EC2 at Amazon Web Services
New York City, October 2, 2009
Hadoop World 2009: Next Steps for Hadoop – Doug Cutting
Doug Cutting, Cloudera, Inc.
New York City, October 2, 2009
Hadoop World 2009: Cloudera’s Distribution for Hadoop – Todd Lipcon
Todd Lipcon, Software Engineer at Cloudera, Inc.
New York City, October 2, 2009
Hadoop World 2009: Hadoop Development at Facebook: Hive and HDFS – Dhruba Borthakur and Zheng Shao
Dhruba Borthakur and Zheng Shao, Facebook
New York City, October 2, 2009
Hadoop World 2009: Cool Development Projects at Yahoo!: Automatic Tuning and Social Graph Analysis – Viraj Bhat and Jake Hofman
Viraj Bhat, Grid Solutions Engineer Yahoo! and Jake Hofman, Research Scientist at Yahoo!
New York City, October 2, 2009
Hadoop World 2009: Security and API Compatibility – Owen O’Malley
Owen O'Malley, Yahoo!
New York City, October 2, 2009
Hadoop World 2009: Practical HBase: Getting the most from your HBase install – Jonathan Gray and Ryan Rawson
Jonathan Gray, Streamy.com and Ryan Rawson, StumbleUpon.com
New York City, October 2, 2009
Hadoop World 2009: Monitoring Best Practices – Ed Capriolo
Ed Capriolo, About.com
New York City, October 2, 2009
Hadoop World 2009: Production Deep Dive with High Availability – Alex Dorman and Paul George
Alex Dorman, VP, Engineering, ContextWeb and Paul George, Senior Systems Architect, ContextWeb
New York City, October 2, 2009
Hadoop World 2009: Low Latency, Random Reads from HDFS – Jay Booth
Jay Booth, Elastic Platforms
New York City, October 2, 2009
Hadoop World 2009: Fingerpointing: Sourcing Performance Issues – Priya Narasimhan
Priya Narasimhan, Associate Professor, Electrical & Computer Engineering at Carnegie Mellon University
New York City, October 2, 2009
Hadoop World 2009: HadoopDB – Azza Abouzeid & Kamil Bajda-Pawlikowski
Azza Abouzeid & Kamil Bajda-Pawlikowski, Department of Computer Science, Yale University
New York City, October 2, 2009
Hadoop World 2009: Sqoop: Database Import for Hadoop – Aaron Kimball
Aaron Kimball, Cloudera, Inc.
New York City, October 2, 2009
Hadoop World 2009: Building Data Intensive Apps with Hadoop and EC2
Peter Skomoroch, Founder at Data Wrangling LLC
New York City, October 2, 2009
Hadoop World 2009: Terapot: Email Archiving with Hadoop – Jaesun Han
Jaesun Han, Founder & CEO, NexR, Korea
New York City, October 2, 2009
Hadoop World 2009: MapReduce over Tahoe – a Least-Authority Encrypted Distributed Filesystem – Aaron Cordova
Aaron Cordova, Booz Allen Hamilton
New York City, October 2, 2009
Hadoop World 2009: Hadoop Based Data Mining Platform for the Telecom Industry – Feng Cao
Feng Cao, China Mobile
New York City, October 2, 2009
Hadoop World 2009: Hadoop + Vertica – Omer Trajman
Omer Trajman, Senior Director for Cloud and Virtualization, Vertica Systems
New York City, October 2, 2009
Hadoop World 2009: Optimizing Hadoop Deployments – Nurcan Coskun
Nurcan Coskun, Intel
New York City, October 2, 2009
Hadoop World 2009: What’s new from Cloudera – Jeff Hammerbacher
Jeff Hammerbacher, Chief Scientist and VP of Product, Cloudera, Inc.
New York City, October 2, 2009