Cloudera Videos Hadoop World 2011: Replacing RDB/DW with Hadoop and Hive for Telco Big Data
This session will focus on the challenges of replacing existing Relational DataBase and Data Warehouse technologies with Open Source components. Jason Han will base his presentation on his experience migrating Korea Telecom (KTs) CDR data from Oracle to Hadoop, which required converting many Oracle SQL queries to Hive HQL queries. He will cover the differences between SQL and HQL; the implementation of Oracles basic/analytics functions with MapReduce; the use of Sqoop for bulk loading RDB data into Hadoop; and the use of Apache Flume for collecting fast-streamed CDR data. Hell also discuss Lucene and ElasticSearch for near-realtime distributed indexing and searching. Youll learn tips for migrating existing enterprise big data to open source, and gain insight into whether this strategy is suitable for your own data.
To view the resource, please fill out this registration form.
We believe strongly in user privacy.