Customer Spotlight: Nokia’s Big Data Ecosystem Connects Cloudera, Teradata, Oracle, and Others

Categories: CDH Cloudera Manager Hadoop Use Case

As Cloudera’s keeper of customer stories, it’s dawned on me that others might benefit from the information I’ve spent the past year collecting: the many use cases and deployment patterns for Hadoop amongst our customer base.

This week I’d like to highlight Nokia, a global company that we’re all familiar with as a large mobile phone provider, and whose Senior Director of Analytics – Amy O’Connor – will be speaking at tomorrow’s Cloudera Sessions event in Boston.

Fun fact: Nokia has been in business for more than 150 years, starting with the production of paper in the 1800s. When I first met Amy O’Connor in early 2012, she explained to me that Nokia has always been in the business of transforming resources into useful products — from paper and rubber over a century ago, to the electronics and mobile devices we’re familiar with today.

One of the hottest resources right now is data, so it makes sense that Nokia has implemented a robust big data platform to milk that data for all it’s worth. With their 2007 acquisition of geospatial information systems (GIS) pioneer NAVTEQ, Nokia found itself sitting on a mountain of valuable information, and it set out on a mission: to help people navigate the physical world using digital data.

To do this, Nokia needed a technology infrastructure that would support the collection, storage and analysis of virtually unlimited data types and volumes. This is what led Nokia to Apache Hadoop and Cloudera.

Nokia has implemented a robust big data platform to milk that data for all it’s worth.

Nokia’s big data ecosystem consists of a centralized, petabyte-scale Hadoop cluster that is interconnected with a 100-TB Teradata enterprise data warehouse (EDW), numerous Oracle and MySQL data marts, and visualization technologies that allow Nokia’s 60,000+ users around the world tap into the massive data store. Multi-structured data is constantly being streamed into Hadoop from the relational systems, and hundreds of thousands of Scribe processes run every day to move data from, for example, servers in Singapore to a Hadoop cluster in the UK. Nokia is also a big user of Apache Sqoop and Apache HBase.

This ecosystem is helping Nokia achieve its goals by ingesting terabytes of data — from phones in use, services, log files, and other sources — and processing that data for market insights and other analytics. Specifically, Nokia is using the data to help people navigate the physical world by building 3D digital maps that incorporate traffic models that understand speed categories, recent speeds on roads, historical traffic patterns, elevation, ongoing events, video streams of the world, and more.

Want to learn more?