Customers

Our customers have been successful in using Cloudera’s Distribution Including Apache Hadoop in production to help store, manage and analyze all of their data. Cloudera’s products and services have been proven to be highly valuable in some of the most innovative enterprises and complex data environments across different industries.

If you’re using CDH or Cloudera Enterprise, we’d like to share your story here! Contact us.
  • Adconion

    Adconion uses Hadoop for two kinds of computing: the first is a near-real-time feedback loop that runs constantly on Elastic MapReduce (EMR), and the second is tracking ad log files. With close to 300 million impressions a day, they compress approximately 60 GB of data every day.

  • AdGooroo

    AdGooroo is a leading provider of advertising intelligence to internet marketers. Its proprietary technology tracks advertising activity in any given industry, empowering sophisticated agencies and advertisers with information on competitors’ search advertising, display advertising, and link building strategies.

  • Aggregate Knowledge

    To accommodate a large customer workload, Aggregate Knowledge spun up a CDH cluster with Amazon’s EC2. They benefited from the sophistication of CDH and the elasticity of EC2.

  • AOL Advertising

    “AOL Advertising is working with Cloudera to leverage their Hadoop expertise in all key areas, training, consulting and support, all as part of its efforts to understand and leverage large data volumes aggregated from diverse sources for reporting, performance optimization and targeting. The combination of Cloudera’s expertise and Cloudera Enterprise has assisted AOL in improving Hadoop management and monitoring.”

    » Watch the replay: How AOL Accelerates Targeting Decisions with Hadoop and Membase

  • Apollo Group, Inc.

    “Apollo Group, Inc., through its subsidiaries, University of Phoenix, College for Financial Planning, Insight Schools, Inc., Institute for Professional Development, and Western International University, is a leading provider of higher education programs for working adults. At Apollo, we are building a data infrastructure for academic analytics and research exploration based on Hadoop and other open source technology. Cloudera’s support and training is critical to our success.”

  • CBS Interactive

    CBS Interactive leverages Cloudera to optimize the content that is displayed on their web pages to each user based on what they’re currently reading, and they are able to re-optimize page layouts for every user segment on an hourly basis. CBS Interactive relies on Cloudera Manager to keep their Hadoop cluster running efficiently.

    » Watch the webinar replay: How CBS Interactive uses Cloudera Manager to effectively manage their Hadoop cluster

  • Concurrent Computers

    “We leverage Cloudera and Hadoop to capture census level data measurement. Hadoop is unparalleled in scalability. We process billions of records a day and need a solution that does not incorporate an equivalent cost.”

    » Read the ComputerWorld article

  • DataSift

    CDH performs the Big Data heavy lifting to help deliver DataSift’s Historics, a cloud-computing platform that enables entrepreneurs and enterprises to extract business insights from historical public tweets.

    » Watch the case study video
    » View the press release

  • The Walt Disney Company

    Disney is a massive company, but when it comes to its big data platform, the entertainment conglomerate looks a lot like a startup. Kind of, that is. By the sheer power of its will (and ingenuity), a small team has been able to craft a large custom platform out of Hadoop, NoSQL databases and other open-source technologies.

    » How Disney built a big data platform on a startup budget

  • eBay

    eBay marketplace has been working hard on its next-generation search infrastructure and software system, code-named Cassini. The new search engine processes over 250 million search queries and serves more than 2 billion page views each day. Its indexing platform is based on Apache Hadoop and Apache HBase.

    » View the HBaseCon 2012 presentation
    » View the Hadoop World 2011 presentation
    » View the Hadoop World 2010 presentation

  • Experian

    Experian Marketing Services represents a global suite of products and platforms that help marketers connect to customers. In this video, several members of the Experian team share their perspectives on the Experian Marketing Services use case for Hadoop, their reasons for partnering with Cloudera, and the impact on the business that has resulted from Cloudera-empowered gains in operational efficiency.

    » Watch the case study video

  • Explorys Medical

    Explorys Medical uses CDH and HBase at the core of its medical informatics platform, which enables subscribers to search and analyze patient populations, treatment protocols, and clinical outcomes. Explorys Medical provides uniquely powerful and HIPAA-compliant solutions for accelerating life-saving discovery.

    » View Explorys Medical case study video
    » View Explorys Medical case study PDF

  • Groupon

    Groupon features a daily deal on the best stuff to do, see, eat, and buy in more than 300 markets and 35 countries, and soon beyond. We have about 1,000 people working in our Chicago headquarters, a growing office in Palo Alto, CA, as well as regional offices in Europe and Latin America and local account executives in many cities.

    » View the press release

  • Huron Consulting Group

    Huron Consulting Group uses Hadoop as a data warehouse for document metadata. By analyzing these metadata files across projects, they gain insights and metrics useful for improvement.

  • JiWire

    JiWire uses Cloudera’s Distribution including Apache Hadoop (CDH) to allow for a massively scalable location-based advertising platform. JiWire’s platform enables advertisers to identify and deliver ads to audience segments based on a person’s physical location while taking the venue type and brand into account.

  • JPMorgan Chase & Co.

    JPMorgan Chase & Co. (NYSE: JPM) is a leading global financial services firm with assets of $2 trillion and operations in more than 60 countries. The firm is a leader in investment banking, financial services for consumers, small business and commercial banking, financial transaction processing, asset management, and private equity.

    » View the press release

  • King.com

    “With Hadoop we can manage vast amounts of data and create bespoke solutions for our analytics. Importantly, with the Cloudera solution we have been able to plug in multiple data feeds ranging from daily currency exchange rates from the European Central Bank; multiple metadata feeds; and game, advertising and platform servers log files on an hourly basis. With over 3 billion games being played every month we needed a scalable, robust and future-proof solution. Cloudera has provided us with the ability to use Hadoop to manage the exponential growth in our business as well as the exponential increase in complexity.”

    » View the press release

  • Klout

    Klout measures influence across the social web and uses Cloudera’s Distribution including Apache Hadoop (CDH3) to store, process, and analyze real-time social media data streams. Klout’s platform analyzes signals as they travel through the social web and performs NLP, machine learning, and other analysis to measure topical and broad-based influence.

  • Lyris

    “As digital marketing evolves into an increasingly complex and data-driven environment, marketing automation platforms fueled by big data are crucial for understanding and effectively connecting with customers. By choosing Cloudera and implementing CDH, Lyris is capitalizing on the vast opportunities inherent in the marriage of big data architecture and digital marketing.”

  • Mobile Posse

    Mobile Posse, Inc. is the leading provider of next-generation mobile advertising and mCRM solutions for the active home screen. Using proprietary patent-pending technology, Mobile Posse enables advertisers, content providers, and wireless carriers to proactively reach consumers through the prime real estate on the mobile phone.

    » View the Mobile Posse case study PDF

  • Morgan Stanley

    Cloudera has been awarded Morgan Stanley’s distinguished ‘CTO Award for Innovation’. Each year, Morgan Stanley selects one technology vendor to receive its ‘CTO Award for Innovation,’ honoring technology solutions that have been deemed innovative and have made a significant impact on Morgan Stanley’s business.

    » View the press release

  • NAVTEQ

    NAVTEQ uses Cloudera’s Distribution including Apache Hadoop (CDH) with Cloudera enterprise support for various distributed storage and processing needs. CDH use at NAVTEQ includes core Hadoop and HBase.

  • NetApp

    To better support its customers, NetApp offers AutoSupport, an integrated and efficient monitoring and reporting technology that constantly checks the health of NetApp systems. AutoSupport needed a Big Data storage and analytic solution that would allow it to: store, manage, and analyze increasing volumes of unstructured data; gain insights from these large, complex datasets; and scale for continued growth. The team concluded that the NetApp Open Solution for Hadoop, comprised of NetApp storage coupled with Cloudera Enterprise, surpassed the other solutions in its ability to monitor customer solutions more deeply and to execute previously impossible data processing jobs and complex queries.

    » View the NetApp case study PDF

  • Nokia

    Nokia’s goal is to bring the world to the third phase of mobility: leveraging data to make it easier to navigate the physical world. Nokia relies on a technology ecosystem with Cloudera’s Distribution Including Apache Hadoop at its core to achieve this goal.

    » Watch the Nokia case study video
    » View the Nokia case study PDF

  • Opower

    “The volume of data that utilities need to acquire, store, and analyze is rapidly expanding. Utilities with large smart meter deployments are now receiving terabytes of AMI data every year. These ever-increasing utility data streams are beyond the ability of typical software tools to capture, store, manage, and analyze. In addition, smart appliances, interactive user applications, and sensors provide increasing orders of magnitude worth of valuable data. Opower uses HBase and Hive to store, query and transform all of our time series and social data. This can be anything from power use for a household to the details about smart appliances. In addition, Opower uses Sqoop and Hive to securely centralize the data from at least two logical RDBMSs per utility provider. We are currently experimenting with using Hive to create a data warehouse so that Opower analysts and product managers can more easily understand our data. CDH provides a great toolchain for Opower to continue to derive ever increasing value as our data sizes grow exponentially.”

    » View the press release

  • Qualcomm

    Cloudera Enterprise, comprised of Cloudera Support and Cloudera Manager, is a subscription-based service designed to provide data-driven enterprises with visibility, reliability, automation and support for the CDH platform (Cloudera’s Distribution Including Apache Hadoop), which helps Qualcomm derive meaningful insights from its Big Data. Qualcomm chose Cloudera Enterprise 3.7 to manage the HBase and Hadoop clusters of several of its new products and services under development.

    » View the press release

  • Rackspace US, Inc.

    Rackspace provides managed systems for enterprises, one of which is Mailtrust. Mailtrust is used by over 1 million people and thousands of companies on hundreds of servers. Mail transfer on Rackspace generates around 150 GB of logs per day in various formats, which are stored with Hadoop to support short-term customer support fixes as well as long-term analysis of the mail system.

  • Rapleaf

    Rapleaf assists their clients in personalizing their online experience. As a new kind of technology-focused information company built for the internet, they can instantly return data on a given email address. Businesses leverage this insight to better understand their customers in order to personalize deals and offers, show them more relevant content, and give them a better experience online and off.

  • RelayHealth

    Apache Hadoop, an open-source platform, is increasingly gaining adoption within organizations trying to draw insight from all the big data being generated. RelayHealth, a McKesson subsidiary, is adopting analytical platforms built on Hadoop to turn big data into business value.

    » Watch the webinar replay: Realizing the Promise of Big Data with Hadoop, featuring Forrester

  • Research In Motion

    BlackBerry Services generate 500 TB of instrumentation data every day, and that volume is growing. Research In Motion (RIM) has converted its ETL services to Hadoop and is migrating data warehouse functions as well. As a result, RIM’s ETL code base has been reduced by 90%, ad hoc query times have dropped from 4 days to 53 minutes, and RIM has achieved significant capital cost reductions.

    » Watch the webinar replay: The Business Advantage of Hadoop: Lessons from the Field

  • Samsung

    “Bioinformatics is a major new focus for Samsung. We’ve built a cloud service for bioinformatics with Cloudera. Integrating their products with existing proprietary bioinformatics systems was fast and very simple.”

  • Skybox Imaging

    Skybox Imaging is using Hadoop as the engine of its satellite image processing system. They use CDH to store and process vast quantities of raw satellite image data, enabling Skybox to create a system that scales as they launch larger numbers of ever more complex satellites.

    » Watch the Hadoop World 2011 presentation

  • SRA

    “With the increasing availability of rich content media and the accessibility of scalable application development afforded by MapReduce, problems in computer vision can be applied against large-scale datasets. At SRA International we utilized Cloudera’s Distribution including Apache Hadoop (CDH3) to develop a scalable solution for the SIFT computer vision algorithm. The SIFT algorithm is challenging to cast into the MapReduce programming model but the flexibility of Hadoop permitted us to develop creative solutions. Our approach is amenable to other algorithms in computer vision and image processing, and was ultimately contributed to the Hadoop community.”

  • Treato

    Treato uses Cloudera’s Distribution Including Apache Hadoop (CDH3) to store terabytes’ worth of Web pages in the Hadoop Distributed File System (HDFS) and run analysis through Hadoop for Web page parsing, indexing, executing NLP algorithms, and statistical aggregation. This analysis would not be possible without HDFS, the distributed computing management, or the low cost of hardware.

    » Read the Treato blog post

  • Trend Micro

    Trend Micro Incorporated uses Cloudera’s Distribution including Apache Hadoop (CDH2), focusing the majority of their usage on Hadoop and HBase. They maintain internal branches of Hadoop and HBase to run various applications.

  • Trulia

    Trulia uses Hadoop to manage log files generated from their real estate Web site.

  • Tynt

    Tynt uses Cloudera’s Distribution including Apache Hadoop to process and store data from over 30,000 Web sites, amounting to an average of 8,000 events per second of input data. Using CDH, Tynt assembles summaries for publishers of what users are copying from their Web sites and analyzes user engagement on the Web.

  • YP

    YP, the new Yellow Pages, needed a solution that would allow them to support increased volumes of traffic data through their distribution network while supporting changing data complexities, adhering to tighter SLAs, and providing intra-day reporting capabilities. YP turned to Hadoop and has since realized lower total cost of ownership while improving productivity.

    » Watch the webinar replay: The Business Advantage of Hadoop: Lessons from the Field