Customers
Our customers have been successful in using Cloudera’s Distribution Including Apache Hadoop in production to help store, manage and analyze all of their data. Cloudera’s products and services have been proven to be highly valuable in some of the most innovative enterprises and complex data environments across different industries.
-
Adconion
Adconion performs two different types of computing with Hadoop; The first is a near real time feedback loop using Elastic MapReduce (EMR) running constantly, and the second use is tracking ad log files. With close to 300 million impressions a day, they compress approximately 60GB data everyday.
-
AdGooroo
AdGooroo is a leading provider of advertising intelligence to internet marketers. Its proprietary technology tracks advertising activity in any given industry, empowering sophisticated agencies and advertisers with information on competitors’ search advertising, display advertising, and link building strategies.
-
Aggregate Knowledge
To accommodate a large customer workload, Aggregate Knowledge spun up a CDH cluster with Amazon’s EC2. They benefited from the sophistication of CDH and the elasticity of EC2.
-
AOL Advertising
“AOL Advertising is working with Cloudera to leverage their Hadoop expertise in all key areas, training, consulting and support, all as part of its efforts to understand and leverage large data volumes aggregated from diverse sources for reporting, performance optimization and targeting. The combination of Cloudera’s expertise and Cloudera Enterprise has assisted AOL in improving Hadoop management and monitoring.”
-
Apollo Group, Inc.
“Apollo Group, Inc., through its subsidiaries, University of Phoenix, College for Financial Planning, Insight Schools, Inc., Institute for Professional Development, and Western International University, is a leading provider of higher education programs for working adults. At Apollo, we are building a data infrastructure for academic analytics and research exploration based on Hadoop and other open source technology. Cloudera’s support and training is critical to our success.”
-
Concurrent Computers
“We leverage Cloudera and Hadoop to capture census level data measurement. Hadoop is unparalleled in scalability. We process billions of records a day and need a solution that does not incorporate an equivalent cost.”
-
Explorys Medical
“Explorys uses CDH and HBase at the core of it’s medical informatics platform that enables subscribers to search and analyze patient populations, treatment protocols, and clinical outcomes. Explorys provides uniquely powerful and HIPAA compliant solutions for accelerating life saving discovery.”
» View Explory’s Medical case study Video
» View Explory’s Medical case study PDF -
First Life Research
First Life Research uses Cloudera’s Distribution including Apache Hadoop (CDH3) to store terabytes worth of Web pages in Hadoop’s Distributed Filing System (HDFS) and run analysis through Hadoop for Web page parsing, indexing, executing NLP algoritms, and statistical aggregation. This analysis would not be possible without HDFS, the distributed computing management, or the low cost of hardware.
-
Groupon
Groupon features a daily deal on the best stuff to do, see, eat, and buy in more than 300 markets and 35 countries, and soon beyond. We have about 1,000 people working in our Chicago headquarters, a growing office in Palo Alto, CA, as well as regional offices in Europe and Latin America and local account executives in many cities.
-
Huron Consulting Group
Huron Consulting Group uses Hadoop as a data warehouse for document metadata. By analyzing these metadata files across projects they gain insights and metrics useful for improvement.
-
JiWire
JiWire uses Cloudera’s Distribution including Apache Hadoop (CDH) to allow for a massively scalable location-based advertising platform. JiWire’s platform enables advertisers to identify and deliver ads to audience segments based on a person’s physical location while taking the venue type and brand into account.
-
Klout
Klout measures influence across the social web and uses Cloudera’s Distribution including Apache Hadoop (CDH3) to store, process, and analyze real time social media data streams. Klout’s platform analyzes signals as they travel through the social web and performs NLP, machine learning, and other analysis to measure topical and broad based influence.
-
Lyris
“As digital marketing evolves into an increasingly complex and data-driven environment, marketing automation platforms fueled by big data are crucial for understanding and effectively connecting with customers. By choosing Cloudera and implementing CDH, Lyris is capitalizing on the vast opportunities inherent in the marriage of big data architecture and digital marketing.”
-
Mobile Posse
Mobile Posse, Inc. is the leading provider of next-generation mobile advertising and mCRM solutions for the active home screen. Using proprietary patent-pending technology, Mobile Posse enables advertisers, content providers, and wireless carriers to proactively reach consumers through the prime real-estate on the mobile phone.
-
NAVTEQ
NAVTEQ uses Cloudera’s Distribution including Apache Hadoop (CDH) with Cloudera enterprise support for various distributed storage and processing needs. CDH use at NAVTEQ includes core Hadoop and HBase
-
Nokia
Nokia’s goal is to bring the world to the third phase of mobility: leveraging data to make it easier to navigate the physical world. Nokia relies on a technology ecosystem with Cloudera’s Distribution including Hadoop at its core to achieve this goal.
» Watch the Nokia case study video
» View the Nokia case study PDF -
OPower
“The volume of data that utilities need to acquire, store, and analyze is rapidly expanding. Utilities with large smart meter deployments are now receiving terabytes of AMI data every year. These ever-increasing utility data streams are beyond the ability of typical software tools to capture, store, manage, and analyze. In addition, smart appliances, interactive user applications and sensors, provide increasing orders of magnitude worth of valuable data. Opower uses HBase and Hive to store, query and transform all of our time series and social data. This can be anything from power use for a household to the details about smart appliances. In addition, Opower uses Sqoop and Hive to securely centralize the data from at least two logical rdbms per utility provider. We are currently experimenting with using Hive to create a data warehouse so that Opower analysts and product managers can more easily understand our data. CDH provides a great toolchain for Opower to continue to derive ever increasing value as our data sizes grow exponentially.”
-
Qualcomm
Cloudera Enterprise, comprised of Cloudera Support and Cloudera Manager, is a subscription-based service designed to provide data-driven enterprises with visibility, reliability, automation and support for the CDH platform (Cloudera’s Distribution Including Apache Hadoop), which helps Qualcomm derive meaningful insights from its Big Data. Qualcomm chose Cloudera Enterprise 3.7 to manage the HBase and Hadoop clusters of several of its new products and services under development.
-
Rackspace US, Inc.
Rackspace provides managed systems for enterprises one of which being Mailtrust. Mailtrust is used by over 1 million people and thousands of companies on hundreds of servers. Mail transfer on Rackspace generates around 150 GB per day of logs in various formats, which are stored with Hadoop to perform short-term customer support fixes as well as long-term analysis of the mail system.
-
Rapleaf
RapLeaf assists their clients in personalizing their online experience. As a new kind of technology focused information company built for the internet they can instantly return data on a given email address. Businesses leverage this insight to better understand their customers in order to personalize deals and offers, show them more relevant content and give them a better experience online and off.
-
Samsung
“Bioinformatics is a major new focus for Samsung. We’ve built a cloud service for bioinformatics with Cloudera. Integrating their products with existing proprietary bioinformatics systems was fast and very simple.”
-
Skybox Imaging
Skybox Imaging is using Hadoop as the engine of its satellite image processing system. They use CDH to store and process vast quantities of raw satellite image data, enabling Skybox to create a system that scales as they launch larger numbers of ever more complex satellites.
-
SRA
“With the increasing availability of rich content media and the accessibility of scalable application development afforded by MapReduce, problems in computer vision can be applied against large-scale datasets. At SRA International we utilized Cloudera’s Distribution including Apache Hadoop (CDH3) to develop a scalable solution for the SIFT computer vision algorithm. The SIFT algorithm is challenging to cast into the MapReduce programming model but the flexibility of Hadoop permitted us to develop creative solutions. Our approach is amenable to other algorithms in computer vision and image processing, and were ultimately contributed to the Hadoop community.”
-
Trend Micro Incorporated
Trend Micro uses Cloudera’s Distribution including Apache Hadoop (CDH2) focusing majority of their usage in Hadoop and HBase. They maintain internal branches of Hadoop and HBase to run various applications.
-
Trulia
Trulia uses Hadoop to manage log files generated from their Real Estate Web site.
-
Tynt
Tynt uses Cloudera’s Distribution including Apache Hadoop to process and store data from over 30,000 Web sites amounting to an average of 8,000 events per second of input data. Using CDH Tynt assembles publishers’ summaries of what users are copying from their Web sites and to analyze user engagement on the Web.


























