Your browser is out of date!

Update your browser to view this website correctly. Update my browser now

×

As an online retail pioneer for over 15 years, Connexity, Inc.—formerly Shopzilla, Inc.—operates a portfolio of shopping websites and is recognized as a leading source for connecting buyers and sellers in the digital world.

 

Read the case study

Overview

As an online retail pioneer since 1996, Connexity, Inc. operates a portfolio of shopping websites and is recognized as a leading source for connecting buyers and sellers in the digital world. Connexity touches a global audience of more than 40 million shoppers each month, linking them with over 100 million products from tens of thousands of retailers. Connexity’s portfolio of comparison shopping, merchant ratings, and review sites—in addition to its namesake site—includes BizrateBizrate InsightsBesoTada, and a more recent addition, Connexity, which provides audience targeting.

Solution

To accommodate its requirement to process and deliver insights on millions of pageviews or ten billion ad bid requests daily, reaching over 100 million unique visitors, Connexity has deployed Cloudera to complement its enterprise data warehouse (EDW) in a hybrid big data environment that meets the needs of a wide range of users. A large amount of processing, cleansing, transformation, and crunching is done in the Hadoop environment and then aggregated data is pushed into the data warehouse via Apache Sqoop for reporting. Connexity has written a custom tool, known internally as Forklift, to move data from the EDW into CDH in an optimized fashion.

Cloudera had the expertise, the experience, and the community support we needed to be successful. We initially worked with the company in 2011 when we started migrating some of our test systems to the Cloudera distribution—CDH. Questions were answered quickly, the Cloudera University training we took was very professional, and their support was terrific. It was only natural that we partner with Cloudera on this initiative.

Rony Sawdayi, Vice President, Engineering, Connexity

Impact: Faster Processing

For its merchandising process, Connexity takes over 15,000 feeds and 100 million products from retailers and processes them with Cloudera each day. What once took several days has been reduced to just a few hours, and a new approach is being tested that will reduce that down to minutes. The improved processing performance also benefits Connexity’s search engine marketing (SEM) activities.

With the processing power of our Cloudera platform, Connexity is able to score and bid on ten million keywords each day. For our audience solutions business, we’re in a position to touch 100 million unique visitors over the web, and collect billions of data points to feed our data science and create immensely rich shopping intent data.

Paramjit Singh, Director of Data, Connexity

Impact: Detailed Insight, Relevant Results

Connexity uses a combination of Apache Mahout and R running on Cloudera to perform classifications and user segmentation on its analytics and research cluster.

Data scientists don’t typically need to consume data warehouse resources now because all of the most recent data is available in Cloudera via R or Mahout… We needed enormous processing capabilities, scalability, full redundancy, and extensive storage -- all at a cost-effective price. Our Cloudera platform provides all that and more.

Rony Sawdayi, Vice President, Engineering, Connexity