

Date: Friday, Dec 21 2012


This company runs Cloudera Enterprise Core with Cloudera Enterprise RTD (Real-Time Delivery) alongside Oracle RAC. The Cloudera and Oracle platforms work together in a closed-loop analytical process: Sqoop handles two-way loads and active archiving between Hadoop and Oracle, and data is migrated between CDH and Oracle every 10 minutes. The company offloads data processing and historical storage from Oracle to CDH, and moves data back into Oracle as needed for reporting and analysis.

They process a terabyte (TB) of data per day through multiple processing tiers: the data passes through several Hadoop clusters and is distilled down to 120 million records, which are loaded into Oracle. Oracle houses a few months of recent data, which is available to business analysts for immediate reporting (both ad hoc and canned reports), whereas CDH is used for historical trend analysis (via Hive) across up to 25 years of history. Oracle contains aggregated data; CDH captures all of the detailed data. The chart below shows the different purposes and functions of each platform in the environment.
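As a rough illustration of the two-way Sqoop loads described above, the sketch below builds (but does not run) one `sqoop import` command for archiving Oracle rows into Hadoop and one `sqoop export` command for loading aggregated results back into Oracle. The case study does not publish the company's actual jobs, so the JDBC URL, user, password file, table names, HDFS paths, and the choice of an append-style incremental import are all assumptions made for illustration; only the Sqoop flags themselves and the 10-minute interval are real.

```python
from typing import List

# All connection details below are hypothetical placeholders.
ORACLE_JDBC_URL = "jdbc:oracle:thin:@//oracle-rac.example.com:1521/REPORTING"
ORACLE_USER = "etl_user"
PASSWORD_FILE = "/user/etl/.oracle_password"

# The case study states data moves between CDH and Oracle every 10 minutes.
SYNC_INTERVAL_MINUTES = 10


def _connection_args() -> List[str]:
    """Sqoop arguments shared by the import and export commands."""
    return [
        "--connect", ORACLE_JDBC_URL,
        "--username", ORACLE_USER,
        "--password-file", PASSWORD_FILE,
    ]


def build_archive_import(table: str, target_dir: str,
                         check_column: str, last_value: str) -> List[str]:
    """Oracle -> HDFS: import only rows whose check column exceeds last_value.

    Assumes the table has a monotonically increasing key, so that
    Sqoop's append-mode incremental import applies.
    """
    return [
        "sqoop", "import",
        *_connection_args(),
        "--table", table,
        "--target-dir", target_dir,
        "--incremental", "append",
        "--check-column", check_column,
        "--last-value", last_value,
    ]


def build_aggregate_export(table: str, export_dir: str) -> List[str]:
    """HDFS -> Oracle: export aggregated records into an existing Oracle table."""
    return [
        "sqoop", "export",
        *_connection_args(),
        "--table", table,
        "--export-dir", export_dir,
    ]


def runs_per_day(interval_minutes: int = SYNC_INTERVAL_MINUTES) -> int:
    """Number of sync cycles per day at the given interval."""
    return (24 * 60) // interval_minutes


if __name__ == "__main__":
    # Hypothetical table names and paths, printed rather than executed.
    print(" ".join(build_archive_import(
        "EVENT_DETAIL", "/data/archive/event_detail", "EVENT_ID", "0")))
    print(" ".join(build_aggregate_export(
        "EVENT_SUMMARY", "/data/aggregates/event_summary")))
    print(runs_per_day())
```

Returning the commands as argument lists keeps the sketch safe to run anywhere; a scheduler of the reader's choice could pass each list to `subprocess.run` at the 10-minute cadence, which works out to 144 cycles per day.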