Cloudera and NetApp

Cloudera, the leader in Apache Hadoop-based big data systems, has teamed with NetApp to provide a ready to deploy solution for managing and extracting value from big data. The NetApp Solutions for Hadoop(NSH) integrates Cloudera's best-of-class Hadoop software with NetApp storage to deliver a revolutionary solution that combines self-healing, clustered storage, and distributed computing into a single, scalable system.


About NetApp

NetApp creates innovative products—storage systems and software that help customers around the world store, manage, protect, and retain one of their most precious corporate assets: their data. We are recognized throughout the industry for continually pushing the limits of today's technology so that our customers never have to choose between saving money and acquiring the capabilities they need to be successful.

Visit the NetApp Site to Learn More


The NetApp Solutions for Hadoop(NSH)

The NetApp Solutions for Hadoop(NSH) is a joint solution between NetApp and Cloudera that includes everything you need to get started on big data analytics. The NetApp Solutions for Hadoop and Cloudera provide an unprecedented level of analytical power and flexibility. Using virtually any programming language
or business intelligence tool, you can perform complex analysis across extremely large and diverse datasets, aggregating the results and feeding them into NetApp enterprise-grade storage systems.

Read the Case Study

Why NetApp and Cloudera

Easy to deploy and manage

  • Deploy a cluster in 3 easy steps and ensure optimal settings
  • Reduced software mirroring reduces cluster network congestion
  • Fewer copies means less storage, higher storage efficiency

Reduce risk to meet tight SLAs

  • Comprehensive Hadoop solution engineered for enterprise requirements
  • Enterprise grade protection of HDFS metadata
  • Disk failure detection with flexible on line repair prevents Hadoop job from restarting
  • Nonstop storage operation with online repair and global hot spares


  • Decouples compute nodes from storage. Independent scaling or scale together
  • Unrivaled NFS expertise allows easier access to HDFS data,
  • Choose the amount of storage capacity you want applied to nodes

Leverage existing IT investments

  • Hadoop works alongside existing IT investments
  • NOSH is compatible with most products in your current environment
  • Open analytical stack means higher interoperability within infrastructure

Put ALL your data to work

  • Hadoop handles/ingests any kind of data in any kind of schema
  • Process data in the same place it is stored
  • Analyze data how and when you need it, develop new analytical models to discover differentiated value.