Download StreamSets Data Collector | Cloudera
Your browser is out of date

Update your browser to view this website correctly. Update my browser now




  • A graphical IDE lets you design, test and debug ingest flows without requiring schema specification.

  • Built-in transformations help you sanitize, sample and route your data as needed.

  • Intelligent monitoring gives you runtime visibility to data flow performance, including stage-specific early warnings about anomalies and outliers.

  • Deep integration with the Hadoop ecosystem, including connectors for HDFS, HBase, Kafka and Solr

  • Flexible deployment of pipelines to edge servers or to the Enterprise Data Hub as a Spark Streaming application or MapReduce job.

  • Seamless management of infrastructure via Cloudera Manager and parcels



Operating Systems

Use one of the following operating systems and versions:

  • Mac OS X

  • CentOS 6 or 7

  • RedHat Enterprise Linux 6 or 7

  • Ubuntu 14.04


Oracle Java 7 or 8


Use the latest version of one of the following browsers:

  • Chrome

  • Firefox

  • Safari

Selected tab: systemrequirements

Want to Get Involved or Learn More?

Check out our other resources

Cloudera Community

Collaborate with your peers, industry experts, and Clouderans to make the most of your investment in Hadoop.

Cloudera University

Receive expert Hadoop training through Cloudera University, the industry's only truly dynamic Hadoop training curriculum that’s updated regularly to reflect the state of the art in big data.