Build a custom big data pipeline
Data ingestion and transformation is the first step in all big data projects. Hadoop's extensibility results from high availability of varied and complex data, but the identification of data sources and the provision of HDFS and MapReduce instances can prove challenging. Cloudera will architect and implement a custom ingestion and ETL pipeline to quickly bootstrap your big data solution.
A typical Cloudera Ingestion ETL Pilot
Lasts two weeks and consists of the following activities:
- Identify solution requirements to include data sources, transformations, and egress points
- Architect and develop a pilot implementation for up to three data sources, five transformations, and one target system
- Develop a deployment architecture that will result in a production deployment plan
- Review your Cloudera cluster and application configuration