Cloudera Search Guide

This guide provides Cloudera Search prerequisites, shows how to load and index data in search, and shows how to use Search to query data. In addition, this guide provides a tutorial, various indexing references, and troubleshooting information.

Cloudera Search integrates with CDH and uses Apache Solr to provide scalable and reliable search services. Search makes these services available to end users through tools that use familiar access and querying models.

  • Search integrates with the existing CDH ecosystem, so data can be stored, shared, and accessed using the various solutions included with CDH. This also prevents data silos and minimizes expensive data movement.
  • Search provides access to data stored in CDH without requiring the Java skills required for MapReduce jobs or the SQL skills required for Impala queries.
  • Search returns results typically within seconds, rather than the minutes or more that are often required for MapReduce jobs to complete.
  • Search allows you to customize what information is indexed. Indexes can optimize for completeness, size, or data types.