This is the documentation for Cloudera 5.5.x. Documentation for other versions is available at Cloudera Documentation.

About Hive

Apache Hive is a data warehousing application built on top of Hadoop. It lets you query your data using HiveQL, a language similar to SQL.

As of CDH 5, Hive includes HCatalog, but you still need to install HCatalog separately if you want to use it; see HCatalog Installation.

Install Hive on the client machine(s) from which you submit jobs; you do not need to install it on the nodes in your Hadoop cluster.


You need to deploy HiveServer2, an improved version of HiveServer that supports a Thrift API tailored for JDBC and ODBC clients, Kerberos authentication, and multi-client concurrency. The CLI for HiveServer2 is Beeline.
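
As an illustration of the JDBC access mentioned above, the following is a minimal sketch of a Java client that connects to HiveServer2 and lists tables. It is not taken from this page. The host name, the user name "hive", the empty password, and the class and method names are hypothetical placeholders. The sketch assumes an unsecured HiveServer2 listening on its default port, 10000, and the Hive JDBC driver (org.apache.hive.jdbc.HiveDriver) on the client classpath. A Kerberos-secured cluster would need additional URL parameters that are not shown here.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;

public class HiveServer2Example {

    // Builds a HiveServer2 JDBC URL of the form jdbc:hive2://host:port/database.
    public static String jdbcUrl(String host, int port, String database) {
        return "jdbc:hive2://" + host + ":" + port + "/" + database;
    }

    public static void main(String[] args) {
        // Hypothetical host; pass your HiveServer2 host as the first argument.
        String host = args.length > 0 ? args[0] : "localhost";
        // 10000 is the default HiveServer2 port; "default" is the default database.
        String url = jdbcUrl(host, 10000, "default");
        System.out.println("Connecting to " + url);

        // Assumption: user "hive" with an empty password on an unsecured server.
        try (Connection conn = DriverManager.getConnection(url, "hive", "");
             Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery("SHOW TABLES")) {
            while (rs.next()) {
                // The first column of each row holds a table name.
                System.out.println(rs.getString(1));
            }
        } catch (SQLException e) {
            // Reached when the driver is missing or the server is unreachable.
            System.err.println("Could not query HiveServer2: " + e.getMessage());
        }
    }
}
```

The same jdbc:hive2:// URL can be passed to Beeline's !connect command. With older driver versions you may need to load the driver class explicitly with Class.forName("org.apache.hive.jdbc.HiveDriver") before opening the connection.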


The original HiveServer and command-line interface (CLI) are deprecated; use HiveServer2 and Beeline.

Page generated January 14, 2016.