Apache HBase provides large-scale tabular storage for Hadoop using the Hadoop Distributed File System (HDFS). Cloudera recommends installing HBase in a standalone mode before you try to run it on a whole cluster.
Before using the instructions on this page to install or upgrade, install the Cloudera yum, zypper/YaST or apt repository, and install or upgrade CDH 5 and make sure it is functioning correctly. For instructions, see Installing the Latest CDH 5 Release and Upgrading Unmanaged CDH Using the Command Line.
When starting, stopping and restarting CDH components, always use the service (8) command rather than running scripts in /etc/init.d directly. This is important because service sets the current working directory to / and removes most environment variables (passing only LANG and TERM), to create a predictable environment for the service. If you run the scripts in /etc/init.d, locally-set environment variables could produce unpredictable results. If you install CDH from RPMs, service will be installed as part of the Linux Standard Base (LSB).
Use the following sections to install, update, and configure HBase:
- New Features and Changes for HBase in CDH 5
- Upgrading HBase
- Installing HBase
- Configuration Settings for HBase
- Starting HBase in Standalone Mode
- Configuring HBase in Pseudo-Distributed Mode
- Deploying HBase on a Cluster
- Accessing HBase by using the HBase Shell
- HBase Online Merge
- Using MapReduce with HBase
- Troubleshooting HBase
- Viewing the HBase Documentation
|<< Viewing the Flume Documentation||New Features and Changes for HBase in CDH 5 >>|