Installing Sqoop 2
Sqoop 2 Prerequisites
- An operating system supported by CDH 5
- Oracle JDK
Hadoop must be installed on the node which runs the Sqoop 2 server component.
- Services which you wish to use with Sqoop, such as HBase, Hive HCatalog, and Accumulo.
Sqoop checks for these services when you run it, and finds services which are installed
and configured. It logs warnings for services it does not find. These warnings, shown
> Warning: /usr/lib/sqoop/../hbase does not exist! HBase imports will fail. > Please set $HBASE_HOME to the root of your HBase installation. > Warning: /usr/lib/sqoop/../hive-hcatalog does not exist! HCatalog jobs will fail. > Please set $HCAT_HOME to the root of your HCatalog installation. > Warning: /usr/lib/sqoop/../accumulo does not exist! Accumulo imports will fail. > Please set $ACCUMULO_HOME to the root of your Accumulo installation.
Installing Sqoop 2
Sqoop 2 is distributed as two separate packages: a client package (sqoop2-client) and a server package (sqoop2-server). Install the server package on one node in the cluster; because the Sqoop 2 server acts as a MapReduce client this node must have Hadoop installed and configured.
Install the client package on each node that will act as a client. A Sqoop 2 client will always connect to the Sqoop 2 server to perform any actions, so Hadoop does not need to be installed on the client nodes.
Depending on what you are planning to install, choose the appropriate package and install it using your preferred package manager application.
To install the Sqoop 2 server package on a RHEL-compatible system:
Before using the instructions on this page to install or upgrade, install the Cloudera yum, zypper/YaST or apt repository, and install or upgrade CDH 5 and make sure it is functioning correctly. For instructions, see Installing the Latest CDH 5 Release and Upgrading Unmanaged CDH Using the Command Line.
$ sudo yum install sqoop2-server
To install the Sqoop 2 client package on a RHEL-compatible system:
$ sudo yum install sqoop2-client
To install the Sqoop 2 server package on a SLES system:
$ sudo zypper install sqoop2-server
To install the Sqoop 2 client package on a SLES system:
$ sudo zypper install sqoop2-client
To install the Sqoop 2 server package on an Ubuntu or Debian system:
$ sudo apt-get install sqoop2-server
To install the Sqoop 2 client package on an Ubuntu or Debian system:
$ sudo apt-get install sqoop2-client
Installing the sqoop2-server package creates a sqoop-server service configured to start Sqoop 2 at system startup time.
You are now ready to configure Sqoop 2. See the next section.
|<< About Sqoop 2||Configuring Sqoop 2 >>|