Installing HBase 0.94.x

on a Multi-node cluster with Ubuntu 14.04

Sabeur Aridhi

Introduction

This tutorial is a sequel of of Matteo Lissandrini's "Installing HDFS and Hadoop 2.X on a Multi-node cluster with Ubuntu 14.0.

That guide can also be used to install Hadoop 1.x (with minor if none modification); in this work we will assume that you have followed that tutorial and have installed Hadoop 1.x and HDFS.

Even thought HBase 0.94.x can run against both Hadoop 1.x and 2.x versions (see HBase 0.94 book) we highly recommend to use Hadoop 1.x for HBase 0.x and Hadoop 2.x for HBase 1.x and 2.x.

We wish also to inform you that also this tutorial can be applied to HBase 1.x and 2.x (with minor if none modification).

Installing

The following steps will be needed only once. Download HBase 0.94.X stable, to do so navigate in the List of Mirrors select one and decide which version to download. For the sake of simplicity from now on we will assume tho have chosen version 0.94.27.
For example wget can be used:

# from eu
wget https://www.eu.apache.org/dist/hbase/hbase-0.94.27/hbase-0.94.27.tar.gz
# from us
wget https://www.us.apache.org/dist/hbase/hbase-0.94.27/hbase-0.94.27.tar.gz

Then extract the tar to the final installation directory, fix also permission and create a version agnostic symlink.
In this tutorial we will use the standard /usr/local/ as installation directory but obviously you are free to chose the one you prefer.

# extract & copy
sudo tar -zxf hbase-0.94.27.tar.gz -C /usr/local/
# fix permission
sudo chown -R hduser:hadoop /usr/local/hbase-hbase-0.94.27/
# create symlink
sudo ln -s /usr/local/hbase-0.94.27/ /usr/local/hbase

Roland Versaworks 4 Download Link ((install)) Direct

While Roland VersaWorks 4.0 is a legacy software version, it remains a vital tool for users of older Roland DG inkjet printers and cutters. This version introduced significant enhancements like an advanced job log and an expanded metallic color library. Official Download and Access

  1. Advanced Spot Color Matching: VersaWorks 4 features an advanced spot color matching system that ensures accurate color reproduction, even for complex designs.
  2. Automated Print Queue Management: The software allows users to manage multiple print jobs efficiently, with automated queuing and prioritization.
  3. Enhanced Print Quality Control: VersaWorks 4 provides users with precise control over print quality, including adjustable settings for ink density, dot gain, and more.
  4. Support for Various File Formats: The software supports a wide range of file formats, including PDF, EPS, TIFF, and JPEG.
  5. Integration with Roland Printers: VersaWorks 4 is specifically designed to work seamlessly with Roland's large-format printers, ensuring optimal performance and compatibility.

Nodes Setup

Finally configure and initialize the other cluster nodes. List the machines that will act as region server in conf/regionservers, one address per line line.

If needed update /etc/hosts according to Hadoop tutorial hints.

Once done, propagate the setup throw the cluster:

 #!/bin/bash

 # Build configured HBase tar.
 mkdir -p /tmp/distr/
 tar -czf /tmp/distr/hbase.tgz /usr/local/hbase-0.94.27

 # Distribute to each region node
 while IFS='' read -r node_ip; do
     scp /etc/hosts hduser@$node_ip:~/
     scp ~/.profile ~/.vimrc hduser@$node_ip:~/

 	scp hbase.tgz hduser@$node_ip:~/

 	ssh -o StrictHostKeyChecking=no -tt hduser@$node_ip <<EOF
 sudo mv $HOME/hosts /etc/

 # Install & link & fix permission
 sudo tar -zxf $HOME/hbase.tgz -C /
 sudo ln -s /usr/local/hbase-0.94.27 /usr/local/hbase
 sudo chown -R hduser:hadoop /usr/local/hbase*

 # Create zookeeper directory (even if not needed)
 sudo mkdir -p /usr/local/zookeeper
 # Fix permission
 sudo chown -R hduser:hadoop /usr/local/zookeeper

 # Raise the limit for max opened files (DB srv)
 sudo sysctl -w fs.file-max=100000

 # Required due to -tt option
 exit
 EOF
 done < /usr/local/hbase/conf/regionservers

Start

That's the end of the journey: enjoy your new HBase cluster!

Start it running start-hbase.sh