I am trying to install Apache Accumulo 2.0 with Hadoop cluster and Zookeeper clusters already running as separate docker containers;
Now I must set the below environment variables as per installation instructions;
############################
# Variables that must be set
############################
## Hadoop installation
export HADOOP_HOME="${HADOOP_HOME:-/path/to/hadoop}"
## Hadoop configuration
export HADOOP_CONF_DIR="${HADOOP_CONF_DIR:-${HADOOP_HOME}/etc/hadoop}"
## Zookeeper installation
export ZOOKEEPER_HOME="${ZOOKEEPER_HOME:-/path/to/zookeeper}"
However these directories do not exist in local machine. Do I have to copy these directories from the individual containers of Hadoop and Zookeeper in order to make them available in local machine where I am trying to run Accumulo? Or there is some proper way to configure it?
Accumulo requires the Hadoop XML config files , as does any Hadoop client. It fines these using
$HADOOP_CONF_DIRor$HADOOP_HOME/confIt uses
$HADOOP_HOME/libto get Hadoop JARsI'm not sure what is used by having
$ZOOKEEPER_HOME, but I guess Accumulo does not come with Zookeeper JARs eitherSo, yes, you need to copy them out of the container or download Hadoop libraries on the host and volume mount them into the container instead. You should already have volume mounts for Zookeeper and the NameNode & Datanodes anyway.