We learnt about Flume in previous post. We will setup and run Flume agent with Avro source and a Java based client here.
We will setup single agent :
Note that here we are using following details for installation (for complete setup):
- Installation base directory:
Installation
For agent installation we will use one the three nodes setup earlier for agent JVM and all three HDFS nodes for sink (as setup described in earlier post ).We will setup single agent :
Note that here we are using following details for installation (for complete setup):
- Installation base directory:
- /home/anishsneh/installs
- anishsneh
- server01
-
Install Flume - we will use Apache Flume 1.5.0.1 (with Hadoop2)
- Download apache-flume-1.5.0.1-bin.tar.gz from Flume Website, note that we are using Hadoop 2 for sink
- Extract downloaded package to anishsneh@server01:/home/anishsneh/installs, such that we have:
[anishsneh@server01 installs]$ ls -ltr apache-flume-1.5.0.1-bin total 128 -rw-r--r--. 1 anishsneh anishsneh 1779 Mar 28 15:15 README -rw-r--r--. 1 anishsneh anishsneh 6172 Mar 28 15:15 DEVNOTES -rw-r--r--. 1 anishsneh anishsneh 22517 May 6 16:29 LICENSE -rw-r--r--. 1 anishsneh anishsneh 61591 Jun 10 13:56 CHANGELOG -rw-r--r--. 1 anishsneh anishsneh 249 Jun 10 14:08 NOTICE -rw-r--r--. 1 anishsneh anishsneh 1591 Jun 10 14:08 RELEASE-NOTES drwxr-xr-x. 10 anishsneh anishsneh 4096 Jun 10 15:10 docs drwxrwxr-x. 2 anishsneh anishsneh 4096 Sep 17 14:59 lib drwxrwxr-x. 2 anishsneh anishsneh 4096 Sep 17 14:59 tools drwxr-xr-x. 2 anishsneh anishsneh 4096 Sep 17 14:59 bin drwxr-xr-x. 2 anishsneh anishsneh 4096 Sep 17 14:59 conf
- Create hdfs://server01:9000/data/flume directory on HDFS and change its permissions to 777 on server01 (for this demo)
[anishsneh@server01 installs]$ hadoop fs -mkdir /data/flume
[anishsneh@server01 installs]$ hadoop fs -chmod 777 /data/flume