CDH3 Single Node Installation Guide Dell Server Configuration Guide Ubuntu LTS Desktop Installation Guide



Similar documents
Apache Hadoop 2.0 Installation and Single Node Cluster Configuration on Ubuntu A guide to install and setup Single-Node Apache Hadoop 2.

HADOOP. Installation and Deployment of a Single Node on a Linux System. Presented by: Liv Nguekap And Garrett Poppe

Hadoop Multi-node Cluster Installation on Centos6.6

How To Install Hadoop From Apa Hadoop To (Hadoop)

The objective of this lab is to learn how to set up an environment for running distributed Hadoop applications.

Hadoop Installation. Sandeep Prasad

Installing Hadoop. Hortonworks Hadoop. April 29, Mogulla, Deepak Reddy VERSION 1.0

WA1826 Designing Cloud Computing Solutions. Classroom Setup Guide. Web Age Solutions Inc. Copyright Web Age Solutions Inc. 1

HADOOP - MULTI NODE CLUSTER

CDH installation & Application Test Report

Hadoop (pseudo-distributed) installation and configuration

研 發 專 案 原 始 程 式 碼 安 裝 及 操 作 手 冊. Version 0.1

CDH 5 Quick Start Guide

Cassandra Installation over Ubuntu 1. Installing VMware player:

Procedure to Create and Duplicate Master LiveUSB Stick

User Manual - Help Utility Download MMPCT. (Mission Mode Project Commercial Taxes) User Manual Help-Utility

Installing Hadoop. You need a *nix system (Linux, Mac OS X, ) with a working installation of Java 1.7, either OpenJDK or the Oracle JDK. See, e.g.

How to install Apache Hadoop in Ubuntu (Multi node/cluster setup)

Running Knn Spark on EC2 Documentation

How to install Apache Hadoop in Ubuntu (Multi node setup)

Tutorial How to upgrade firmware on Phison S5 controller MyDigitalSSD

This handout describes how to start Hadoop in distributed mode, not the pseudo distributed mode which Hadoop comes preconfigured in as on download.

Introduction to Mirametrix EyeTracker

Hadoop Setup Walkthrough

Running Kmeans Mapreduce code on Amazon AWS

HADOOP CLUSTER SETUP GUIDE:

USB Bare Metal Restore: Getting Started

Tutorial How to upgrade firmware on Phison S8 controller MyDigitalSSD using a Windows PE environment

Single Node Hadoop Cluster Setup

Setup Hadoop On Ubuntu Linux. ---Multi-Node Cluster

VMWare Workstation 11 Installation MICROSOFT WINDOWS SERVER 2008 R2 STANDARD ENTERPRISE ED.

UBUNTU VIRTUAL MACHINE + CAFFE MACHINE LEARNING LIBRARY SETUP TUTORIAL

Using desktop ANYWHERE

Activation Key usage and Windows 7 Professional installation guide

BIOS Update SOP. Updating BIOS under Windows Mode for Intel Series (PXX/ZXX/X79)... 2

Dual-boot Windows 10 alongside Windows 8

cloud-kepler Documentation

CSA Software Listing Table of Contents. Both Windows and Mac platforms are supported.

Installation and Configuration Documentation

Using The Hortonworks Virtual Sandbox

How to Make a USB Bootable


CPE111 COMPUTER EXPLORATION

Hadoop Data Warehouse Manual

Installing Sun's VirtualBox on Windows XP and setting up an Ubuntu VM

Getting Hadoop, Hive and HBase up and running in less than 15 mins

Installing an open source version of MateCat

Tutorial- Counting Words in File(s) using MapReduce

Option nv, Gaston Geenslaan 14, B-3001 Leuven Tel Fax Page 1 of 14

Partek Flow Installation Guide

Setup Guide for HDP Developer: Storm. Revision 1 Hortonworks University

Installing Java. Table of contents

Plexxi Control Installation Guide Release 2.1.0

Cloudera Distributed Hadoop (CDH) Installation and Configuration on Virtual Box

Local Caching Servers (LCS): User Manual

PC Instructions for Miller LiveArc Software

Vess A2000 Series. NVR Storage Appliance. Windows Recovery Instructions. Version PROMISE Technology, Inc. All Rights Reserved.

WA2102 Web Application Programming with Java EE 6 - WebSphere RAD 8.5. Classroom Setup Guide. Web Age Solutions Inc. Web Age Solutions Inc.

Installing Proview on an Windows XP machine

Quick Deployment Step-by-step instructions to deploy Oracle Big Data Lite Virtual Machine

HSearch Installation

SSD Firmware Update Utility Guide

USB HSPA Modem. User Manual

Network/Floating License Installation Instructions

Puppy Linux Installation To a USB Flash Drive How to install Puppy Linux lupu Lucid to a Flash Drive

Acronis True Image 2015 REVIEWERS GUIDE

Revolution R Enterprise 7 Hadoop Configuration Guide

Hadoop Installation MapReduce Examples Jake Karnes

VERSION 9.02 INSTALLATION GUIDE.

Tableau Spark SQL Setup Instructions

DraganFly Guardian: API Instillation Instructions

Windows 8 UEFI BIOS Update (Step-by-Step Guide)

Hadoop Lab - Setting a 3 node Cluster. Java -

Hadoop MultiNode Cluster Setup

How to Setup and Connect to an FTP Server Using FileZilla. Part I: Setting up the server

Hadoop Tutorial. General Instructions

MiraCosta College now offers two ways to access your student virtual desktop.

Important Notice. (c) Cloudera, Inc. All rights reserved.

Connecticut Hazardous Waste Manifests Database

AMD RAID Installation Guide

M2GO & M1 Operating System Installation

VMware Horizon FLEX User Guide

Hadoop Training Hands On Exercise

How To Backup On Ubuntu (Amd64) With Back In Time (Amd) On Windows Xp) On A Windows Xp (Amd32) On Ubunus On A Ubuntu

LOCKSS on LINUX. Installation Manual and the OpenBSD Transition 02/17/2011

Copy Documents from your Computer (H Drive) to a Flash Drive

Print Server Application Guide. This guide applies to the following models.

mypro Installation and Handling Manual Version: 7

Colorfly Tablet Upgrade Guide

INSTALL ZENTYAL SERVER

Marvell SATA3 RAID Installation Guide

Addonics T E C H N O L O G I E S. NAS Adapter. Model: NASU Key Features

A Study of Data Management Technology for Handling Big Data

The Tor VM Project. Installing the Build Environment & Building Tor VM. Copyright The Tor Project, Inc. Authors: Martin Peck and Kyle Williams

Kognitio Technote Kognitio v8.x Hadoop Connector Setup

Avira Rescue System. HowTo

Ubuntu To install the required card reader software:

NIST/ITL CSD Biometric Conformance Test Software on Apache Hadoop. September National Institute of Standards and Technology (NIST)

SETTING UP A LAMP SERVER REMOTELY

Transcription:

CDH3 Single Node Installation Guide Dell Server Configuration Guide Ubuntu 10.04.4 LTS Desktop Installation Guide Created: 01-12-2015 Author: Hyun Kim Last Updated: 01-12-2015 Version Number: 0.1 Contact info: hyunk@loganbright.com Krish@loganbriht.com

Downloading Ubuntu 10.04.4 LTS Desktop 1. In order to run CDH3, we need an operating system. For this particular demonstration, we are going to use Ubuntu Desktop. No, you don t need to remove your current operating system. Ubuntu is quite light and you can install in ON your current operating system. The best part is, if you don t like it, you can easily remove it. No hard feelings. Sounds good? Let s get started. 2. Before you do ANYTHING and I mean ANYTHING, you need to check what operating system CDH3 supports. Our ultimate goal is to install CDH3 on the Ubuntu. You can check the requirements for CDH3 by clicking the link below: http://www.cloudera.com/content/cloudera/en/documentation /archives/cdh3/v3u6/cdh3-quick-start/cdh3qs_topic_2.html

3. For this demonstration, we are going to install Ubuntu 10.04 LTS (Lucid Lynx) Desktop 64-bit. CDH3 supports 32-bit operating systems. Yet, according to Cloudera, for production environments, 64-bit packages are recommended. Therefore, be aware. 4. Ubuntu is a free operating system and you can download it from the link below. Click the 64-bit PC (AMD64) desktop CD link and it will automatically start downloading. http://old-releases.ubuntu.com/releases/lucid/

5. If you successfully downloaded Ubuntu disk image file on Google Chrome like I did, it will be saved on your Downloads folder. However, if you are unsure where the file is saved, click the down arrow button next to the download icon and it will give you a list of options. Select Show in folders, which will open up the folder where the Ubuntu disk image is downloaded. 5. Done! Creating a bootable Ubuntu USB Flash Drive 1. We downloaded Ubuntu and now we need to install it. In this tutorial, I m trying to install the Ubuntu on a server. In order to do this, I have a couple options. However, I have a laptop with

Windows 7 installed on it and I happen to have a 7gb usb flash drive. If you looked at the picture above you know what I m going to do. We are going to create a bootable Ubuntu USB Flash Drive! This is already well explained on the official Ubuntu website. I will leave some links below. Creating a bootable USB stick on Windows. http://www.ubuntu.com/download/desktop/create-a-usb-stick-onwindows Creating a bootable USB stick on Ubuntu. http://www.ubuntu.com/download/desktop/create-a-usb-stick-on-ubuntu Creating a bootable USB stick on OS X http://www.ubuntu.com/download/desktop/create-a-usb-stick-on-macosx Download Universal USB Installer http://www.pendrivelinux.com/universal-usb-installer-easy-as-1-2- 3/#button

2. Since I m currently using Windows 7, I will click on the first link and follow the instruction. However, Universal USB installer failed to recognize my USB Flash Drive. Therefore, I had to activate the Now Showing All Drives button in order to select my USB Flash Drive. On my computer, F is the USB Flash Drive.

3. If you have anything on your USB flash drive, activate the format option. As a matter of fact, my USB driver was already formatted but just to be safe, I formatted it again on the installer. Now you may click create button to create a bootable Ubuntu USB Flash Drive.

4. Once the installation is successfully done, you will see what s shown in the picture below. You ve created a bootable Ubuntu USB Flash Drive.

Creating New Virtual Disk 1. In this tutorial, I m using a Dell PowerEdge server. Turn on the server and press Ctrl+R to run configuration utility. You will see the screen below.

2. Press F2 while Controller 0 is selected. Select Create New VD and press Enter. 3. I m going to set RAID Level: RAID-1. To select drivers, use the spacebar. Use the tab key to go to Basic Settings. Name the VD and I left the Advanced Settings unchanged. You may configure the Advanced Settings as you wish if your server allows to. Select OK and press Enter to create a new virtual disk.

4. Let the virtual disk to initialize. This may take a couple hours but it s better to get it done now than later. Once it s done, restart the server by using the Ctrl+Alt+Delete key command. Installing Ubuntu 1. Press F11 key to run BIOS boot manager after you restart the server. Insert the Ubuntu bootable USB Flash Drive to the server. 2. You will see the options as shown in the picture below. Select Hard Drive C: by using the down arrow key and select From USB: option on the list. This will boot your USB flash drive.

3. Select Install Ubuntu and press Enter.

4. Change settings appropriately and press Continue button until installation is being started.

5. Wait until the installation is completed. Once the installation is done, we are almost ready to install CDH3. Download JDK and Install it 1. Download JDK from the link below.

http://www.oracle.com/technetwork/java/javase/downloads/javaarchive-downloads-javase6-419409.html#jdk-6u26-oth-jpr 2. Since Cloudera recommends version 1.6.0_25 we will be installing that version of JDK. To extract and install jdk-6u26-linux-x64.bin, open Terminal and do the followings. 3. Copy the file to /usr/local by using the commands below $ cd Downloads (Assuming that the JDK file is saved on Downloads folder) $ sudo cp jdk-6u26-linux-x64.bin /usr/local (this copies the file to /usr/local) $ cd /usr/local $ sudo sh jdk-6u26-linux-x64.bin

Download and installing CDH3 package 1. Now we are finally ready to install CDH3. Click the link below to download CDH3 package. We installed Ubuntu 12.04 Lucid Lynx. http://www.cloudera.com/content/cloudera/en/documentation/archives/ cdh3/v3u6/cdh3-installation-guide/cdh3ig_topic_4_4.html $ cd $ sudo dpkg -i Downloads/cdh3-repository_1.0_all.deb $ sudo apt-get update

Install Hadoop packages $ apt-cache search hadoop $ sudo apt-get install hadoop-0.20 hadoop-0.20-native (Press y and then enter to continue installation) $ sudo apt-get install hadoop-0.20-namenode $ sudo apt-get install hadoop-0.20-datanode $ sudo apt-get install hadoop-0.20-secondarynamenode $ sudo apt-get install hadoop-0.20-jobtracker $ sudo apt-get install hadoop-0.20-tasktracker Set JAVA_HOME and HADOOP_HOME 1. Now we ve installed all the hadoop packages. Are we done? No, not quite yet. We need to set JAVA_HOME and HADOOP_HOME so that the system can recognize what s installed. $ cd /usr/lib/hadoop-0.20/bin $ gedit ~/.bashrc Once the.bashr is opened, on the bottom of the file, copy and paste these:

export HADOOP_HOME=/usr/lib/hadoop export PATH=$PATH:/usr/lib/hadoop/bin export JAVA_HOME=/usr/local/jdk1.6.0_26 export PATH=$PATH:/usr/local/jdk1.6.0_26/bin Leave everything else unchanged. Your java path might be different if you are using different Ubuntu version or different version of jdk. 2. Run the following commands to see if JAVA_HOME and HADOOP_HOME are set correctly. 3. If the commands output nothing, close the terminal and try the commands again by reopening terminal. 4. If you still don t get any output from the commands, go back to the previous step and see if you misspelled anything or if there is any extra comma when you edited.bashrc file. 5. If you see what s in the picture below, you ve set JAVA_HOME and HADOOP_HOOP properly.

Find out Hadoop and Java version 1. $ hadoop version $ java -version If it prints out something similar to what s shown in the picture above, you ve done everything correctly so far. Adding users to group hadoop $ sudo gpasswd -a hdfs hadoop $ sudo gpasswd -a mapred hadoop Change settings of core-site.xml

$ sudo mkdir /ysr/lib/hadoop/tmp $ cd /usr/lib/hadoops/ $ sudo chmod 750 tmp/ $ sudo chown hdfs:hadoop tmp/

hdfs-site.xml $ sudo mkdir /storage $ sudo chmod 775 /storage/ $ chown hdfs:hadoop /storage/ mapred-site.xml

$ sudo mkdir /home/ cdh3/mapred $ sudo chmod 775 /home/ cdh3 /mapred $ sudo chown mapred:hadoop /home/ cdh3 /mapred User Assignment Format namenode Type and enter the commands below. $ cd /usr/lib/hadoop/bin/ $ sudo -u hdfs hadoop namenode -format

When I tried to format namenode, this error occurred. In this case, we just need to edit a few things so that Hadoop-config can read jdk1.6.0_26. No big deal. First, to open hadoop-config $ cd /usr/lib/hadoop/bin/ $ sudo gedit hadoop-config.sh This should fix the problem. Save and try again. $ sudo -u hdfs hadoop namenode -format will give you this screen below

Start Daemons $ sudo /etc/init.d/hadoop-0.20-namenode start $ sudo /etc/init.d/hadoop-0.20-secondarynamenode start $ sudo /etc/init.d/hadoop-0.20-jobtracker start $ sudo /etc/init.d/hadoop-0.20-datanode start $ sudo /etc/init.d/hadoop-0.20-tasktracker start $ netstat -ptlen

Checking UI On your internet browser, type localhost:50030 to open the NameNode page

On your internet browser, type localhost:50070 to open the Map/Reduce administration page

If you see the pages above, you have successfully installed CDH3.