Qsoft Inc www.qsoft-inc.com




Big Data & Hadoop Qsoft Inc www.qsoft-inc.com

Course Topics
  Week 1: Introduction to Big Data, Hadoop Architecture and HDFS
  Week 2: Setting up Hadoop Cluster
  Week 3: MapReduce Part 1
  Week 4: MapReduce Part 2
  Week 5: Apache Pig
  Week 6: Apache Hive and HiveQL

Course Topics (continued)
  Week 7: Apache Flume, Apache Sqoop, Apache Oozie
  Week 8: NoSQL Databases, MongoDB and Apache Cassandra
  Week 9: Apache HBase
  Week 10: Apache ZooKeeper
  Week 11: Hadoop 2.0, YARN, MRv2
  Week 12: Project and Certification

Week 1: Introduction to Big Data, Hadoop Architecture and HDFS
  What is Big Data and why it is important now
  Main vendors: Cloudera and Hortonworks
  Limitations of traditional large-scale system architecture
  How Hadoop overcomes the limitations of traditional large-scale system architecture
  History of Hadoop
  Core components of Hadoop
  Hadoop master-slave architecture
  NameNode, DataNode, Secondary NameNode
  JobTracker, TaskTracker
  HDFS architecture
  Anatomy of reading and writing data on HDFS
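
As an illustration of the HDFS write path named above, the following minimal sketch shows how a file might be cut into fixed-size blocks and how each block's replicas could be spread over DataNodes. It is not the course's own material. The function name plan_blocks and the node names are hypothetical, and the round-robin placement is a simplifying assumption (real HDFS placement is rack-aware). The 128 MB block size and replication factor of 3 are the Hadoop 2.x defaults.

```python
BLOCK_SIZE = 128 * 1024 * 1024  # 128 MB, the default dfs.blocksize in Hadoop 2.x
REPLICATION = 3                 # default dfs.replication


def plan_blocks(file_size, datanodes, block_size=BLOCK_SIZE, replication=REPLICATION):
    """Return a list of (block_index, block_length, replica_nodes) tuples.

    Placement is plain round-robin, an assumption made for illustration;
    real HDFS uses a rack-aware placement policy.
    """
    if replication > len(datanodes):
        raise ValueError("need at least as many DataNodes as replicas")
    # Integer ceiling division: number of blocks needed to hold the file.
    num_blocks = (file_size + block_size - 1) // block_size
    plan = []
    for i in range(num_blocks):
        # The last block only holds the remaining bytes.
        length = min(block_size, file_size - i * block_size)
        replicas = [datanodes[(i + r) % len(datanodes)] for r in range(replication)]
        plan.append((i, length, replicas))
    return plan


if __name__ == "__main__":
    nodes = ["dn1", "dn2", "dn3", "dn4"]  # hypothetical DataNode names
    for block in plan_blocks(300 * 1024 * 1024, nodes):
        print(block)
```

A 300 MB file yields two full 128 MB blocks and one 44 MB block, each stored on three different nodes. In real HDFS the NameNode keeps only this kind of metadata, while the DataNodes hold the block contents.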

Week 2: Setting up Hadoop Cluster
  MapReduce framework architecture
  Hadoop deployment modes: standalone, single node, multinode
  Configuration files in a Hadoop cluster
  Web URLs for Hadoop
  Running HDFS and Linux commands
  Installation of Hadoop
  VM installation steps for Windows
  Manual for multinode Hadoop cluster installation on AWS

Week 3: MapReduce Part 1
  MapReduce process
  Anatomy of a MapReduce program
  MapReduce flow
  Concept of Mappers, Reducers, Combiners
  Splits and blocks
  Writing MapReduce Mappers, Reducers and Combiners in Java using Eclipse
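
The MapReduce flow listed above (map, combine, shuffle, reduce) can be sketched in a few lines of plain Python. This is an in-memory simulation for illustration only, not Hadoop code. The course itself writes these components in Java, and all function names here are hypothetical.

```python
from collections import defaultdict


def mapper(line):
    """Emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def combiner(pairs):
    """Pre-aggregate one map task's output locally to reduce shuffle traffic."""
    partial = defaultdict(int)
    for word, count in pairs:
        partial[word] += count
    return list(partial.items())


def shuffle(all_pairs):
    """Group values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for word, count in all_pairs:
        grouped[word].append(count)
    return grouped


def reducer(word, counts):
    """Sum all partial counts for one key."""
    return word, sum(counts)


def word_count(splits):
    """Run map -> combine -> shuffle -> reduce over a list of input splits."""
    combined = []
    for split in splits:  # one simulated map task per input split
        mapped = [pair for line in split for pair in mapper(line)]
        combined.extend(combiner(mapped))
    grouped = shuffle(combined)
    return dict(reducer(word, counts) for word, counts in sorted(grouped.items()))


if __name__ == "__main__":
    print(word_count([["big data big"], ["data hadoop"]]))
```

The combiner works here because summation is associative and commutative, so partial sums per map task give the same final counts as summing every individual pair in the reducer.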

Week 4: MapReduce Part 2
  Different input and output formats
  Hadoop data types
  Using the Writable and WritableComparable interfaces
  Custom input format
  Sequence files
  JUnit and MRUnit testing frameworks
  Writing and running unit tests

Week 5: Apache Pig
  Introduction to Pig
  Why Pig and not MapReduce
  Pig components
  Pig execution modes
  Pig shell: Grunt
  Pig Latin, writing Pig Latin scripts
  Pig data types
  Pig operators: arithmetic, relational
  Storage types
  Diagnosing Pig commands
  UDFs and external scripts

Week 6: Apache Hive and HiveQL
  Introduction to Hive
  History of Hive and Facebook
  Pig vs. Hive
  Hive architecture, MetaStore
  Hive data types
  Hive DDL
  Hive DML commands
  HiveQL: importing data, sorting and aggregating
  Writing join queries and inserting data back into Hive
  UDF and UDAF
  Choosing between Pig, Hive and MapReduce

Week 7: Apache Flume, Apache Sqoop, Apache Oozie
  Overview of Flume
  Flume architecture
  Using Flume to load data into HDFS and Hive
  Overview of Sqoop
  Using Sqoop to import data from RDBMS into HDFS and Hive
  Using Sqoop to export data from HDFS into RDBMS
  Sqoop connectors
  Introduction to Oozie
  Oozie workflow jobs
  Oozie coordinator jobs
  Using HUE UI for Oozie
  Using CLI to run and track workflows

Week 8: NoSQL Databases, MongoDB and Apache Cassandra
  Introduction to NoSQL databases
  Types of NoSQL databases and their features
  Brewer's CAP theorem
  Advantages of NoSQL vs. traditional RDBMS
  Introduction to MongoDB
  MongoDB architecture
  MongoDB documents and CRUD operations
  Introduction to Apache Cassandra
  Overview of Cassandra: data model, reading/writing data, CQL
  MongoDB vs. Cassandra

Week 9: Apache HBase
  Introduction to HBase
  HBase architecture: read and write paths
  HBase vs. RDBMS
  Installation and configuration
  Schema design in HBase: column families, hotspotting
  Accessing data with HBase Shell
  Accessing data with HBase API
  Scan and advanced API
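
Hotspotting, mentioned under schema design above, arises when sequential row keys such as timestamps all land in the same region. One common mitigation is to salt the key with a short prefix derived from a hash of the key. The sketch below illustrates the idea in plain Python. It does not use the HBase API, and the bucket count, key format and function name are assumptions chosen for the example.

```python
import zlib

NUM_BUCKETS = 4  # hypothetical; often chosen close to the number of region servers


def salted_key(row_key, num_buckets=NUM_BUCKETS):
    """Prefix a row key with a deterministic bucket number.

    The bucket is derived from a CRC32 checksum of the key, so the same
    key always maps to the same bucket and can be recomputed on reads.
    """
    bucket = zlib.crc32(row_key.encode("utf-8")) % num_buckets
    return f"{bucket:02d}-{row_key}"


if __name__ == "__main__":
    # Sequential keys that would otherwise sort next to each other.
    for i in range(5):
        print(salted_key(f"event-{i:04d}"))
```

The trade-off is that a range scan over the original key order must now be issued once per bucket and the results merged, so salting suits write-heavy tables more than scan-heavy ones.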

Week 10: Apache ZooKeeper
  Overview of ZooKeeper
  Uses of ZooKeeper
  ZooKeeper service
  ZooKeeper data model
  Using ZooKeeper with HBase
  Building applications with ZooKeeper

Week 11: Hadoop 2.0, YARN, MRv2
  Features in Hadoop 2.0
  NameNode High Availability
  Federation and namespaces
  Schedulers
  Introduction to YARN
  YARN architecture
  Upgrading MRv1 to MRv2
  Developing applications using MapReduce version 2

Week 12: Project and Certification
  Openly available large datasets
  Use Flume and Sqoop to load data into HDFS; use Hive, Pig and HBase to analyze the data
  Use Oozie to schedule and chain your Hadoop jobs
  Become a Certified Big Data Professional
    Cloudera Certified Professional: Data Scientist (CCP:DS)
    Cloudera Certified Developer for Apache Hadoop (CCDH)
    Cloudera Certified Administrator for Apache Hadoop (CCAH)
    Cloudera Certified Specialist in Apache HBase (CCSHB)

Thank You