Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP




What is this course about?

Big Data is a collection of large and complex data sets that cannot be processed using regular database management tools or processing applications. In recent times, the volume of data generated each day has boomed, making data handling a daunting task. The year 2011 saw 1.8 zettabytes of data produced; since then, data production has been doubling every two years. Furthermore, over 90% of the world's data was generated in the past two years. For instance:

eBay handles a 90-petabyte data warehouse
Facebook handles 50 billion photos from its users
Walmart generates 2,560 terabytes of data every hour

Hadoop was born to address the concerns associated with managing this ever-increasing amount of data. It lets you stay on top of the data explosion. Hadoop has now become the new mandate, and its momentum keeps growing as it takes root in enterprises.

WE PROVIDE SKILLS FOR INTERNATIONAL CERTIFICATION
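The growth figures quoted in the course description (1.8 zettabytes in 2011, doubling every two years) amount to a simple exponential projection. The sketch below is only a rough illustration: it assumes the doubling continues unchanged, which is an extrapolation and not a figure from this brochure. The function name `projected_zettabytes` is hypothetical, and the terabyte-to-petabyte conversion assumes binary units (1 PB = 1,024 TB).

```python
def projected_zettabytes(year, base_year=2011, base_zb=1.8, doubling_years=2):
    """Project yearly data volume, assuming it doubles every `doubling_years`."""
    return base_zb * 2 ** ((year - base_year) / doubling_years)


# Projection for the base year and the two following doubling periods.
for year in (2011, 2013, 2015):
    print(year, round(projected_zettabytes(year), 1))

# The Walmart figure expressed in petabytes (assumption: 1 PB = 1,024 TB).
walmart_pb_per_hour = 2560 / 1024
print(walmart_pb_per_hour)
```

Under these assumptions the volume doubles from 1.8 to 3.6 zettabytes by 2013, and 2,560 terabytes per hour works out to 2.5 petabytes per hour.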

Big Data Hadoop Course Agenda

Lessons

1. Introduction to Big Data and Hadoop
   a. What is Big Data?
   b. Types of Data
   c. Need for Big Data
   d. Characteristics of Big Data
   e. Traditional IT Analytics Approach
   f. Big Data Use Cases
   g. Handling Limitations of Big Data
   h. Introduction to Hadoop
   i. History and Milestones of Hadoop

2. Getting Started with Hadoop
   a. VMware Player Introduction
   b. Installing VMware Player
   c. Setting up the Virtual Environment
   d. Using Oracle VirtualBox to Open a VM

3. Hadoop Architecture
   a. Hadoop Cluster on Commodity Hardware
   b. Hadoop Core Services and Components
   c. Regular File System vs. Hadoop
   d. HDFS Layer
   e. HDFS Operation Principle

4. Hadoop Deployment
   a. Introduction to Ubuntu Server
   b. Hadoop Installation
   c. Single-Node and Multi-Node Configuration
   d. Hadoop Configuration in a Cluster Environment
   e. Installing Hadoop 2.0

5. MapReduce
   a. Introduction to MapReduce
   b. Hadoop MapReduce Example
   c. Hadoop MapReduce Characteristics
   d. Setting up your MapReduce Environment
   e. Building a MapReduce Program
   f. MapReduce Requirements and Features
   g. MapReduce Java Programming in Eclipse
   h. Checking the Hadoop Environment for MapReduce
   i. MapReduce 2.0

6. Advanced HDFS & MapReduce
   a. HDFS Benchmarking
   b. Setting up HDFS Blocks
   c. Decommissioning a DataNode
   d. Advanced MapReduce
   e. Hadoop Data Types
   f. InputFormats in MapReduce
   g. OutputFormats in MapReduce
   h. Distributed Cache
   i. Joins in MapReduce

7. Pig
   a. Introduction to Pig
   b. Components of Pig
   c. Pig Data Model
   d. Pig Modes
   e. Pig vs. SQL
   f. Installing Pig Engine
   g. Datasets for Pig Development
   h. Pig Latin
   i. Filtering and Transforming Data
   j. Grouping and Sorting
   k. Combining and Splitting
   l. Pig Commands

8. Hive
   a. Why Another Data Warehousing System?
   b. What is Hive?
   c. Characteristics of Hive
   d. System Architecture and Components of Hive
   e. Hive Data Models
   f. Serialization/De-serialization
   g. Hive File Formats
   h. Hive Query Language
   i. Hive: Installing, Running, and Programming
   j. Hive Functions
   k. Difference between Hive and Pig

9. HBase
   a. HBase Introduction
   b. Characteristics of HBase
   c. HBase Architecture
   d. Storage Model of HBase
   e. When to Use HBase
   f. HBase Data Model
   g. HBase Families
   h. HBase Components
   i. Row Distribution between Region Servers
   j. Data Storage
   k. Installation of HBase
   l. Configuration of HBase
   m. HBase Shell Commands

10. Commercial Distributions of Hadoop
   a. Cloudera
   b. Downloading Cloudera Quickstart VM
   c. Starting the Cloudera VM
   d. Exploring the Welcome Page
   e. Understanding Hue
   f. Understanding Cloudera Manager
   g. Hortonworks Data Platform
   h. MapR Data Platform
   i. Pivotal HD
   j. IBM InfoSphere BigInsights

11. ZooKeeper, Sqoop, and Flume
   a. Introduction to ZooKeeper
   b. Features of ZooKeeper
   c. Challenges Faced in Distributed Applications
   d. Coordination
   e. ZooKeeper: Goals and Uses
   f. ZooKeeper: Entities, Data Model, Services
   g. Client APIs
   h. Recipes of ZooKeeper
   i. Introduction to Sqoop (Why, What, Processing, Under the Hood)
   j. Importing Data into Hive
   k. Importing Data into HBase
   l. Exporting Data from Hadoop Using Sqoop
   m. Sqoop Connectors
   n. Introduction to Flume
   o. Flume Use Cases
   p. Configuring and Running Flume Agents

12. Ecosystem and its Components
   a. Hadoop Ecosystem
   b. Components Overview
   c. Overview of Apache Oozie
   d. Overview of Mahout
   e. Overview of Apache Cassandra
   f. Apache Spark

13. Hadoop Administration and Troubleshooting
   a. Commands Used in Hadoop Programming
   b. Different Configurations of Hadoop Cluster
   c. Port Numbers for Individual Hadoop Services
   d. Performance Monitoring
   e. Performance Tuning
   f. Troubleshooting and Log Observation
   g. Overview of Apache Ambari
   h. Hadoop Security Using Kerberos
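Lesson 5 of the agenda covers the MapReduce programming model. As an illustration only, the sketch below simulates the map, shuffle, and reduce steps of the classic word-count job in plain Python. It does not use Hadoop or its Java API, the function names are hypothetical, and the examples taught in the course may differ.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, as a MapReduce framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Sum the counts collected for one word."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence on an in-memory list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


counts = word_count(["big data", "Big Data and Hadoop"])
print(counts)
```

In a real cluster the map and reduce functions run in parallel on different nodes and the framework performs the shuffle; here everything runs in one process so the data flow is easy to follow.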