Data Analyst Program- 0 to 100

Development: Master data analysis tools like Pig and Hive
Data Science: Build a recommendation engine

HADOOP SCHOOL OF TRAINING

Basics: Learn the basics of Big Data and Hadoop
Hands On: Play with Hadoop and the Hadoop ecosystem
Data Analysis: Become a top-notch data analyst

Data Analyst Program- 0 to 100 (40 Hours)

Overview of the course:
The Data Analyst Program is a one-stop course that introduces you to the domain of data analysis and gives you the technical know-how to practice it. At the end of this course you will be able to earn a data analyst credential and will be capable of analyzing terabyte-scale data.

Who is this course for, and who is it not for?
For: Professionals with basic knowledge of software development, programming languages, and databases will find this course most helpful. Basic knowledge is enough to succeed in this course.
Not for: Absolute beginners in software development will find the course difficult to follow.

Phase 1: Hadoop Fundamentals (20 Hours)
Getting the Basics Right

Big Data
- What is Big Data
- Dimensions of Big Data
- Big Data in Advertising
- Big Data in Banking
- Big Data in Telecom
- Big Data in E-commerce
- Big Data in Healthcare
- Big Data in Defence
- Processing options for Big Data
- Hadoop as an option

Hadoop
- What is Hadoop
- How Hadoop works
- HDFS
- MapReduce
- How Hadoop has an edge

Hadoop Ecosystem
- Sqoop
- Oozie
- Pig
- Hive
- Flume

Hadoop Hands On
- Setting up Hadoop on a single-node cluster
- Running HDFS commands
- Running your MapReduce program
- Running Sqoop Import and Sqoop Export
- Creating Hive tables directly from Sqoop
- Creating Hive tables
- Querying Hive tables
- Running an Oozie workflow
- Analyzing Twitter data using Flume

Multinode Setup
- Setting up a multinode cluster on Amazon EC2
- Setting up a multinode cluster on the classroom machines
- Setting up Cloudera Manager on the cloud
- Setting up Cloudera Manager on a local setup

Cluster Capacity Planning
Level 1: Mini Project
Level 1: Evaluation Test (50 marks)

Phase 2: Data Analyst (20 Hours)
Becoming a pro data analyst

Pig
- Basic Data Analysis
- Complex Data Analysis
- Multi Data Set Analysis
- UDFs in Pig
- Troubleshooting and Optimizing Pig
- Pig Hands On

Hive
- Basic Data Analysis with Hive
- Hive Data Management
- Text Processing with Hive
- Transformations in Hive
- Optimizing Hive
- Hive Hands On

Data Analysis Using Pentaho as an ETL Tool
- Setting up Pentaho
- Loading data to HDFS
- Loading data to Hive
- Aggregation through MapReduce
- Transforming data with Hive
- Transforming data with Pig
- Loading data from HDFS to RDBMS
- Loading data from Hive to RDBMS
- Reporting on HDFS data
- Reporting on Hive data

Impala
- Data Analysis using Impala

Mini Project using Pig and Hive
Evaluation Test (50 marks)

Trainer Profile

Experienced: 8+ years of enterprise software development experience
Certified: Hadoop, HBase, and MapR certified
Customers: Served customers like Accenture, HP, Genpact, Mastek, and Cisco

About the trainer
Trainer's Certifications: CCAH, CCHD, CCHSB, MapR M5, Zend, SCJP, SCWCD