WROX Certified Big Data Analyst Program by AnalytixLabs and Wiley




Disclaimer: This material is protected under copyright law (AnalytixLabs, 2011). Unauthorized use and/or duplication of this material or any part of it, including data, in any form without explicit written permission from AnalytixLabs is strictly prohibited. Any violation of this copyright will attract legal action.

Learn to Evolve

About AnalytixLabs

AnalytixLabs is a capability building and training solutions firm led by McKinsey, IIM and IIT alumni with deep industry experience and a flair for coaching. We are focused on helping our clients develop skills in basic and advanced analytics so that they emerge as industry-ready professionals and enhance their career opportunities. To know more about us or our faculty, please visit our website.

Bottom line

Approach
  80-20 focus on practical & theory
Content
  World class course structure
  Surpasses industry requirements
Faculty
  Seasoned analytics professionals
  Together we have 30+ years of experience with prestigious firms like McKinsey, KPMG, Deloitte and AOL

Job-oriented training
Lucrative job prospects in a high growth domain
Support for relevant certifications and diplomas
Career counseling and planning
Personal attention and individual counselling
Caters to standard certifications
Regular sessions by industry experts
Value for money with high return on investment
Industry best practices
High quality course material and real life case studies

Candidates trained by us are working in leading companies across industries.

www.analytixlabs.co.in

Global Big Data Talent Skill Gap

The McKinsey Global Institute estimates a shortage of nearly 1.7 million big data professionals by 2018. This includes a shortage of 140,000 to 190,000 workers with deep technical and analytical expertise, and a shortage of 1.5 million managers and analysts equipped to work with and use big data outputs.

Program Objective

The WCBDA program aims to provide its students an international, wide-spectrum qualification for job-readiness and seamless absorption into Big Data job roles. The program will expose students and professionals to the roles of Big Data Analysts who have:

  Ability to think analytically
  Understanding of storage, retrieval and mining of data
  Explanatory analysis & predictive modeling skills
  Outcome-oriented and global industry-specific expertise in critical data analytics and data management skills
  Hands-on practical skills on Big Data tools R and Hadoop (MapReduce, HBase, Hive, Pig, Oozie, Sqoop, Mahout, ZooKeeper and Flume) and data visualization with Tableau
  Application of analytics in various domains, like Retail, Telecom, BFSI etc.
  Skills to leverage analytics to drive smart business decisions

AnalytixLabs has collaborated extensively with industry experts to put together a program that is rigorous, effective and relevant.

WCBDA is a comprehensive program and encompasses the following modules, along with projects at the end.

Module 1: Analytics using R and Tableau (42 hours + Practice)
  Introduction to the R environment
  Data Input & Output
  Data Manipulation
  Visualization
  Basic Statistics
  Advanced Analytics
  Data Visualization using Tableau
  Machine Learning using R
  Social Media Analytics using R
  Applying overall Learning

Module 2: Big Data Hadoop (30 hours + Practice)
  Introduction to Big Data & Hadoop
  Hadoop Architecture
  MapReduce
  R-Hadoop
  Introduction to Flume & Sqoop
  PIG
  HIVE
  HBase
  Mahout
  ZooKeeper
  Misc Components
  Applying overall Learning

Crafted by a team of experts, the program maintains a balance between theoretical concepts and practical applications.

Business Analytics using R & Tableau
Duration: 42 hours + Practice sessions

Introduction to the R environment
  The Workspace
  Input/Output
  Useful Packages (Base & other packages) in R
  Graphical User Interfaces (RStudio)
  Customizing Startup
  Batch Processing
  Reusing Results

Data Input & Output (Importing & Exporting)
  Data Structure & Data Types (Vectors, Matrices, Factors, Data frames, and Lists)
  Importing Data (importing data from csv, txt, Excel and other files)
  Keyboard Input (creating input by entering data)
  Database Input (connecting to a database and using the data)
  Exporting Data (exporting files into different formats)
  Viewing Data (viewing partial data and full data)
  Variable & Value Labels
  Date Values
  Missing Data

Data Manipulation
  Creating New Variables (calculations & binning)
  Operators (using multiple operators)
  Built-in Functions & User Defined Functions
  Control Structures (conditional statements, loops)
  Sorting Data
  Merging and Appending Data
  Aggregating Data
  Reshaping Data
  Subsetting Data
  Data Type Conversions

Visualization
  Creating Graphs
  Histograms & Density Plots
  Dot Plots
  Bar Plots
  Line Charts
  Pie Charts
  Boxplots
  Scatterplots

Basic Statistics (Exploratory Analysis)
  Descriptive Statistics (central tendency/variance)
  Frequency Tables/Summarization
  Hypothesis Testing
  t-tests/z-test (1-sample, independent sample, paired sample)
  Analysis of Variance (ANOVA)
  Correlations/chi-square test

Business Analytics using R & Tableau (continued)

Advanced Analytics (Advanced Statistics)
  Introduction to predictive modeling & applications
  Linear (Simple & Multiple) Regression
  Logistic Regression
  Introduction to segmentation
  Segmentation using cluster analysis

Data Visualization using Tableau
  Introduction to Tableau & Environment
  Building basic views & sharing your work (overview)
  Data importing & manipulation
  Maps/Tables/Calculated fields
  Parameters
  Data visualization with charts and maps
  Building & customizing Reports
  Building & customizing Dashboards

Machine Learning using R
  What is Machine Learning?
  Applications of Machine Learning
  Algorithms
  Classification & Regression Problems
  Training & Testing concepts
  Cost & optimization functions
  Artificial Neural Networks (ANN)
  Support Vector Machines (SVM)
  Decision Trees & Random Forest
  Bayesian Network case

Social Media Analytics using R
  Social Media: Characteristics of Social Media
  Applications of Social Media Analytics
  Metrics (Measures, Actions) in social media analytics
  Examples & Actionable Insights using Social Media Analytics
  Text Analytics: Sentiment Analysis using R
  Text Analytics: Word cloud analysis using R

Projects (Applying overall Learning)
  Solve business problems using R/Tableau

Big Data Hadoop
Duration: 30 hours + Practice sessions

Introduction to Big Data & Hadoop
  What is Big Data?
  Types of Data
  Characteristics of Big Data
  Need for understanding Big Data (Application of Big Data)
  Traditional Approaches and their limitations
  Introduction to Hadoop and its ecosystem
  Getting Started with Hadoop (software installation etc.)

Hadoop Architecture
  Hadoop Commercial version vs Apache Hadoop
  Hadoop Cluster on commodity hardware
  Hadoop core components
  HDFS layer
  HDFS operation principle
  Basic Hadoop commands

MapReduce
  Introduction to MapReduce
  Hadoop MapReduce example
  Hadoop MapReduce Characteristics
  Setting up your MapReduce Environment
  Building a MapReduce Program
  Input Formats in MapReduce
  Output Formats in MapReduce
  Basic MapReduce Programming using R

R-Hadoop
  Introduction to RHdfs, Rmr and Rhbase
  Develop MapReduce code using R for local & Hadoop environments
  Exploratory analysis using R-Hadoop
  Predictive analytics using R-Hadoop
  Overview of Parallelization using R without Hadoop

Introduction to Flume & Sqoop
  Introduction to Sqoop (why, what, processing, under the hood)
  Exporting data from Hadoop using Sqoop
  Introduction to Flume
  Flume Use Cases
  Hands-on Exercise using Flume and Sqoop

PIG
  Introduction to PIG
  Components of PIG
  PIG Data Model
  Creating MapReduce programs using PIG
  Hands-on Exercise using PIG

Big Data Hadoop (continued)

HIVE
  Introduction to HIVE and its characteristics
  Components of HIVE
  HIVE Data Models
  Serialization/De-serialization
  HIVE file formats
  HIVE Query Language
  HIVE Functions
  Difference between HIVE and PIG
  Hands-on Exercise using HIVE

HBase
  HBase introduction and its Characteristics
  HBase Architecture
  Storage Model of HBase
  When to use HBase
  HBase Data Model
  HBase Families
  HBase Components
  Data Storage
  Hands-on Exercise using HBase

Mahout
  Mahout introduction and its Characteristics
  Mahout Architecture
  When to use Mahout
  Machine Learning topics covered in Mahout
  Hands-on Exercise using Mahout

ZooKeeper
  Introduction to ZooKeeper & its Features
  Challenges faced in distributed applications
  Coordination
  ZooKeeper: Goals and Uses
  ZooKeeper: Entities, Data Model, Services

Misc Components
  Overview of Apache Oozie
  Overview of Storm
  Overview of Apache Cassandra
  Overview of Apache Spark
  Overview of H2O
  Social Media Analytics (Text Analysis, Word cloud)

Project (Applying overall Learning)
  Solve business problems using all the components of Hadoop

Time and investment

Big Data Analytics: 72 hours + Practice, INR 32,000 / $700 (introductory price)
Certification Cost: INR 2,000 / $50 (only applicable for WCBDA students)
Duration: 12 weekends, 72 hours of live training on Saturday and Sunday (3 hours each) + Practice
Training mode: Fully interactive online class (in addition, you will also get access to the recordings for self study and practice)
Components: Content resources in print and e-format, simulations and videos, virtual lab with software and datasets, industry-relevant project work
Certification: Participants will be awarded an international certificate on successful completion of the stipulated requirements, including an evaluation at the end
Placements: AnalytixLabs has an extensive industry network to facilitate placements for its students

AnalytixLabs

Free career counseling and job assistance

  Extensive industry network to facilitate placements
  Identify available career options for an individual
  Resume writing and interview preparation
  Recommend additional skill sets and training/courses to enhance employability
  Structure and define a career path
  Estimate the economic goals and compensation trends
  Advise on and design strategies to meet individual goals

We provide training in a fully interactive online class

  Saves commuting time and resources in today's chaotic world
  Delivered lectures are recorded and can be replayed by individuals as per their needs
  Fully interactive live online class with personal attention
  Best use of time and resources
  One of the strongest global trends in education, even in developing countries
  Access to quality training and 24x7 practice sessions from the comfort of your place
  Studies prove that online education beats the conventional classroom

Contact us

Visit us on: http://www.analytixlabs.in/
For course registration, please visit: http://www.analytixlabs.co.in/course-registration/
For more information, please contact us: http://www.analytixlabs.co.in/contact-us/
Or email: info@analytixlabs.co.in
Call us, we would love to speak with you: (+91) 88021-73069

Join us on:
  Twitter - http://twitter.com/#!/analytixlabs
  Facebook - http://www.facebook.com/analytixlabs
  LinkedIn - http://www.linkedin.com/in/analytixlabs
  Blog - http://www.analytixlabs.co.in/category/blog/