Oracle Big Data Fundamentals Ed 1

Similar documents
Oracle Big Data Fundamentals Ed 1 NEW

Oracle Big Data Essentials

Workshop on Hadoop with Big Data

Constructing a Data Lake: Hadoop and Oracle Database United!

Big Data and Hadoop. Module 1: Introduction to Big Data and Hadoop. Module 2: Hadoop Distributed File System. Module 3: MapReduce

Hadoop Ecosystem B Y R A H I M A.

HADOOP ADMINISTATION AND DEVELOPMENT TRAINING CURRICULUM

Big Data Course Highlights

Programming Hadoop 5-day, instructor-led BD-106. MapReduce Overview. Hadoop Overview

Introduction to Big Data Training

BIG DATA HADOOP TRAINING

Qsoft Inc

Connecting Hadoop with Oracle Database

BIG DATA - HADOOP PROFESSIONAL amron

Big Data Approaches. Making Sense of Big Data. Ian Crosland. Jan 2016

Certified Big Data and Apache Hadoop Developer VS-1221

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Peers Techno log ies Pv t. L td. HADOOP

Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP

Quick Deployment Step-by-step instructions to deploy Oracle Big Data Lite Virtual Machine

Safe Harbor Statement

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

<Insert Picture Here> Big Data

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data

ITG Software Engineering

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

Getting Started with Hadoop. Raanan Dagan Paul Tibaldi

Oracle Big Data Spatial & Graph Social Network Analysis - Case Study

Oracle Data Integrator for Big Data. Alex Kotopoulis Senior Principal Product Manager

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

Oracle Big Data SQL Technical Update

Apache Sentry. Prasad Mujumdar

Introduction to Big data. Why Big data? Case Studies. Introduction to Hadoop. Understanding Features of Hadoop. Hadoop Architecture.

Native Connectivity to Big Data Sources in MSTR 10

BIG DATA SERIES: HADOOP DEVELOPER TRAINING PROGRAM. An Overview

Hadoop & Spark Using Amazon EMR

Oracle Big Data Handbook

Hadoop Meets Exadata. Presented by: Kerry Osborne. DW Global Leaders Program Decemeber, 2012

ITG Software Engineering

Session: Big Data get familiar with Hadoop to use your unstructured data Udo Brede Dell Software. 22 nd October :00 Sesión B - DB2 LUW

OLH: Oracle Loader for Hadoop OSCH: Oracle SQL Connector for Hadoop Distributed File System (HDFS)

How To Use A Data Center With A Data Farm On A Microsoft Server On A Linux Server On An Ipad Or Ipad (Ortero) On A Cheap Computer (Orropera) On An Uniden (Orran)

Fundamentals Curriculum HAWQ

An Oracle White Paper June High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database

Big Data on Microsoft Platform

Intel HPC Distribution for Apache Hadoop* Software including Intel Enterprise Edition for Lustre* Software. SC13, November, 2013

Complete Java Classes Hadoop Syllabus Contact No:

Hadoop MapReduce and Spark. Giorgio Pedrazzi, CINECA-SCAI School of Data Analytics and Visualisation Milan, 10/06/2015

Spark. Fast, Interactive, Language- Integrated Cluster Computing

Scaling Out With Apache Spark. DTL Meeting Slides based on

Unified Big Data Analytics Pipeline. 连 城

The Inside Scoop on Hadoop

[Type text] Week. National summer training program on. Big Data & Hadoop. Why big data & Hadoop is important?

Big Data SQL and Query Franchising

Pro Apache Hadoop. Second Edition. Sameer Wadkar. Madhu Siddalingaiah

brief contents PART 1 BACKGROUND AND FUNDAMENTALS...1 PART 2 PART 3 BIG DATA PATTERNS PART 4 BEYOND MAPREDUCE...385

Data Domain Profiling and Data Masking for Hadoop

Executive Summary... 2 Introduction Defining Big Data The Importance of Big Data... 4 Building a Big Data Platform...

SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Data-Intensive Programming. Timo Aaltonen Department of Pervasive Computing

Monitis Project Proposals for AUA. September 2014, Yerevan, Armenia

BIG DATA & DATA SCIENCE

Why Spark on Hadoop Matters

Big Data Too Big To Ignore

HADOOP BIG DATA DEVELOPER TRAINING AGENDA

Dell In-Memory Appliance for Cloudera Enterprise

Hadoop Submitted in partial fulfillment of the requirement for the award of degree of Bachelor of Technology in Computer Science

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

Hadoop 101. Lars George. NoSQL- Ma4ers, Cologne April 26, 2013

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Consulting and Systems Integration (1) Networks & Cloud Integration Engineer

Introduc8on to Apache Spark

Upcoming Announcements

Apache Spark 11/10/15. Context. Reminder. Context. What is Spark? A GrowingStack

Integrate Master Data with Big Data using Oracle Table Access for Hadoop

Deep Quick-Dive into Big Data ETL with ODI12c and Oracle Big Data Connectors Mark Rittman, CTO, Rittman Mead Oracle Openworld 2014, San Francisco

Deploying Hadoop with Manager

Hadoop: The Definitive Guide

Big Data Training - Hackveda

BIG DATA TECHNOLOGY. Hadoop Ecosystem

Welkom! Copyright 2014 Oracle and/or its affiliates. All rights reserved.

Native Connectivity to Big Data Sources in MicroStrategy 10. Presented by: Raja Ganapathy

Oracle Big Data SQL. Architectural Deep Dive. Dan McClary, Ph.D. Big Data Product Management Oracle

BIG DATA & HADOOP DEVELOPER TRAINING & CERTIFICATION

A Big Data Storage Architecture for the Second Wave David Sunny Sundstrom Principle Product Director, Storage Oracle

Hadoop Job Oriented Training Agenda

Prepared By : Manoj Kumar Joshi & Vikas Sawhney

Ankush Cluster Manager - Hadoop2 Technology User Guide

Apache Spark : Fast and Easy Data Processing Sujee Maniyam Elephant Scale LLC sujee@elephantscale.com

Cloudera Administrator Training for Apache Hadoop

TRAINING PROGRAM ON BIGDATA/HADOOP

Data Discovery and Systems Diagnostics with the ELK stack. Rittman Mead - BI Forum 2015, Brighton. Robin Moffatt, Principal Consultant Rittman Mead

HADOOP. Revised 10/19/2015

Oracle Big Data for Dummies

WHAT S NEW IN SAS 9.4

Transcription:

Oracle University Contact Us: 001-855-844-3881 Oracle Big Data Fundamentals Ed 1 Duration: 5 Days What you will learn In the Oracle Big Data Fundamentals course, learn to use Oracle's Integrated Big Data Solution to acquire, process, integrate and analyze big data. Learn To: Define Big Data. Describe Oracle's Integrated Big Data Solution and its components. Use the Hadoop Distributed File System (HDFS). Acquire big data using the Command Line Interface, Flume, and Oracle NoSQL Database. Process big data using MapReduce, YARN, Hive, Pig, Oracle XQuery for Hadoop, Solr, and Spark. Integrate big data and warehouse data using Scoop, Oracle Big Data Connectors, Copy to BDA, Oracle Big Data SQL, Oracle Data Integrator, and Oracle GoldenGate. Analyze big data using Oracle Big Data SQL, Oracle Advanced Analytics technologies, and Oracle Big Data Discovery. Use and manage Oracle Big Data Appliance. Benefits To You Increase your Big Data technology portfolio by learning to use a wide range of big data acquisition, processing, integration, and analysis techniques. In addition, you learn about Oracle s engineered systems for Big Data, which provide a variety of data integration and analysis capabilities. Analysis options include Oracle Big Data SQL, Oracle Data Mining, Oracle R Enterprise, and Oracle Big Data Discovery. Benefit from a hands-on, case-study approach while learning about Oracle s Integrated Big Data Solution.Live Virtual Class Format A Live Virtual Class (LVC) is exclusively for registered students; unregistered individuals may not view an LVC at any time. Registered students must view the class from the country listed in the registration form. Unauthorized recording, copying, or transmission of LVC content may not be made. Audience Application Developers Database Administrators Hadoop Programmer Hadoop/BigData Cluster Administrator Copyright 2013, Oracle. All rights reserved. Page 1

Related Training Required Prerequisites Database Basics (optional) Exposure to Big Data Basic knowledge of Hadoop Course Objectives Examine MapReduce programs and balance MapReduce jobs Use the Oracle BigDataLite Virtual Machine Review Oracle s Big Data Management Architecture and Engineered Systems Define Big Data Identify Big Data Use Cases Define the Hadoop ecosystem and its components Examine MapReduce programs and balance MapReduce jobs Use Oracle NoSQL Database Use Oracle XQuery for Hadoop Install, use, and administer the Oracle Big Data Appliance Provide data security and enable resource management Course Topics Introduction Questions About You Course Objectives Course Road Map Practice Environment Connecting to the Course Environment (Oracle Big Data Lite Virtual Machine) Using VNC Starting the Oracle Big Data Lite Virtual Version Machine 4.01 Introducing the Movieplex Big Data and the Oracle Information Management System Big Data Opportunities and Challenges Oracle Information Management Architecture Optimizing/Simplifying Architecture with Engineered Systems Copyright 2013, Oracle. All rights reserved. Page 2

Using Oracle Big Data Lite Virtual Machine Overview of the Big Data product stack Access methods Review the Oracle Big Data Virtual Machine Home page Deep dive into the Oracle case study Identify the data structures used Understand the importance of filtering the data Identify the Hadoop Command Guide URL, and review the fs and version commands that are used in the practice Introduction to the Big Data Ecosystem Computer Clusters Distributed Computing The Hadoop Ecosystem Hadoop Core Components Choosing a Hadoop Distribution and Version Types of Analysis That Use Hadoop Cloudera s Distribution Including Apache Hadoop (CDH) Architecture Introduction to the Hadoop Distributed File System (HDFS) Hadoop Distributed Filesystem (HDFS) Acquire Data using CLI, Fuse-DFS, and Flume Introducing the CLI Examining Fuse DFS Using Flume Acquire and Access Data Using Oracle NoSQL Database Define Oracle NoSQL Database List Benefits Load data into the DB Access NoSQL Data Primary Administrative Tasks for Oracle NoSQL Database Plan an Oracle NoSQL Database installation and Node configuration Configure and Deploy a KVStore Using the GUI web interface (monitoring the KVStore) Use the NoSQL Database Table Model (both CLI and Java API) Introduction to MapReduce (Generic and not BDA specific) MapReduce Interacting with MapReduce MapReduce Daemons (Services) update based on YARN Interacting With MapReduce Fault Tolerance MapReduce Examples Resource Management Using Yarn YARN Overview Copyright 2013, Oracle. All rights reserved. Page 3

YARN: Theme 1 YARN: Theme 2 Job Submission in YARN YARN Features MapReduce 2.0: Overview YARN Services Overview of Apache Hive and Apache Pig Apache Hive Apache Pig Overview of Cloudera Impala Hadoop: Analysis Options Examining Cloudera Impala Integrating Hadoop and Oracle Using Oracle XQuery for Hadoop Extensible Markup Language (XML) Simple XML Document XML Elements Markup Rules for Elements XML Attributes What is XML Path (XPath) Language? XPath Terminology: Node Types XPath Terminology: Family Relationships Overview of Solr What is Apache Solr (Cloudera Search)? Cloudera Search: Key Capabilities Cloudera Search: Features Cloudera Search: Tasks Indexing in Cloudera Search Types of Indexing Schema.xml Creating a Solr Collection Apache Spark What is Apache Spark? Introduction to Spark Resilient Distributed Datasets (RDD) Directed Acyclic Graph (DAG) Execution Engine Spark: Architecture Overview of Scala Language Options for Integrating Your Big Data Main Themes of Data Integration Typical use cases: The World of Fast Exploration of Big Data Which technology do I use to integrate data? Introducing Data Integration Options Installation Details Current Hadoop Certification Matrix Copyright 2013, Oracle. All rights reserved. Page 4

Overview of Apache Sqoop Define Sqoop Architecture Dataflow in Sqoop Importance of Scoop Role of Scoop in moving data Comparison of Scoop Scoop Connectors Importing Data From Scoop Into Hive Using Oracle Loader for Hadoop (OLH) What is OLH? Installation Pre-requisites Installation Steps OLH Architecture OLH New Feature: Selective Load of Hive Partitions Input Formats Output Modes OLH Modes Using Copy To BDA Define Copy to BDA Where it fits in (bundled under the Big Data SQL license) Sample Scenario Installation Pre-requisites Using Copy to BDA feature Datatype Conversion list (tabular format) Best practices Advantages Using Oracle SQL Connector for HDFS (OSCH) Oracle SQL Connector for HDFS Software Prerequisites Installation Setup: Oracle Database Installation Setup: Hadoop Cluster Granting User Access to OSCH OSCH: Features OSCH: New Feature in 3.0 OSCH: Three Simple Steps Using Oracle Big Data SQL Context: Exadata and Big Data Appliance What is Big Data SQL? Configuring Oracle Big Data SQL Create Oracle Tables over HDFS data Leverage the Hive Metastore to Access Data in Hadoop Apply Oracle Database Security Policies Over Data in Hadoop Combine HDFS and Oracle data for analysis (SQL Pattern Matching) Using Oracle Data Integrator and Oracle GoldenGate with Hadoop Overview of ODI Overview of OGG Copyright 2013, Oracle. All rights reserved. Page 5

Using ODI Using OGG Copyright 2013, Oracle. All rights reserved. Page 6