Big Data in Cloud. Round table





Big Data Management Introduction

Traditional DI Environments
Start simple. Build as you grow.

Big Data World! Too many decisions to begin with
Security: Sentry, Kerberos, Knox, Ranger
Exec Engines: MapReduce, Spark, Spark Stream, Tez, Pig, Impala
Data Formats: Avro, ORC, Text, RC, Parquet, Sequence
Storage Layers: HDFS, S3, Azure Blob
Distributions: CDH, MapR
Compression: GZip, LZO, BZip2, Snappy
Also shown: Legacy HW, MongoDB, Relational, ERP Data, Hadoop, NoSQL, HBase, Redshift

Big Data World with Informatica BDM
Deploy anywhere: NoSQL, Data Storage Layers, Security, Distributions, Exec Engines, Data Formats, Data Compression
Informatica Big Data Management Edition: Connections, Configuration, Data Objects, Mappings
Abstract and streamline your data flow
Focus on business logic, not integration
Build for data, not technology
Build once, run anywhere

Overview of the Data Integration Solutions
PowerCenter (Traditional Workloads): Data Warehousing, Agile BI, Real-time DI, Data Migration, Apps Integration (on-prem)
Big Data Management (Next-Gen Workloads): DW Offloading/Optimization, Data Lakes, Big Data Analytics, NoSQL Integration
Cloud Data Integration (Cloud & SaaS Workloads): Apps Integration (Hybrid), Cloud & Hybrid DI, DW & Analytics (Cloud DBs)

3 pillars of Big Data Management
Single, Comprehensive and Integrated Platform for End-to-End Big Data Management
Data Integration
Data Quality & Governance
Data Security

Custom coding vs. Informatica BDM

Custom coding vs. Informatica BDM
Simple, Graphical User Interface
Import and Validate Existing PowerCenter Mappings
Ensure Ongoing Maintainability and Reuse

What's new?
10.0: Platform (Dynamic Schemas, Mappings Parameterization, Team Based Development / Versioning, Scheduler Service, Enhanced monitoring, Connectivity, Partitioning); Big Data Exclusive (Blaze, Live Data Map)
10.0 Update 1: PC Reuse Report, Blaze Enhancements, Connectivity & Partitioning, Amazon EMR support, Azure HDInsight support
10.1: Blaze enhancements, OS Profiles, SQOOP, DI on Spark, SQL to Mapping, Live Data Map 2.0, Intelligent Data Lake

Big Data Management Cloud deployments

Challenges with on-premise deployments
Inflexible and static infrastructure
Difficulty in keeping up with evolving technologies
Lots of planning to get clusters up and running
Limited in-house Hadoop expertise
High total cost of ownership

On-premise deployment
Set up hardware
Manually scale cluster
Select Hadoop distro
Monitor cluster
Configure cluster
Design data flows

Cloud deployment
Set up hardware
Auto scale cluster (rather than manually)
Select Hadoop distro
Monitor cluster
Configure cluster
Design data flows

Informatica BDM: On-Premise & Cloud
Informatica Big Data Management Edition: On-Premise, Cloud

Informatica BDM: Cloud connectivity
Informatica Big Data Management Edition support: Blob, SQL Server, S3, Redshift, HDFS, Azure HDInsight, Amazon

Azure Marketplace

BDM on HDInsight Setup
01 Basics
02 Node Settings
03 Domain
04 Database
05 Cluster
06 Infrastructure

Demonstration DEMO on BDM capabilities

DEMO Use case
Industry: Entertainment
Goal: Leverage Hadoop in the cloud
Scenario: GetFlix is an entertainment organization that streams movies and TV shows. GetFlix relies on 3rd-party vendors to provide the ratings of various titles and of individual episodes for each TV show's season. GetFlix would like to analyze this data to identify user interests and recommend new movies and shows.
Challenges:
Access the individual rating data in the cloud
Process the data in the cloud and store it back in the cloud
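To make the processing step of this use case concrete, here is a minimal Python sketch of the kind of aggregation involved: averaging vendor ratings per title. It is not how Informatica BDM implements the mapping. The column names (title, episode, rating), the sample rows, and the function names are all hypothetical, and the data is read from an in-memory string rather than from cloud storage.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample of third-party rating data; the real layout is not
# given in the slides, so these column names are assumptions.
SAMPLE = """title,episode,rating
Show A,S01E01,4.0
Show A,S01E02,5.0
Movie B,,3.0
"""


def load_rows(text):
    """Parse CSV text into a list of dictionaries, one per rating record."""
    return list(csv.DictReader(io.StringIO(text)))


def average_ratings(rows):
    """Return the mean rating per title across all of its rating records."""
    totals = defaultdict(lambda: [0.0, 0])  # title -> [sum of ratings, count]
    for row in rows:
        entry = totals[row["title"]]
        entry[0] += float(row["rating"])
        entry[1] += 1
    return {title: total / count for title, (total, count) in totals.items()}


averages = average_ratings(load_rows(SAMPLE))
```

In the scenario described above, the input would come from cloud storage and the result would be written back to it; the sketch leaves both ends out and shows only the aggregation logic.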

Live DEMO

Summary
Customer challenge | Solution | Informatica features showcased
Access data in the cloud and on-premise | Rely on hybrid connectivity | Use PowerExchange for hybrid connectivity
Process data in the cloud and on Hadoop | Rely on the industry's leader in cloud & big data | Informatica BDM on cloud
Reuse components between on-premise and cloud | Abstract design-time processes from run-time | Smart Executor
Quickly spin up BDM environments in Microsoft Azure | Launch BDM from Azure Marketplace | Azure HDInsight Marketplace

Questions??!

User Groups
Informatica User Groups are a great way for you to invest in your professional development and learn about new Informatica offerings. Local Chapter Leaders manage each IUG online and via in-person meetings.
Network and socialize
Find and share content, best practices & tips
Learn about the latest technologies and solutions from Informatica
Discover how colleagues and peers use Informatica
https://network.informatica.com/welcome/
LEARN MORE AT IW16: Go to the Solutions Expo, Informatica Pavilion / Ecosystem & Innovation Area:
Talk to regional user group leaders
Learn about meeting plans
Join your regional user group
When: Monday 6:00pm to 8:30pm, Tuesday 10:45am to 2:15pm, Wednesday 10:30am to 1:45pm
Where: Moscone West Hall, Level One

EMR Cluster

EMR Cluster

EC2 Nodes

Informatica Administrator

Informatica Monitoring

Hadoop Monitoring

Connections

EMR Cluster

Hive on Amazon S3

Mapping's execution on EMR

Hive on Amazon S3