Automated Data Ingestion. Bernhard Disselhoff Enterprise Sales Engineer


1 Automated Data Ingestion Bernhard Disselhoff Enterprise Sales Engineer

2 Agenda
  Pentaho Overview
  Templated dynamic ETL workflows
  Pentaho Data Integration (PDI)
  Use Cases

3 Pentaho Overview

4 Overview: What we will address today
  Automated self-service solutions
  Templated dynamic ETL workflows
  Manage the data pipeline at enterprise scale

5 Pentaho Product Components
  ETL, Job Orchestration & Big Data: Pentaho Data Integration (PDI)
  Data Science: Weka & R
  Data Modeling: Pentaho Metadata & Mondrian
  Data Discovery: Pentaho Analyzer
  Operational Reports: Pentaho Report Designer & Interactive Reports
  Dashboards: Pentaho Dashboard Designer & CTools
  Pentaho provides a complete platform for end-to-end data and analytics solutions.

6 Templated Dynamic ETL Workflows: ETL Metadata Injection

7 Traditional ETL: Hardcoded Metadata
  Metadata details (fields, data types, etc.) are required for various steps within a transformation: sources, targets, and/or transformation steps.
  Legacy ETL tools require you to hardcode the metadata at development time.
  [Diagram: Extract, Transform, Load stages over Source, Step 1, Step 2, Step 3, Target; label: Metadata]

8 Dynamic ETL
  ETL Metadata Injection lets you inject the metadata into a template at runtime.
  [Diagram: Template with Extract, Transform, Load stages over Source, Step 1, Step 2, Step 3, Target; label: Metadata (blank)]
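The idea on this slide can be illustrated with a minimal Python sketch. It is not PDI's ETL Metadata Injection step or any Pentaho API; the function `run_template`, the metadata dictionary layout, and the sample data are all hypothetical. The template contains no field names or types of its own, and everything it needs arrives as a `metadata` argument at runtime.

```python
import csv
import io


def run_template(source_text, metadata):
    """Generic extract -> transform -> load template.

    The template hardcodes no field names, types, or delimiter;
    all of them are injected through `metadata` (hypothetical layout).
    """
    field_names = [field["name"] for field in metadata["fields"]]

    # Extract: read delimited rows using the injected delimiter and names.
    reader = csv.DictReader(
        io.StringIO(source_text),
        fieldnames=field_names,
        delimiter=metadata["delimiter"],
    )

    # Transform: cast every field to its injected type.
    casts = {"int": int, "float": float, "str": str}
    rows = [
        {f["name"]: casts[f["type"]](raw[f["name"]]) for f in metadata["fields"]}
        for raw in reader
    ]

    # Load: in this sketch the "target" is simply the returned list.
    return rows


# Metadata supplied at runtime (assumed example, not from the slides).
orders_metadata = {
    "delimiter": ";",
    "fields": [
        {"name": "order_id", "type": "int"},
        {"name": "amount", "type": "float"},
    ],
}

result = run_template("1;9.50\n2;12.00\n", orders_metadata)
```

Passing a different metadata dictionary (other field names, another delimiter) reuses the same function unchanged, which is the property the slide attributes to a template.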

9 Use Case 1: Scalability / Reuse
  Same workflow, many different files, tables, etc.
  Maintain metadata in a list or table and reuse a single workflow template.
  Example: migrate 1,500 tables
  [Diagram: Template with Extract, Transform, Load stages over Source, Step 1, Step 2, Step 3, Target; label: Metadata (blank)]
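A small sketch of this reuse pattern, assuming Python and in-memory SQLite databases as stand-ins for the real source and target systems. The table names, columns, and the helper `migrate_table` are hypothetical; the point is only that one template is driven by a list of metadata entries, one per table.

```python
import sqlite3


def migrate_table(source, target, table, columns):
    """One reusable template: copy `columns` of `table` from source to target."""
    col_list = ", ".join(columns)
    placeholders = ", ".join("?" for _ in columns)
    # Identifiers come from the trusted metadata list, not from user input.
    target.execute(f"CREATE TABLE {table} ({col_list})")
    rows = source.execute(f"SELECT {col_list} FROM {table}").fetchall()
    target.executemany(f"INSERT INTO {table} VALUES ({placeholders})", rows)
    return len(rows)


# Hypothetical source system with two tables.
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE customers (id, name)")
source.execute("INSERT INTO customers VALUES (1, 'Ada')")
source.execute("CREATE TABLE orders (id, customer_id, total)")
source.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)", [(10, 1, 9.5), (11, 1, 12.0)]
)

target = sqlite3.connect(":memory:")

# The metadata list: one entry per table, all handled by the same template.
table_metadata = [
    {"table": "customers", "columns": ["id", "name"]},
    {"table": "orders", "columns": ["id", "customer_id", "total"]},
]

copied = {
    entry["table"]: migrate_table(source, target, entry["table"], entry["columns"])
    for entry in table_metadata
}
```

Scaling from two tables to many means adding rows to `table_metadata`, not writing more workflows.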

10 Use Case 2: Self-service
  Allow the user or customer to enter metadata in a simple web form.
  Example: select fields for a template to pull data from Hadoop and build an on-demand data mart
  [Diagram: Template with Extract, Transform, Load stages over Source, Step 1, Step 2, Step 3, Target; label: Metadata (blank)]

11 Use Case 3: Auto-Discovery
  Parse out metadata dynamically at runtime.
  Example: dynamically parse messages of varying formats
  [Diagram: Template with Extract, Transform, Load stages over Source, Step 1, Step 2, Step 3, Target; label: Metadata (blank)]
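As a rough illustration of auto-discovery, the Python sketch below inspects each incoming message, decides which of two assumed formats it uses (JSON or space-separated `key=value` pairs), and derives the field list from the message itself. The formats, the function `discover_and_parse`, and the sample messages are assumptions for illustration and are not taken from the presentation.

```python
import json


def discover_and_parse(message):
    """Return (format_name, record); fields are discovered from the message."""
    text = message.strip()
    if text.startswith("{"):
        # Looks like a JSON object: field names are its keys.
        return "json", json.loads(text)
    if "=" in text:
        # Looks like key=value pairs: field names are the keys found.
        pairs = (item.split("=", 1) for item in text.split() if "=" in item)
        return "key_value", {key: value for key, value in pairs}
    # Unrecognized format: keep the raw text so nothing is lost.
    return "unknown", {"raw": text}


messages = [
    '{"user": "alice", "action": "login"}',
    "user=bob action=logout status=ok",
]

parsed = [discover_and_parse(m) for m in messages]
discovered_fields = [sorted(record) for _, record in parsed]
```

The discovered field lists could then be handed to a template as its metadata, instead of being maintained by hand.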

12 DRY Principle: Don't Repeat Yourself
  Use a Templated Approach
  Use Cases
    Scalability: simplified data onboarding & management
    Large & Gas Co.
    Auto-Discovery: dynamic parsing of log files for cybersecurity
    Self-service: customer on-boarding
    Scalability: large data migration
    Major Professional Services Firm

13 Pentaho Data Integration (PDI)

14 Big Data challenges Pentaho addresses
  EDW is too rigid, too slow, too expensive for big data
  Steep learning curves, scarcity of talent
  Semi/un/structured data parsing, extraction, processing, data quality
  Blending big data with traditional data for a 360 view
  Data Access, Governance
  [Diagram labels: Network, Location, Web, Social Media, Customer, Provisioning, Billing, NoSQL, Hadoop Cluster, Real time, EDW, Data Marts, BI Tools, Data Science]

15 Connectivity
  The broadest data connectivity and a robust data integration engine
  Relational, Big Data, Applications, Much More

16 Integrate ALL Data in an Intuitive Way
  A user-friendly graphical interface to build complete data pipelines
  100+ Transformation Steps
  Drag & drop Development
  100% GUI-based Configuration
  Model, Analyze, and Visualize as you go

17 Concept: Data Transformations
  INPUT(S) -> PROCESS(ES) -> OUTPUT(S)

18 Concept: Jobs (orchestrate)
  START -> CHECK -> WATCH -> EXECUTE -> NOTIFY -> FINISH
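The job sequence on this slide can be sketched in a few lines of Python. This is a conceptual illustration only, not a PDI job definition; `run_job`, its arguments, and the temporary landing directory are hypothetical. Each stage runs in order, and the job notifies and finishes early if a check fails.

```python
import os
import tempfile


def run_job(watch_path, transformation, notify):
    """Run START, CHECK, WATCH, EXECUTE, NOTIFY, FINISH in order.

    Stops at the first failing stage, after sending a notification.
    """
    log = ["START"]

    # CHECK: is the resource (here, the landing directory) available?
    if not os.path.isdir(os.path.dirname(watch_path)):
        notify("check failed")
        return log + ["CHECK failed", "FINISH"]
    log.append("CHECK")

    # WATCH: a real job would poll for the file; this sketch looks once.
    if not os.path.exists(watch_path):
        notify("file not found")
        return log + ["WATCH failed", "FINISH"]
    log.append("WATCH")

    # EXECUTE: run the transformation on the file contents.
    with open(watch_path) as handle:
        result = transformation(handle.read())
    log.append("EXECUTE")

    # NOTIFY: report the outcome, then finish.
    notify(f"done: {result}")
    log.append("NOTIFY")
    log.append("FINISH")
    return log


notifications = []
with tempfile.TemporaryDirectory() as landing:
    path = os.path.join(landing, "input.txt")
    with open(path, "w") as handle:
        handle.write("a\nb\nc\n")
    # The "transformation" here just counts lines.
    job_log = run_job(path, lambda text: len(text.splitlines()), notifications.append)
```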

19 Job Orchestration Toolkit

20 Integrate ALL Data in an Intuitive Way
  Apply familiar ETL techniques to new data and technologies
  Data Profiling and Data Quality
    Validate
    Cleanse
    De-duplicate
    Filter
  Transform
    Sort, Aggregate and Group
    Normalize & De-normalize
    Calculate, rank and score
    In-flight encryption & compression
  Data Blending
    Join disparate data sources
    Data caching
    Output to multiple targets
  Control Structures
    Split & re-join data streams
    Dynamic variables with multiple scoping levels
    Define serial & parallel execution workflows
    Unlimited levels of job nesting

21 Go Beyond Standard ETL Operations
  Flexible capabilities to provide data services and deliver analytics
  Data Virtualization & Application Integration
    PDI JDBC & Web Services
  Extract, Transform, Report
  Automate Data Science
  Open Architecture
    Pluggable architecture
    Active community eco-system and marketplace
    Create your own data connectors and transformation steps
    Call your own code: Java, JavaScript, Shell scripts, & SQL Stored Procedures

22 Manage the Data Pipeline at Enterprise Scale
  Architected for Scalable Performance
  Scale Up
  PDI Clustering
  Visual Map/Reduce
  YARN

23 Manage the Data Pipeline at Enterprise Scale
  Enterprise-Grade Control and Security
  Job Orchestration
    Check resource availability, watch for file, etc.
    Execute transformations, nested jobs, shell scripts
    Logging, error handling, and notifications
  Administration
    Enterprise scheduler
    Real-time performance monitoring
    Restart jobs at checkpoints on failure
    Bundled operations mart & reports to audit usage and access
  Security
    Active Directory / LDAP integration
    Access controls
    Version control

24 PDI Infrastructure Components: Loosely Coupled Components
  Pentaho Data Integration (PDI) Server
    J2EE (Tomcat, JBoss)
    Data Source, Data Target
  Data Virtualization
    PDI JDBC Interface
  Application Integration via HTTP/S
    Web Service Call
    Transformation returns JSON, XML, Text, etc.
  Enterprise Scheduler
    Initiates ETL via CLI (SSH, CLI)
  Repository Database (Publish)
    ETL Versioning, Security, DB Connections, PDI Cluster Configuration, Partitioning Schemes, Logging, Scheduling
    Oracle, SQL Server, PostgreSQL, and MySQL are supported
  Developer Workstation
    ETL Development, Monitoring, Administration, Scheduling, Local execution
    Mac, Windows, Linux supported
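For the "Initiates ETL via CLI" path, an external scheduler typically assembles a command line for PDI's job runner. The Python sketch below only builds such a command and does not execute it. It assumes the `kitchen.sh` launcher with its `-file`, `-level`, and `-param:` options as used on Linux; the job path and the `RUN_DATE` parameter are hypothetical, and the exact option syntax should be checked against the documentation for the PDI version in use.

```python
import shlex


def build_kitchen_command(job_file, params, level="Basic"):
    """Build an argument list for launching a PDI job from a scheduler.

    Assumes the Linux-style kitchen.sh options -file, -level and -param:.
    """
    args = ["kitchen.sh", f"-file={job_file}", f"-level={level}"]
    # Named parameters are passed as -param:NAME=value, in a stable order.
    args += [f"-param:{name}={value}" for name, value in sorted(params.items())]
    return args


# Hypothetical job file and parameter, for illustration only.
command = build_kitchen_command(
    "/opt/etl/jobs/load_sales.kjb", {"RUN_DATE": "2016-01-31"}
)
command_line = shlex.join(command)
```

A scheduler could pass `command` to `subprocess.run` over SSH or locally; keeping it as a list avoids shell-quoting problems with parameter values.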

25 Summary

26 Putting it all together
  Data, Data Engineering, Data Preparation, Analytics
  Managing and Automating the Pipeline
    Administration, Security, Lifecycle Management, Data Provenance, Dynamic Data Pipeline, Monitoring, Automation

27 Pentaho & Hitachi Solutions
  Social Innovation, IoT, Smart Cities, & Vertical Solutions
  Turnkey BI and Big Data Solutions
  Embedded Solutions for Enterprise Data Governance and SaaS
  CLOUD
  Big Data since 2009
  Traditional DI & BI since 2004
  SSO & Java Spring
  DB per Group/Tenant
  Row-level Multi-tenancy
  Object Multi-tenancy
  UI Multi-tenancy
  Scale-out Architecture
  Unified Compute Platform (UCP)
  Hyper-Scale-out Platform (HSP)

28 Summary: What we addressed today
  Automated self-service solutions
  Templated dynamic ETL workflows
  Manage the data pipeline at enterprise scale

29 Thank You! Visit us at the Hitachi demo point in the foyer

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture Apps and data source extensions with APIs Future white label, embed or integrate Power BI Deploy Intelligent

More information

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes Highly competitive enterprises are increasingly finding ways to maximize and accelerate

More information

Data Integration Checklist

Data Integration Checklist The need for data integration tools exists in every company, small to large. Whether it is extracting data that exists in spreadsheets, packaged applications, databases, sensor networks or social media

More information

Contents. Pentaho Corporation. Version 5.1. Copyright Page. New Features in Pentaho Data Integration 5.1. PDI Version 5.1 Minor Functionality Changes

Contents. Pentaho Corporation. Version 5.1. Copyright Page. New Features in Pentaho Data Integration 5.1. PDI Version 5.1 Minor Functionality Changes Contents Pentaho Corporation Version 5.1 Copyright Page New Features in Pentaho Data Integration 5.1 PDI Version 5.1 Minor Functionality Changes Legal Notices https://help.pentaho.com/template:pentaho/controls/pdftocfooter

More information

Unified Batch & Stream Processing Platform

Unified Batch & Stream Processing Platform Unified Batch & Stream Processing Platform Himanshu Bari Director Product Management Most Big Data Use Cases Are About Improving/Re-write EXISTING solutions To KNOWN problems Current Solutions Were Built

More information

Performance and Scalability Overview

Performance and Scalability Overview Performance and Scalability Overview This guide provides an overview of some of the performance and scalability capabilities of the Pentaho Business Analytics platform. PENTAHO PERFORMANCE ENGINEERING

More information

SELF-SERVICE DATA LAKES ON HADOOP

SELF-SERVICE DATA LAKES ON HADOOP SELF-SERVICE DATA LAKES ON HADOOP Introduction A recent Gartner survey on Hadoop cited the two biggest challenges in working with Hadoop: Skills gaps continue to be a major adoption inhibitor for 57% of

More information

Jitterbit Technical Overview : Microsoft Dynamics CRM

Jitterbit Technical Overview : Microsoft Dynamics CRM Jitterbit allows you to easily integrate Microsoft Dynamics CRM with any cloud, mobile or on premise application. Jitterbit s intuitive Studio delivers the easiest way of designing and running modern integrations

More information

XpoLog Competitive Comparison Sheet

XpoLog Competitive Comparison Sheet XpoLog Competitive Comparison Sheet New frontier in big log data analysis and application intelligence Technical white paper May 2015 XpoLog, a data analysis and management platform for applications' IT

More information

Sisense. Product Highlights. www.sisense.com

Sisense. Product Highlights. www.sisense.com Sisense Product Highlights Introduction Sisense is a business intelligence solution that simplifies analytics for complex data by offering an end-to-end platform that lets users easily prepare and analyze

More information

Luncheon Webinar Series May 13, 2013

Luncheon Webinar Series May 13, 2013 Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration

More information

TRANSFORM BIG DATA INTO ACTIONABLE INFORMATION

TRANSFORM BIG DATA INTO ACTIONABLE INFORMATION TRANSFORM BIG DATA INTO ACTIONABLE INFORMATION Make Big Available for Everyone Syed Rasheed Solution Marketing Manager January 29 th, 2014 Agenda Demystifying Big Challenges Getting Bigger Red Hat Big

More information

Ernesto Ongaro BI Consultant February 19, 2013. The 5 Levels of Embedded BI

Ernesto Ongaro BI Consultant February 19, 2013. The 5 Levels of Embedded BI Ernesto Ongaro BI Consultant February 19, 2013 The 5 Levels of Embedded BI Saleforce.com CRM 2013 Jaspersoft Corporation. 2 Blogger 2013 Jaspersoft Corporation. 3 Linked In 2013 Jaspersoft Corporation.

More information

XpoLog Center Log Management Solution For ANY type of Network system, Security devices, Business applications

XpoLog Center Log Management Solution For ANY type of Network system, Security devices, Business applications XpoLog Center Log Management Solution For ANY type of Network system, Security devices, Business applications XpoLog Center is an Enterprise Log Analysis and Management Solution Analyst "Most enterprises

More information

Jitterbit Technical Overview : Salesforce

Jitterbit Technical Overview : Salesforce Jitterbit allows you to easily integrate Salesforce with any cloud, mobile or on premise application. Jitterbit s intuitive Studio delivers the easiest way of designing and running modern integrations

More information

IBM Websphere Application Server as a Service

IBM Websphere Application Server as a Service Government Efficiency through Innovative Reform IBM Websphere Application Server as a Service Service Definition Copyright IBM Corporation 2014 Table of Contents IBM Cloud Overview... 2 IBM/Sentinel PaaS...

More information

Cisco IT Hadoop Journey

Cisco IT Hadoop Journey Cisco IT Hadoop Journey Srini Desikan, Program Manager IT 2015 MapR Technologies 1 Agenda Hadoop Platform Timeline Key Decisions / Lessons Learnt Data Lake Hadoop s place in IT Data Platforms Use Cases

More information

Background on Elastic Compute Cloud (EC2) AMI s to choose from including servers hosted on different Linux distros

Background on Elastic Compute Cloud (EC2) AMI s to choose from including servers hosted on different Linux distros David Moses January 2014 Paper on Cloud Computing I Background on Tools and Technologies in Amazon Web Services (AWS) In this paper I will highlight the technologies from the AWS cloud which enable you

More information

What's New in SAS Data Management

What's New in SAS Data Management Paper SAS034-2014 What's New in SAS Data Management Nancy Rausch, SAS Institute Inc., Cary, NC; Mike Frost, SAS Institute Inc., Cary, NC, Mike Ames, SAS Institute Inc., Cary ABSTRACT The latest releases

More information

Simplifying Big Data Analytics: Unifying Batch and Stream Processing. John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!!

Simplifying Big Data Analytics: Unifying Batch and Stream Processing. John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!! Simplifying Big Data Analytics: Unifying Batch and Stream Processing John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!! Streaming Analy.cs S S S Scale- up Database Data And Compute Grid

More information

White Paper. Unified Data Integration Across Big Data Platforms

White Paper. Unified Data Integration Across Big Data Platforms White Paper Unified Data Integration Across Big Data Platforms Contents Business Problem... 2 Unified Big Data Integration... 3 Diyotta Solution Overview... 4 Data Warehouse Project Implementation using

More information

Unified Data Integration Across Big Data Platforms

Unified Data Integration Across Big Data Platforms Unified Data Integration Across Big Data Platforms Contents Business Problem... 2 Unified Big Data Integration... 3 Diyotta Solution Overview... 4 Data Warehouse Project Implementation using ELT... 6 Diyotta

More information

Jitterbit Technical Overview : Microsoft Dynamics AX

Jitterbit Technical Overview : Microsoft Dynamics AX Jitterbit allows you to easily integrate Microsoft Dynamics AX with any cloud, mobile or on premise application. Jitterbit s intuitive Studio delivers the easiest way of designing and running modern integrations

More information

brought to you by WebAction June 2016

brought to you by WebAction June 2016 brought to you by WebAction & June 2016 Executive Summary At Nugravity, our corporate vision is to make every customer s business more successful day by day through technology. We have been successfully

More information

Performance and Scalability Overview

Performance and Scalability Overview Performance and Scalability Overview This guide provides an overview of some of the performance and scalability capabilities of the Pentaho Business Analytics Platform. Contents Pentaho Scalability and

More information

IBM BigInsights Has Potential If It Lives Up To Its Promise. InfoSphere BigInsights A Closer Look

IBM BigInsights Has Potential If It Lives Up To Its Promise. InfoSphere BigInsights A Closer Look IBM BigInsights Has Potential If It Lives Up To Its Promise By Prakash Sukumar, Principal Consultant at iolap, Inc. IBM released Hadoop-based InfoSphere BigInsights in May 2013. There are already Hadoop-based

More information

Actuate Content Services Process Designer

Actuate Content Services Process Designer Actuate Content Services Process Designer Process Designer Customer Communications Management solutions reduce the costs associated with traditional printing while facilitating better operational performance

More information

Big Data on Tap Jonathan Gray

Big Data on Tap Jonathan Gray Unified Integration for Data-Driven Applications Big Data on Tap Jonathan Gray Founder & CEO November 7, 2016 Hadoop Enables New Applications and Architectures ENTERPRISE DATA LAKES BIG DATA ANALYTICS

More information

RRF Reply Reporting Framework

RRF Reply Reporting Framework RRF Reply Reporting Framework Introduction The increase in the services provided in the telco market requires to carry out short and long-term analyses aimed at monitoring the use of resources and timely

More information

Pentaho Reporting Overview

Pentaho Reporting Overview Pentaho Reporting Copyright 2006 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For the latest information, please visit our web site at www.pentaho.org

More information

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Datenverwaltung im Wandel - Building an Enterprise Data Hub with Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees

More information

2014 Astera Software. Convergence of Data and Application Integration

2014 Astera Software. Convergence of Data and Application Integration Convergence of Data and Application Integration Headquartered in Simi Valley, CA Vision: to deliver intuitive, fast, and seamless data integration across a broad spectrum of platforms Mission: to be a

More information

Data Governance in the Hadoop Data Lake. Michael Lang May 2015

Data Governance in the Hadoop Data Lake. Michael Lang May 2015 Data Governance in the Hadoop Data Lake Michael Lang May 2015 Introduction Product Manager for Teradata Loom Joined Teradata as part of acquisition of Revelytix, original developer of Loom VP of Sales

More information

AngularJS, Javascript, Prototype-based OO concept, RESTful Design Pattern, GWT, HTML5, Database.

AngularJS, Javascript, Prototype-based OO concept, RESTful Design Pattern, GWT, HTML5, Database. Position Code Role Skillsets Years of Experience TPM01 Senior Back End Application Developer Enterprise Architecture, Java EE, Java Spring Framework, Design Pattern, SOA, JPA, OSGI, Linux, Maven, Tomcat,

More information

IP Expo 2014 Pentaho Big Data Analytics Accelerating the time to big data value London, UK

IP Expo 2014 Pentaho Big Data Analytics Accelerating the time to big data value London, UK IP Expo 2014 Pentaho Big Data Analytics Accelerating the time to big data value London, UK Zaf Khan PreSales Manager, EMEA 1 Blending Cloudera and Pentaho Evolving big data architectures Network Location

More information

Cloud Ready Data: Speeding Your Journey to the Cloud

Cloud Ready Data: Speeding Your Journey to the Cloud Cloud Ready Data: Speeding Your Journey to the Cloud Hybrid Cloud first Born to the cloud 3 Am I part of a Cloud First organization? Am I part of a Cloud First agency? The cloud applications questions

More information

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here Data Virtualization for Agile Business Intelligence Systems and Virtual MDM To View This Presentation as a Video Click Here Agenda Data Virtualization New Capabilities New Challenges in Data Integration

More information

The Inside Scoop on Hadoop

The Inside Scoop on Hadoop The Inside Scoop on Hadoop Orion Gebremedhin National Solutions Director BI & Big Data, Neudesic LLC. VTSP Microsoft Corp. Orion.Gebremedhin@Neudesic.COM B-orgebr@Microsoft.com @OrionGM The Inside Scoop

More information

Pentaho BI Capability Profile

Pentaho BI Capability Profile Pentaho BI Capability Profile InfoAxon s Pentaho BI Integration Capabilities InfoAxon s Pentaho BI Integration Capabilities Challenge Organizations are under continuous pressure to improve their business

More information

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco Decoding the Big Data Deluge a Virtual Approach Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco High-volume, velocity and variety information assets that demand

More information

Big Data Analytics Platform @ Nokia

Big Data Analytics Platform @ Nokia Big Data Analytics Platform @ Nokia 1 Selecting the Right Tool for the Right Workload Yekesa Kosuru Nokia Location & Commerce Strata + Hadoop World NY - Oct 25, 2012 Agenda Big Data Analytics Platform

More information

Improve your IT Analytics Capabilities through Mainframe Consolidation and Simplification

Improve your IT Analytics Capabilities through Mainframe Consolidation and Simplification Improve your IT Analytics Capabilities through Mainframe Consolidation and Simplification Ros Schulman Hitachi Data Systems John Harker Hitachi Data Systems Insert Custom Session QR if Desired. Improve

More information

Hadoop & Spark Using Amazon EMR

Hadoop & Spark Using Amazon EMR Hadoop & Spark Using Amazon EMR Michael Hanisch, AWS Solutions Architecture 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda Why did we build Amazon EMR? What is Amazon EMR?

More information

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ End to End Solution to Accelerate Data Warehouse Optimization Franco Flore Alliance Sales Director - APJ Big Data Is Driving Key Business Initiatives Increase profitability, innovation, customer satisfaction,

More information

Pentaho Data Integration 4 and MySQL. Matt Casters: Pentaho's Chief Data Integration Kettle Project Founder

Pentaho Data Integration 4 and MySQL. Matt Casters: Pentaho's Chief Data Integration Kettle Project Founder Pentaho Data Integration 4 and MySQL Matt Casters: Pentaho's Chief Data Integration Kettle Project Founder MySQL User Conference, Tuesday April 13th, 2010 Agenda Pentaho: an introduction Pentaho Data Integration

More information

Roadmap Talend : découvrez les futures fonctionnalités de Talend

Roadmap Talend : découvrez les futures fonctionnalités de Talend Roadmap Talend : découvrez les futures fonctionnalités de Talend Cédric Carbone Talend Connect 9 octobre 2014 Talend 2014 1 Connecting the Data-Driven Enterprise Talend 2014 2 Agenda Agenda Why a Unified

More information

Open Source Business Intelligence Intro

Open Source Business Intelligence Intro Open Source Business Intelligence Intro Stefano Scamuzzo Senior Technical Manager Architecture & Consulting Research & Innovation Division Engineering Ingegneria Informatica The Open Source Question In

More information

Under the Hood: Cloud Analytics. CLD105 May 25 th Wednesday

Under the Hood: Cloud Analytics. CLD105 May 25 th Wednesday Under the Hood: Cloud Analytics CLD105 May 25 th Wednesday Session Agenda Introduction RedHat Informatica Demo Q&A Today s Presenters Ian Firman Abhisek Sinha Analytics Architect, Red Hat Inc. Principal

More information

Oracle Reference Architecture and Oracle Cloud

Oracle Reference Architecture and Oracle Cloud Oracle Reference Architecture and Oracle Cloud Anbu Krishnaswamy Anbarasu Enterprise Architect Social. Mobile. Complete. Global Enterprise Architecture Program Safe Harbor Statement The following is intended

More information

Cisco Data Preparation

Cisco Data Preparation Data Sheet Cisco Data Preparation Unleash your business analysts to develop the insights that drive better business outcomes, sooner, from all your data. As self-service business intelligence (BI) and

More information

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya Chapter 6 Basics of Data Integration Fundamentals of Business Analytics Learning Objectives and Learning Outcomes Learning Objectives 1. Concepts of data integration 2. Needs and advantages of using data

More information

Big Data at Cloud Scale

Big Data at Cloud Scale Big Data at Cloud Scale Pushing the limits of flexible & powerful analytics Copyright 2015 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For

More information

Cloud First Does Not Have to Mean Cloud Exclusively. Digital Government Institute s Cloud Computing & Data Center Conference, September 2014

Cloud First Does Not Have to Mean Cloud Exclusively. Digital Government Institute s Cloud Computing & Data Center Conference, September 2014 Cloud First Does Not Have to Mean Cloud Exclusively Digital Government Institute s Cloud Computing & Data Center Conference, September 2014 Am I part of a cloud first organization? Am I part of a cloud

More information

Communiqué 4. Standardized Global Content Management. Designed for World s Leading Enterprises. Industry Leading Products & Platform

Communiqué 4. Standardized Global Content Management. Designed for World s Leading Enterprises. Industry Leading Products & Platform Communiqué 4 Standardized Communiqué 4 - fully implementing the JCR (JSR 170) Content Repository Standard, managing digital business information, applications and processes through the web. Communiqué

More information

Lofan Abrams Data Services for Big Data Session # 2987

Lofan Abrams Data Services for Big Data Session # 2987 Lofan Abrams Data Services for Big Data Session # 2987 Big Data Are you ready for blast-off? Big Data, for better or worse: 90% of world s data generated over last two years. ScienceDaily, ScienceDaily

More information

Copyright 2014, Oracle and/or its affiliates. All rights reserved.

Copyright 2014, Oracle and/or its affiliates. All rights reserved. 1 Oracle Business Intelligence in the Cloud Gherardo Infunti Business Development Director EMEA Business Analytics 2 Disclaimer THE FOLLOWING IS INTENDED TO OUTLINE OUR GENERAL PRODUCT DIRECTION. IT IS

More information

Automate Your BI Administration to Save Millions with Command Manager and System Manager

Automate Your BI Administration to Save Millions with Command Manager and System Manager Automate Your BI Administration to Save Millions with Command Manager and System Manager Presented by: Dennis Liao Sr. Sales Engineer Date: 27 th January, 2015 Session 2 This Session is Part of MicroStrategy

More information

G Cloud 6 CDG Service Definition for Forgerock Software Services

G Cloud 6 CDG Service Definition for Forgerock Software Services G Cloud 6 CDG Service Definition for Forgerock Software Services Author: CDG Date: October 2015 Table of Contents Table of Contents 2 1.0 Service Definition 3 1.0 Service Definition Forgerock as a Platform

More information

<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise

<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise Business Intelligence is the #1 Priority the most important technology in 2007 is business intelligence

More information

A BUSINESS INTELLIGENCE PLATFORM

A BUSINESS INTELLIGENCE PLATFORM A BUSINESS INTELLIGENCE PLATFORM Transforming Data to Actionable Intelligence Rapid technology enablement by organizations has led to significant increase in the quantum of data generated by businesses.

More information

GeoKettle: A powerful open source spatial ETL tool

GeoKettle: A powerful open source spatial ETL tool GeoKettle: A powerful open source spatial ETL tool FOSS4G 2010 Dr. Thierry Badard, CTO Spatialytics inc. Quebec, Canada tbadard@spatialytics.com Barcelona, Spain Sept 9th, 2010 What is GeoKettle? It is

More information

www.ducenit.com Analance Data Integration Technical Whitepaper

www.ducenit.com Analance Data Integration Technical Whitepaper Analance Data Integration Technical Whitepaper Executive Summary Business Intelligence is a thriving discipline in the marvelous era of computing in which we live. It s the process of analyzing and exploring

More information

CA Process Automation

CA Process Automation PRODUCT SHEET: CA Process Automation we can CA Process Automation CA Process Automation enables enterprise organizations to design, deploy and administer automation of manual, resource-intensive and often

More information

Hitachi Data Center Analytics

Hitachi Data Center Analytics Hitachi Data Center Analytics Agenda Storage analytics challenges Introducing Hitachi Data Center Analytics Storage analytics use cases and solutions Q&A Storage Analytics Challenges Storage Pain Points

More information

Assignment # 1 (Cloud Computing Security)

Assignment # 1 (Cloud Computing Security) Assignment # 1 (Cloud Computing Security) Group Members: Abdullah Abid Zeeshan Qaiser M. Umar Hayat Table of Contents Windows Azure Introduction... 4 Windows Azure Services... 4 1. Compute... 4 a) Virtual

More information

GoodData. Platform Overview

GoodData. Platform Overview GoodData Platform Overview GoodData Platform: 2 3 The GoodData Platform GoodData Platform GoodData has helped more than users make sense of their data with advanced business analytics. It s open Thanks

More information

Securing your business

Securing your business Securing your business Anders Askåsen Product Manager for OpenIDM * World Wide Coverage ForgeRock.com Enterprise Open Source Software ForgeRock Norway ForgeRock USA ForgeRock UK ForgeRock France Consulting

More information

Client Overview. Engagement Situation. Key Requirements for Platform Development :

Client Overview. Engagement Situation. Key Requirements for Platform Development : Client Overview Our client is the leading provider of software solutions for providing online and easy access to health insurance. Our client offers these services to a range of consumers from employees

More information

Integrating Ingres in the Information System: An Open Source Approach

Integrating Ingres in the Information System: An Open Source Approach Integrating Ingres in the Information System: WHITE PAPER Table of Contents Ingres, a Business Open Source Database that needs Integration... 3 Scenario 1: Data Migration... 4 Scenario 2: e-business Application

More information

Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence

Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence Introduction to Oracle Business Intelligence Standard Edition One Mike Donohue Senior Manager, Product Management Oracle Business Intelligence The following is intended to outline our general product direction.

More information

Real Time Big Data Processing

Real Time Big Data Processing Real Time Big Data Processing Cloud Expo 2014 Ian Meyers Amazon Web Services Global Infrastructure Deployment & Administration App Services Analytics Compute Storage Database Networking AWS Global Infrastructure

More information

SQL Server 2014. What's New? Christopher Speer. Technology Solution Specialist (SQL Server, BizTalk Server, Power BI, Azure) v-cspeer@microsoft.

SQL Server 2014 What's New? Christopher Speer Technology Solution Specialist (SQL Server, BizTalk Server, Power BI, Azure) v-cspeer@microsoft.com The evolution of the Microsoft data platform What's New

OWB Users, Enter The New ODI World

OWB Users, Enter The New ODI World Kulvinder Hari Oracle Introduction Oracle Data Integrator (ODI) is a best-of-breed data integration platform focused on fast bulk data movement and handling complex data

SAP HANA SPS 09 - What's New? HANA IM Services: SDI and SDQ

SAP HANA SPS 09 - What's New? HANA IM Services: SDI and SDQ (Delta from SPS 08 to SPS 09) SAP HANA Product Management November, 2014 2014 SAP SE or an SAP affiliate company. All rights reserved. 1 Agenda

Databricks. A Primer

Databricks A Primer Who is Databricks? Databricks was founded by the team behind Apache Spark, the most active open source project in the big data ecosystem today. Our mission at Databricks is to dramatically

www.sryas.com Analance Data Integration Technical Whitepaper

Analance Data Integration Technical Whitepaper Executive Summary Business Intelligence is a thriving discipline in the marvelous era of computing in which we live. It's the process of analyzing and exploring

ACEYUS REPORTING. Aceyus Intelligence Executive Summary

ACEYUS REPORTING Aceyus Intelligence Executive Summary Aceyus, Inc. June 2015 Aceyus Intelligence is a suite of products for optimizing contact

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

ORACLE DATA INTEGRATOR ENTERPRISE EDITION Oracle Data Integrator Enterprise Edition 12c delivers high-performance data movement and transformation among enterprise platforms with its open and integrated

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

Introduction to Hadoop New York Oracle User Group Vikas Sawhney GENERAL AGENDA Driving Factors behind BIG-DATA NOSQL Database 2014 Database Landscape Hadoop Architecture Map/Reduce Hadoop Eco-system Hadoop

Cloud Computing. With MySQL and Pentaho Data Integration. Matt Casters Chief Data Integration at Pentaho Kettle project founder

Cloud Computing With MySQL and Pentaho Data Integration Matt Casters Chief Data Integration at Pentaho Kettle project founder 1-2 Agenda Introduction to Kettle Introduction Use-cases + load demo Performance

Upcoming Announcements

Enterprise Hadoop Jeff Markham Technical Director, APAC jmarkham@hortonworks.com Upcoming Announcements April 2 Hortonworks Platform 2.1 A continued focus on innovation within

Cisco Integration Platform

Data Sheet Cisco Integration Platform The Cisco Integration Platform fuels new business agility and innovation by linking data and services from any application - inside the enterprise and out. Product

IAN MASSINGHAM. Technical Evangelist Amazon Web Services

IAN MASSINGHAM Technical Evangelist Amazon Web Services From 2014: Cloud computing has become the new normal Deploying new applications to the cloud by default Migrating existing applications as quickly

WHITE PAPER. Domo Advanced Architecture

WHITE PAPER Domo Advanced Architecture Overview There are several questions that any architect or technology advisor may ask about a new system during the evaluation process: How will it fit into our organization

MDM and Data Warehousing Complement Each Other

Master Data Management MDM and Data Warehousing Complement Each Other Greater business value from both 2011 IBM Corporation Executive Summary Master Data Management (MDM) and Data Warehousing (DW) complement each other There

Big Business, Big Data, Industrialized Workload

Big Business, Big Data, Industrialized Workload Big Data 4 Billion 600TB London - NYC 1 Billion by 2020 100 Million Giga Bytes Copyright 3/20/2014 BMC Software, Inc

PPT Templates Refresh

GROUP ENTERPRISE PPT Templates Refresh A practical approach to Cloud adoption Tan Teck Sun, Snr Consultant Name 13 November 2013 date Disclaimer: This material that follows is a presentation of general

SAS Enterprise Data Integration Server - A Complete Solution Designed To Meet the Full Spectrum of Enterprise Data Integration Needs

Database Systems Journal vol. III, no. 1/2012 41 SAS Enterprise Data Integration Server - A Complete Solution Designed To Meet the Full Spectrum of Enterprise Data Integration Needs 1 Silvia BOLOHAN, 2

Implement Hadoop jobs to extract business value from large and varied data sets

Hadoop Development for Big Data Solutions: Hands-On You Will Learn How To: Implement Hadoop jobs to extract business value from large and varied data sets Write, customize and deploy MapReduce jobs to

Eliminating Complexity to Ensure Fastest Time to Big Data Value

Eliminating Complexity to Ensure Fastest Time to Big Data Value Copyright 2015 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For the latest

Oracle Database 11g Comparison Chart

Key Feature Summary (Express 10g / Standard One / Standard / Enterprise): Maximum: 1 CPU / 2 Sockets / 4 Sockets / No Limit; RAM: 1GB / OS Max / OS Max / OS Max; Database Size: 4GB / No Limit / No Limit / No Limit; Windows Linux Unix

Framework Adoption for Java Enterprise Application Development

Framework Adoption for Java Enterprise Application Development Clarence Ho Independent Consultant, Author, Java EE Architect http://www.skywidesoft.com clarence@skywidesoft.com Presentation can be downloaded

A Tipping Point for Automation in the Data Warehouse. www.stonebranch.com

A Tipping Point for Automation in the Data Warehouse www.stonebranch.com Resolving the ETL Automation Problem The pressure on ETL Architects and Developers to utilize automation in the design and management

College of Engineering, Technology, and Computer Science

College of Engineering, Technology, and Computer Science Design and Implementation of Cloud-based Data Warehousing In partial fulfillment of the requirements for the Degree of Master of Science in Technology

Mike Boyarski Jaspersoft Product Marketing mboyarski@jaspersoft.com. Business Intelligence in the Cloud

Mike Boyarski Jaspersoft Product Marketing mboyarski@jaspersoft.com Business Intelligence in the Cloud Agenda Introductions Cloud BI Jaspersoft Open Source Powers the Cloud Jaspersoft Cloud BI Futures

Architecting for the Internet of Things & Big Data

Architecting for the Internet of Things & Big Data Robert Stackowiak, Oracle North America, VP Information Architecture & Big Data September 29, 2014 Safe Harbor Statement The following is intended to

Oracle Business Activity Monitoring 11g New Features

Oracle Business Activity Monitoring 11g New Features Gert Schüßler Principal Sales Consultant Oracle Deutschland GmbH Agenda Overview Architecture Enterprise Integration Framework

Search and Real-Time Analytics on Big Data

Search and Real-Time Analytics on Big Data Sewook Wee, Ryan Tabora, Jason Rutherglen Accenture & Think Big Analytics Strata New York October, 2012 Big Data: data becomes your core asset. It realizes its

HDP Hadoop From concept to deployment.

HDP Hadoop From concept to deployment. Ankur Gupta Senior Solutions Engineer Rackspace: Page 41 27th Jan 2015 Where are you in your Hadoop Journey? A. Researching our options B. Currently evaluating some

Organizations large and small are increasingly realizing that insight into (real-time) data helps

Data Virtualization, power to innovate with Agile data Drs. Patrick Berkhout, Enterprise and Software Architect, Trivento Organizations large and small are increasingly realizing that insight