The Future of Data Management Managing Data Beyond Boundaries

Similar documents
Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

How To Make Sense Of Data With Altilia

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

SAP Agile Data Preparation

From Lab to Factory: The Big Data Management Workbook

Introducing the Reimagined Power BI Platform. Jen Underwood, Microsoft

Databricks. A Primer

Ganzheitliches Datenmanagement

<no narration for this slide>

April 2016 JPoint Moscow, Russia. How to Apply Big Data Analytics and Machine Learning to Real Time Processing. Kai Wähner.

Hadoop Data Hubs and BI. Supporting the migration from siloed reporting and BI to centralized services with Hadoop

Databricks. A Primer

Data Integration Checklist

Integrating a Big Data Platform into Government:

Data Discovery, Analytics, and the Enterprise Data Hub

The Future of Data Management

INFORMATICA WORLD TOUR August 2012

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

Multichannel Customer Listening and Social Media Analytics

ABSTRACT: [Type text] Page 2109

Cisco Data Preparation

HOW TO DO A SMART DATA PROJECT

Unleash your intuition

CALIFORNIA MUNICIPAL RATES GROUP SAS VISUAL ANALYTICS APRIL 29, 2016 JENNIFER WHALEY, SR SYSTEMS ENGINEER

locuz.com Big Data Services

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy

A BUSINESS INTELLIGENCE PLATFORM

Augmented Search for IT Data Analytics. New frontier in big log data analysis and application intelligence

White Paper. Thirsting for Insight? Quench It With 5 Data Management for Analytics Best Practices.

Unified Batch & Stream Processing Platform

The Big Data Revolution: welcome to the Cognitive Era.

Amplify Serviceability and Productivity by integrating machine /sensor data with Data Science

Safe Harbor Statement

Direct-to-Company Feedback Implementations

Social Media Implementations

Business Intelligence Discover a wider perspective for your business

Management Consulting Systems Integration Managed Services WHITE PAPER DATA DISCOVERY VS ENTERPRISE BUSINESS INTELLIGENCE

Understanding Your Customer Journey by Extending Adobe Analytics with Big Data

Luncheon Webinar Series May 13, 2013

From Spark to Ignition:

Welcome. Host: Eric Kavanagh. The Briefing Room. Twitter Tag: #briefr

Bringing Strategy to Life Using an Intelligent Data Platform to Become Data Ready. Informatica Government Summit April 23, 2015

COMP9321 Web Application Engineering

SAP Predictive Analytics

BIG DATA & DATA SCIENCE

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

SAP HANA Vora : Gain Contextual Awareness for a Smarter Digital Enterprise

Oracle Big Data Discovery The Visual Face of Hadoop

Digital Business Platform for SAP

Big Data and Analytics: Challenges and Opportunities

I D C T E C H N O L O G Y S P O T L I G H T

Big Data. Fast Forward. Putting data to productive use

Big Data Executive Survey

Deploying Big Data to the Cloud: Roadmap for Success

Automated Data Ingestion. Bernhard Disselhoff Enterprise Sales Engineer

Big Data Analytics Roadmap Energy Industry

Oracle Big Data Building A Big Data Management System

A Look at Self Service BI with SAP Lumira Natasha Kishinevsky Dunn Solutions Group SESSION CODE: 1405

#mstrworld. No Data Left behind: 20+ new data sources with new data preparation in MicroStrategy 10

Data Science & Big Data Practice

Informatica and the Vibe Virtual Data Machine

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

Big Data 101: Harvest Real Value & Avoid Hollow Hype

A TECHNICAL WHITE PAPER ATTUNITY VISIBILITY

IBM Cloud Security Draft for Discussion September 12, IBM Corporation

Luis Melo Head of CRM/CX. Capventis. Policy Automation. Knowledge Management. Field Service Management. Web Customer Service

6 Steps to Faster Data Blending Using Your Data Warehouse

End-to-end Big Data Discovery with Platfora Date: May 2015 Author: Mike Leone, ESG Lab Analyst

HDP Hadoop From concept to deployment.

Key Findings Advanced, Predictive Analytics Breaking the Barriers to Adoption

How To Use Big Data For Business

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

How to Leverage Big Data in the Cloud to Gain Competitive Advantage

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data

Implementation of Big Data and Analytics Projects with Big Data Discovery and BICS March 2015

2015 Analyst and Advisor Summit. Advanced Data Analytics Dr. Rod Fontecilla Vice President, Application Services, Chief Data Scientist

Understanding traffic flow

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

The Enterprise Data Hub and The Modern Information Architecture

Tableau Metadata Model

Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC

Roadmap Talend : découvrez les futures fonctionnalités de Talend

Capital Market Day 2015

New Features in Oracle Application Express 4.1. Oracle Application Express Websheets. Oracle Database Cloud Service

Twitter Tag: #briefr 8/14/12

Qlik Sense Enterprise

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013

Big Data Integration: A Buyer's Guide

Addressing Risk Data Aggregation and Risk Reporting Ben Sharma, CEO. Big Data Everywhere Conference, NYC November 2015

ORACLE ENTERPRISE DATA QUALITY PRODUCT FAMILY

ANALYTICS IN BIG DATA ERA

1.1.1 Introduction to Cloud Computing

Self-Service Business Intelligence

Augmented Search for Software Testing

Transcription:

The Future of Data Management Managing Data Beyond Boundaries Ron Agresta Director of Product Management - Data Management Product Line SAS #AnalyticsX

The Future of Data Management Managing Data Beyond Boundaries #analyticsx

1969

2006 2016

SAS Global Forum Abstract Term Comparison 2006 Enterprise Guide ODS IntrNet Macros Excel Model ETL Programmer PROC SQL Enterprise Miner 2016 Enterprise Miner Enterprise Guide Analytics Big Data Hadoop DATA Step DS2 Forecasting Amazon Grid #analyticsx

New Directions in Data Management

2006

The Problem with Data Management Today #analyticsx Manually intensive Desktop-based tools ETL-based processes Disenfranchised business users Opaque data provisioning techniques Advanced analytics often the end, not the means

Trends Scalability Hybrid Cloud Deployments Big Data Operational Integration Customers have an increasing number of data domains fueled by data from an increasing number of sources of many different types Customers have a desire to seamlessly combine data and processes operating locally and in the Cloud Customers want to take advantage of investments in and promise of new big data infrastructure #analyticsx Self-Service Customers see a proliferation of internal and external APIs and task-specific applications that beg for an integrated user experience Customers want to empower their own technical and non-technical users with an App-like experience that minimizes IT involvement

Challenges Extracting Value from Data #analyticsx Prioritizing data to generate insights 65% Integrating multiple sources of data 62% Determing the right data sets to collect and analyze 56% Managing data in a privacy-compliant environment 35% 0% 10% 20% 30% 40% 50% 60% 70% Source: Forbes Magazine, May 2016

#analyticsx Business Users Non-technical users looking for an intuitive, app-like experience that decreases the time it takes to go from data acquisition to answers. Coders, DBAs, IT Technical experts and traditional data owners who may be reluctant to give up control of data cleansing, ingestion, and provisioning responsibilities. Data Scientists Domain experts with deep knowledge looking for an environment where it s easy to onboard and transform data to foster rapid experimentation.

Managed Data Data Management Activities Monitored Data Governed Data ETL Developer or Coder Team/Department Repeated Use IT Staff Cross-Team Measured Use Data Stewards Cross-Department Administered Use Governance Coding, Job Building, Tuning, & Scheduling in-sas in-memory in-database

Data Management in Action Access Profile Clean Transform Load Schedule Monitor Share

Fad or Fundamental Changes? 1968 1970 #analyticsx 1969 1971

The New Normal Interactive data preparation Intelligent ingestion In-stream data management #analyticsx Cognitive data management Runtime auto-deployment and optimization Big data quality and governance Data management anywhere (on any data)

New Data Management in Action Discover Ingest Profile Categorize Analyze Suggest Interactively Transform Optimize Deploy Operationalize Monitor

Discover & Ingest Yesterday Relational databases Text files ERP systems Excel files XML Today Web services Twitter LinkedIn YouTube Data streams Cloud platforms JSON Hadoop HDFS Spark Machine data Doc management systems #analyticsx

#analyticsx Profile & Categorize Automatically generated profiling metrics let users quickly find problems in their data with little manual investigation. Combined data profiling and advanced analytics can lead to semantic tagging where the meaning of data can be inferred. Tagged data then leads to any number of automated actions.

#analyticsx Analyze & Suggest Software-delivered suggestions (Apple Siri, Google Now, Microsoft Cortana, Netflix, etc.) are now the norm. Users intuitively know that data available behind the scenes can greatly reduce manual investigation of problems, and once analyzed, can lead to insightful recommendations. These techniques: Inferring data meaning by tags Comparing data to other data Analyzing usage patterns Evaluating data decay and fetching new data automatically Tracking data provenance Can lead to: Automatic standardization Smart data parsing and merging Data invalidation alerts Real-time data ranking Security policy violation detection Advanced pattern discovery

Data Preparation #analyticsx Actions necessary to transform data into the appropriate structure and format for operational, reporting, business intelligence, or analytics use.

Interactive Transformation #analyticsx Interactive data preparation lets users explore, manipulate and combine data. Transformation can occur in real-time. Suggestions can be reviewed and then confirmed, modified, or rejected. A set of instructions that can be saved and delivered to the appropriate execution environment is generated as a result of user-driven data transformations. Clean, match, standardize, split, combine, filter, join, merge, hide, arrange, format, sort, append, augment, enrich

Big (and Little) Data on Big Servers #analyticsx Powerful commodity-based computing has changed some of the underlying assumptions about how to empower both business users and data experts working in the data preparation area. It is expected that complex transformations on millions of rows will be completed in seconds, not minutes or hours.

Optimize & Deploy Jobs or code generated from data management activities can be: #analyticsx Analyzed to determine an appropriate runtime environment Optimized for the target runtime environment Automatically deployed and monitored with processing metrics routed back to machine learning algorithms for improved results Code Generation, Optimization, Process Distribution, & Monitoring in-sas in-hadoop in-memory in-database in-cloud in-stream

Data Management Anywhere Interest is growing for data management that spans deployment locations yet provides integrated movement, transformation, monitoring, and governance capabilities. #analyticsx Local Data Local Processing On-Premises Only Remote Data Remote Processing On-Premises + Cloud Data On-Premises + Cloud Platform Cloud Only

The Promise of the Cloud Opportunities for integration with new and always current data sources Storage of large data volumes can be cheaper, especially for less frequently accessed data A way to test-drive new technologies Elastic infrastructure Platform for distributed teams and distributed applications #analyticsx Rapid on-boarding of cloud data sources through packaged connectors Little to no IT involvement Guaranteed up-time Self-service enablement Cloud providers may offer "value added" services

#analyticsx A Few Words about Data Governance This data free-for-all introduces new challenges for the enterprise that only a comprehensive data management environment with integrated data governance can address. TRUST

New Users Report builders Data scientists Data analysts Compliance staff Managed Data ETL Developer or Coder Team/Department Repeated Use Data Management Activities Monitored Data IT Staff Cross-Team Measured Use Governed Data Data Stewards Cross-Department Administered Use Governance Coding, Job Building, Tuning, & Scheduling New Processes Data discovery Different runtimes Alternate data locations Machine learning in-sas in-memory in-database

EXAMPLES: Interactive data quality & rule capture Ad Hoc Data A New Data Management Paradigm Data quality rule deployment Managed Data Data quality process monitoring Monitored Data Policy rules for data quality Governed Data Auto-profiling and data quality rule discovery Activated Data Data Scientist or Analyst ETL Developer or Coder IT Staff Data Stewards Automated Individual Self-Service Team/Department Repeated Use Cross-Team Measured Use Cross-Department Administered Use Cross-Organization Always-On Governance Code Generation, Optimization, Process Distribution, & Monitoring in-sas in-hadoop in-memory in-database in-cloud in-stream

Projects with collaboration built-in DESIGN PROTOTYPE

Import data on the fly DESIGN PROTOTYPE

Discovery, profiling, and categorization DESIGN PROTOTYPE

Data Preparation Data preparation with deep data quality DESIGN PROTOTYPE

Monitoring Process and data monitoring combined DESIGN PROTOTYPE

Data Lineage Comprehensive lineage and auditing DESIGN PROTOTYPE

Impact Analy sis Impact analysis from data sources to targets DESIGN PROTOTYPE

Data Management Evolves #analyticsx Self-service data management is here but it doesn t replace the need for managed, monitored, and governed data Cognitive data management is on the rise where deep data analysis uncovers hidden patterns and relationships that enable suggestions and data process automation

Data Management Evolves #analyticsx Technologies such as Hadoop, data streaming, inmemory processing and hybrid Cloud deployments require a mix of standard and novel approaches Democratizing data management is great as long as users are protected from actions that might lead to incorrect results or security risks

2016

#AnalyticsX