Metadata Management as part of Data Governance and Data Stewardship at ebay, Inc. Mark Uksusman Sr. Manager, Enterprise Data Architecture, ebay, Inc.

Size: px
Start display at page:

Download "Metadata Management as part of Data Governance and Data Stewardship at ebay, Inc. Mark Uksusman Sr. Manager, Enterprise Data Architecture, ebay, Inc."

Transcription

1 Metadata Management as part of Data Governance and Data Stewardship at ebay, Inc. Mark Uksusman Sr. Manager, Enterprise Data Architecture, ebay, Inc.

2 50,000 Categories of Products on ebay.com

3 Scale at ebay $60 billion per year in goods are sold on ebay >100k data elements >50 TB/day new data >100 Trillion pairs of informagon >100 PB/day >50k chains of logic 200 million items listed for sale AcGve/AcGve x7x Always online >7500 business users & analysts Big Data turning over a TB every second Millions of queries/day % Availability Near- Real- Gme

4 We manage more than 100 billion database calls per day 5 petabytes of live OLTP site data 500 Oracle based servers 80 petabytes of space available for analygcal data Teradata Hadoop With developers changing live schema every day! 4

5 ebay Data Warehouse 500+ concurrent users Analyze & Report OperaGonal AnalyGcs TransacGonal AnalyGcs High volume ad hoc queries 100+ concurrent users Discover & Explore Compare User AcGvity against last year Trending and Forecast Analysis (large history) >5 concurrent users Image FingerprinGng Image ClassificaGon Pabern RecogniGon Production Data Warehousing Large Concurrent User-base Enterprise- class System Structural Data Contextual-Complex Analytics Deep, Seasonal, Consumable Data Sets Low End Enterprise- class System Semi- Structural Data Structure the Unstructured Detect Patterns Java /C Developer System Teradata EDW/ODW 8+PB Teradata / Singularity 44+ PB Hadoop 50+ PB 450 Data Models; 7,000 tables and 100,000 columns

6 Data Warehouse SoluRon PorSolio It is not a Silver Bullet for a DW soluron Workload Management IO Concurrency CPU Storage EDW Singularity Hadoop Flexibility Governance

7

8 Virtual Data Mart Request

9 Why Metadata Management? Before: MulRple, Unrelated Metadata Now: One, Integrated Global Directory

10 Why Data Governance? The organizaron without data governance

11 Why Data Stewardship and Global Directory? Data TransformaGons from system to system Data and System Availability Data DefiniGon problems Business requests, Self Service AnalyGcs Source data (SOR) problems Lack Lack of of accountability + Failure to communicate Failure to communicate + Availability problems Security Data ClassificaGons Data Surprises! Lack of trust in the data = (and results)

12 Data Governance and Data Management Data Management Data Architecture Business Domain Modeling Physical Data Modeling Logical Data Modeling Database Design Data Governance and Stewardship Metadata Management Data Governance Enterprise InformaRon IntegraRon Data lineage

13 Data Architecture and Data Governance Framework Data Governance commicee Compliance Monitoring and Enforcement ExecuRon and Decision- Making Leadership Data PlaSorm and MDM Policy DefiniRon Planning and CoordinaRon Data Architecture, Data Quality and Stewardship Data Model / Metadata SOR / Data Topology / Data Flow Data Quality Assessment Ongoing Data Cleansing and Conversion Business Terms /Metrics / Data Quality Data Governance The formal orchestragon of people, processes, and technology to enable an organizagon to leverage data as an enterprise asset. Arbitrate issues and enforce the rules CoordinaGon and compliance Establish policies, procedures, success metrics and processes to maintain quality data IdenGfy all business and applicagon stakeholders data owners (stewards ) Conduct audit and control CommunicaGon and change management Coordinate creagon of data definigons and data lineage

14 Global Directory System IntegraGon

15 ebay Data PlaSorm and Global Directory

16 Data Stewardship and Data Stewards objecrves Data Stewardship is opera5onal aspect of data governance A Data Stewardship is a key part of an overall Data Governance program. Data Stewardship is fully integrated with creagon of AnalyGcal Reports and product development life cycle. The Data Stewardship team is allocagng required Data Stewards to finalize solugon and document findings in MDR Data Steward is a role assigned to a person that is responsible to define and maintain metrics, business definigons and system of record (SOR) for business data elements. The data stewards are decision- makers about the data Data Steward ensures that each assigned data element and metric are defined in MDR Data Stewards are creagng Business and Technical Metadata in MDR.

17 Data Governance OrganizaRon The diagram shows the organizaron structure of the Data Governance Body, relaronship between the Data Governance Lead, Data Owners, Enterprise Data Stewards and Domain Data Stewards. Data Governance commicee Business and Technology sponsors and main stakeholders Chief Data Steward / Manager of Data Stewardship team Enterprise Data Stewards Team of Data Stewards for each Business Unit / Business Domain - Business and Technology stewards Takes care of data assets from Business and Technology perspecgves Data Owner Owns physical data assets Business Data Owners Technology Data Owners Domain & Project Data Stewards Experts in specific domains and data areas Data Quality, SAE, PM, OPS, BSA, Data Architects, Domain Architects, Data Analysts

18 ebay, Inc. Master Data Management Program Link customer data across ebay, Inc. Making data more accessible to business Integrated Customer Integrate data sources across enterprise Enterprise data integration

19 MDM and 360 Customer view ebay, Inc. Customer Multiple Accounts Multiple Organizations Multiple Data Sources AWs357 AbGrWe11 GrWHockey24 ebay Shopping Bill Me Later Milo GSI Master Data Transactions Contacts/Event Master Data Transactions Contacts/Event Where PayPal Master Data Transactions Contacts/Event Stub Hub!

20 360 Customer Data Enrichment- What is Acxiom Data? Demographic data about US users Individual and household informagon Enrichment of ebay, PayPal and StubHub customer data from Acxiom 20

21 Which users are enriched with Acxiom data? U.S. ebay members who have been acgve from Jan 2004 PLUS all US users since Jan 2007 New users sent to Acxiom monthly to obtain demographic data and account linking. Full file is refreshed annually Quarterly updates (address changes) are received 21

22 IdenRficaRon Individual ID Sample name/address Household ID Account and User ID ebay / PayPal / StubHub / Shopping / Milo / Where 22

23 Household ComposiRon Marital Status Children Number of Children Number of Adults Household Size 23

24 Demographics Age in Two- Year Increments - Input Individual Gender - Input Individual Adult Age Ranges Present in Household Children s Age Ranges Present in Household 24

25 Residence InformaRon Home Owner/Renter- Premier Length of Residence Dwelling Type Property Type 25

26 Professional / EducaRonal Business Owner OccupaGon Income Household EducaGon 26

27 Buyer Behavior Mail Order Responder Categories Mail Order Buyer (various categories) Credit Card Users PayPal Users 27

28 Lifestyle / Interests Fashion Photography VCR / LD / DVD Movie Collector Pets - Dog Owner CollecGbles - General Sports Grouping Outdoors Grouping Travel Grouping Reading Grouping Cooking / Food Grouping Exercise / Health Grouping Stereo / Video Grouping Electronics / Computers Grouping Home Improvement Grouping InvesGng / Finance Grouping CollecGbles - AnGques - Grouping

29 MDM and 360 view of the customer Account: Gwin357 1 account GMB: $795 (C) GMV: $0 Primary Cat: Auto Parts Pays with PayPal. BBE: INR on 7/5/08 Individual: Greg Wingard 4 accounts (2 ebay, 2 PP) Male, 35 years old. GMB: $2,550 (B) GMV: $22,250 (Silver) TPV: $32,575 (A) Primary Cat: + Sports Sells Collectibles Household: Greg & Abigail Wingard 5 accounts (3 ebay, 2 PP) Married, 2 children Income: $250k Net Worth: $850k Own Home, $900k value, suburbs GMB: + $3,200 (B) Primary Category : + Fashion Off-eBay purchases: Media Interests: Fashion, Sports and Finance

30 Business Domain Model and Data Stewardship

31 Metadata Management and Agile Data Warehousing TransacRonal / Behavioral / Experimental / OperaRonal Data Agile Model Driven Development (AMDD) methodology is uglized Data Models are fully integrated with Global Directory Technical and Business Metadata are fully integrated Cross- domain analysis is available. Virtual Data Marts are generated in real Gme Developed powerful methodology for Big Data AnalyGcs

32 Keys to success of Data Stewardship, Metadata Management and MDM program Processes and Tools Data Architecture Steering Commibee Data Governance Agile Data Modeling Master Data Management Product Development Common Dimensional Modeling Metadata Management Technical and Business Metadata Compliance ReporGng System integragon ASG Rochade technology is used for Technical and Business Metadata

33 Global Directory / MDR Data Browsing Metadata is organized by Business Domains Lists of Data Models

34 Compliance ReporRng: Data Models vs. Systems Compliance report is automagcally generated and sent to distribugon list of execugves, managers and developers Teradata Data Model Compliance Audit Report Date Compliance Missing Tables Different Tables % Compliance 1/4 0.0 Report Date: Oct 24, :57:31 AM 8/ Wildcat Scan Date: Oct 24, :37:10 AM 8/ Caracal Scan Date: Oct 24, :36:21 AM / / Overall Audit Report Compliance: 98.4% 9/ / View Compliance by Subject Area 10/ Data Model Data Warehouse In Model but not in Data Warehouse Data Type Difference Between Data Model and Data Warehouse In Data Warehouse but not in Data Model 10/ / Comments A B C D E / Table 3,700 3, Count Column Count 57,793 53, A B C D E Comments Total count of tables (including persistent tables and indices) or columns in the data model. Total count of tables (including persistent tables and indices) and columns that are in the data warehouse. Count of tables (including persistent tables and indices) and columns that occur in the model but not in the data warehouse. Primarily consists of projects that are modeled but have not been released into production. Count of columns (and the associated tables) that occur in both the data model and the data warehouse, but the data types are different. The column must exist in both the data model and the data warehouse to be compared. Count of all objects in the data warehouse production databases, entire tables, persistent tables and indices and the associated columns that are missing from the data model plus a count of columns that are missing in the data model from tables that exist in both the data model and the data warehouse. Views and working databases are not included Number of Missing Tables /22 8/29 9/5 9/12 9/19 9/26 10/3 10/10 10/17 10/ Date Table Audit Trend 8/22 8/29 9/5 9/12 9/19 9/26 10/3 10/10 10/17 10/24 Date Number of Different Tables Missing Tables Different Tables

35 Wiki and Global Directory integration! MDR Wiki Real Rme metadata updates IntegraRon of Wiki and Metadata Management One single source of metadata Dynamic Wiki pages

36 Global Directory metaglossary ApplicaRon Business Glossary Business Terms Business Metrics Approval workflow Advanced search Contains Links to: External Docs(Wiki) Logical data objects Physical data objects Reports and metrics The total number of feedback comments received The total number of feedback comments received The total number of feedback comments received The number of posigve comments displayed as a percent of The number of posigve comments displayed as a percent of

37 Global Directory and Agile Data Architecture Guided Reports Dimensional Models MicroStrategy cubes

38 Questions?

39 Contact InformaGon If you have further quesgons or comments: Mark Uksusman Sr. Manager, Enterprise Data Architecture ebay, Inc

Analytics as a Service. OLIVER RATZESBERGER Sr. Director Architecture & Operations Cloud Computing, Analytics as a Service ebay inc.

Analytics as a Service. OLIVER RATZESBERGER Sr. Director Architecture & Operations Cloud Computing, Analytics as a Service ebay inc. Analytics as a Service OLIVER RATZESBERGER Sr. Director Architecture & Operations Cloud Computing, Analytics as a Service ebay inc. June 2009 ebay Inc Overview As of December 31, 2007 Founded in September

More information

TECHNOLOGY TRANSFER PRESENTS MIKE FERGUSON NEXT GENERATION DATA MANAGEMENT BUILDING AN ENTERPRISE DATA RESERVOIR AND DATA REFINERY

TECHNOLOGY TRANSFER PRESENTS MIKE FERGUSON NEXT GENERATION DATA MANAGEMENT BUILDING AN ENTERPRISE DATA RESERVOIR AND DATA REFINERY TECHNOLOGY TRANSFER PRESENTS MIKE FERGUSON NEXT GENERATION DATA MANAGEMENT BUILDING AN ENTERPRISE DATA RESERVOIR AND DATA REFINERY MAY 11-13, 2015 RESIDENZA DI RIPETTA - VIA DI RIPETTA, 231 ROME (ITALY)

More information

The Enterprise Data Hub and The Modern Information Architecture

The Enterprise Data Hub and The Modern Information Architecture The Enterprise Data Hub and The Modern Information Architecture Dr. Amr Awadallah CTO & Co-Founder, Cloudera Twitter: @awadallah 1 2013 Cloudera, Inc. All rights reserved. Cloudera Overview The Leader

More information

MDM and Data Warehousing Complement Each Other

MDM and Data Warehousing Complement Each Other Master Management MDM and Warehousing Complement Each Other Greater business value from both 2011 IBM Corporation Executive Summary Master Management (MDM) and Warehousing (DW) complement each other There

More information

Data Warehousing. Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de. Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1

Data Warehousing. Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de. Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1 Jens Teubner Data Warehousing Winter 2015/16 1 Data Warehousing Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de Winter 2015/16 Jens Teubner Data Warehousing Winter 2015/16 13 Part II Overview

More information

Establish and maintain Center of Excellence (CoE) around Data Architecture

Establish and maintain Center of Excellence (CoE) around Data Architecture Senior BI Data Architect - Bensenville, IL The Company s Information Management Team is comprised of highly technical resources with diverse backgrounds in data warehouse development & support, business

More information

Enterprise Data Governance

Enterprise Data Governance Enterprise Aligning Quality With Your Program Presented by: Mark Allen Sr. Consultant, Enterprise WellPoint, Inc. (mark.allen@wellpoint.com) 1 Introduction: Mark Allen is a senior consultant and enterprise

More information

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance IBM s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice

More information

The Key Components of a Data Governance Program. John R. Talburt, PhD, IQCP University of Arkansas at Little Rock Black Oak Analytics, Inc

The Key Components of a Data Governance Program. John R. Talburt, PhD, IQCP University of Arkansas at Little Rock Black Oak Analytics, Inc The Key Components of a Data Governance Program John R. Talburt, PhD, IQCP University of Arkansas at Little Rock Black Oak Analytics, Inc My Background Currently University of Arkansas at Little Rock Acxiom

More information

The Future of Data Management

The Future of Data Management The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class

More information

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ End to End Solution to Accelerate Data Warehouse Optimization Franco Flore Alliance Sales Director - APJ Big Data Is Driving Key Business Initiatives Increase profitability, innovation, customer satisfaction,

More information

Using SAP Master Data Technologies to Enable Key Business Capabilities in Johnson & Johnson Consumer

Using SAP Master Data Technologies to Enable Key Business Capabilities in Johnson & Johnson Consumer Using SAP Master Data Technologies to Enable Key Business Capabilities in Johnson & Johnson Consumer Terry Bouziotis: Director, IT Enterprise Master Data Management JJHCS Bob Delp: Sr. MDM Program Manager

More information

Master Data Management. Zahra Mansoori

Master Data Management. Zahra Mansoori Master Data Management Zahra Mansoori 1 1. Preference 2 A critical question arises How do you get from a thousand points of data entry to a single view of the business? We are going to answer this question

More information

Cloud First Does Not Have to Mean Cloud Exclusively. Digital Government Institute s Cloud Computing & Data Center Conference, September 2014

Cloud First Does Not Have to Mean Cloud Exclusively. Digital Government Institute s Cloud Computing & Data Center Conference, September 2014 Cloud First Does Not Have to Mean Cloud Exclusively Digital Government Institute s Cloud Computing & Data Center Conference, September 2014 Am I part of a cloud first organization? Am I part of a cloud

More information

Offload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper

Offload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper Offload Enterprise Data Warehouse (EDW) to Big Data Lake Oracle Exadata, Teradata, Netezza and SQL Server Ample White Paper EDW (Enterprise Data Warehouse) Offloads The EDW (Enterprise Data Warehouse)

More information

Bringing Strategy to Life Using an Intelligent Data Platform to Become Data Ready. Informatica Government Summit April 23, 2015

Bringing Strategy to Life Using an Intelligent Data Platform to Become Data Ready. Informatica Government Summit April 23, 2015 Bringing Strategy to Life Using an Intelligent Platform to Become Ready Informatica Government Summit April 23, 2015 Informatica Solutions Overview Power the -Ready Enterprise Government Imperatives Improve

More information

How to Enhance Traditional BI Architecture to Leverage Big Data

How to Enhance Traditional BI Architecture to Leverage Big Data B I G D ATA How to Enhance Traditional BI Architecture to Leverage Big Data Contents Executive Summary... 1 Traditional BI - DataStack 2.0 Architecture... 2 Benefits of Traditional BI - DataStack 2.0...

More information

Oracle Data Integrator: Administration and Development

Oracle Data Integrator: Administration and Development Oracle Data Integrator: Administration and Development What you will learn: In this course you will get an overview of the Active Integration Platform Architecture, and a complete-walk through of the steps

More information

Master Data Management and Data Warehousing. Zahra Mansoori

Master Data Management and Data Warehousing. Zahra Mansoori Master Data Management and Data Warehousing Zahra Mansoori 1 1. Preference 2 IT landscape growth IT landscapes have grown into complex arrays of different systems, applications, and technologies over the

More information

Explore the Possibilities

Explore the Possibilities Explore the Possibilities 2013 HR Service Delivery Forum Best Practices in Data Management: Creating a Sustainable and Robust Repository for Reporting and Insights 2013 Towers Watson. All rights reserved.

More information

MDM for the Enterprise: Complementing and extending your Active Data Warehousing strategy. Satish Krishnaswamy VP MDM Solutions - Teradata

MDM for the Enterprise: Complementing and extending your Active Data Warehousing strategy. Satish Krishnaswamy VP MDM Solutions - Teradata MDM for the Enterprise: Complementing and extending your Active Data Warehousing strategy Satish Krishnaswamy VP MDM Solutions - Teradata 2 Agenda MDM and its importance Linking to the Active Data Warehousing

More information

Luncheon Webinar Series May 13, 2013

Luncheon Webinar Series May 13, 2013 Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration

More information

Washington State s Use of the IBM Data Governance Unified Process Best Practices

Washington State s Use of the IBM Data Governance Unified Process Best Practices STATS-DC 2012 Data Conference July 12, 2012 Washington State s Use of the IBM Data Governance Unified Process Best Practices Bill Huennekens Washington State Office of Superintendent of Public Instruction,

More information

Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth

Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth MAKING BIG DATA COME ALIVE Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth Steve Gonzales, Principal Manager steve.gonzales@thinkbiganalytics.com

More information

Enterprise Data Governance

Enterprise Data Governance DATA GOVERNANCE Enterprise Data Governance Strategies and Approaches for Implementing a Multi-Domain Data Governance Model Mark Allen Sr. Consultant, Enterprise Data Governance WellPoint, Inc. 1 Introduction:

More information

Improving your Data Warehouse s IQ

Improving your Data Warehouse s IQ Improving your Data Warehouse s IQ Derek Strauss Gavroshe USA, Inc. Outline Data quality for second generation data warehouses DQ tool functionality categories and the data quality process Data model types

More information

Investor Presentation. Second Quarter 2015

Investor Presentation. Second Quarter 2015 Investor Presentation Second Quarter 2015 Note to Investors Certain non-gaap financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences

More information

IBM InfoSphere Discovery: The Power of Smarter Data Discovery

IBM InfoSphere Discovery: The Power of Smarter Data Discovery IBM InfoSphere Discovery: The Power of Smarter Data Discovery Gerald Johnson IBM Client Technical Professional gwjohnson@us.ibm.com 2010 IBM Corporation Objectives To obtain a basic understanding of the

More information

Ganzheitliches Datenmanagement

Ganzheitliches Datenmanagement Ganzheitliches Datenmanagement für Hadoop Michael Kohs, Senior Sales Consultant @mikchaos The Problem with Big Data Projects in 2016 Relational, Mainframe Documents and Emails Data Modeler Data Scientist

More information

Apache Kylin Introduction Dec 8, 2014 @ApacheKylin

Apache Kylin Introduction Dec 8, 2014 @ApacheKylin Apache Kylin Introduction Dec 8, 2014 @ApacheKylin Luke Han Sr. Product Manager lukhan@ebay.com @lukehq Yang Li Architect & Tech Leader yangli9@ebay.com Agenda What s Apache Kylin? Tech Highlights Performance

More information

Data Warehousing and Analytics Infrastructure at Facebook. Ashish Thusoo & Dhruba Borthakur athusoo,dhruba@facebook.com

Data Warehousing and Analytics Infrastructure at Facebook. Ashish Thusoo & Dhruba Borthakur athusoo,dhruba@facebook.com Data Warehousing and Analytics Infrastructure at Facebook Ashish Thusoo & Dhruba Borthakur athusoo,dhruba@facebook.com Overview Challenges in a Fast Growing & Dynamic Environment Data Flow Architecture,

More information

Oracle Big Data SQL Technical Update

Oracle Big Data SQL Technical Update Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical

More information

The Role of the Analyst in Business Analytics. Neil Foshay Schwartz School of Business St Francis Xavier U

The Role of the Analyst in Business Analytics. Neil Foshay Schwartz School of Business St Francis Xavier U The Role of the Analyst in Business Analytics Neil Foshay Schwartz School of Business St Francis Xavier U Contents Business Analytics What s it all about? Development Process Overview BI Analyst Role Questions

More information

NOS for Data Management (801) September 2014 V1.3

NOS for Data Management (801) September 2014 V1.3 NOS for Data Management (801) September 2014 V1.3 NOS Reference ESKITP801301 ESKITP801401 ESKITP801501 ESKITP801601 NOS Title Assist in Delivering the Data Management Infrastructure to Support Data Analysis

More information

Oracle Business Intelligence 11g Business Dashboard Management

Oracle Business Intelligence 11g Business Dashboard Management Oracle Business Intelligence 11g Business Dashboard Management Thomas Oestreich Chief EPM STrategist Tool Proliferation is Inefficient and Costly Disconnected Systems; Competing Analytic

More information

James Serra Data Warehouse/BI/MDM Architect JamesSerra3@gmail.com JamesSerra.com

James Serra Data Warehouse/BI/MDM Architect JamesSerra3@gmail.com JamesSerra.com James Serra Data Warehouse/BI/MDM Architect JamesSerra3@gmail.com JamesSerra.com Agenda Do you need Master Data Management (MDM)? Why Master Data Management? MDM Scenarios & MDM Hub Architecture Styles

More information

EMC/Greenplum Driving the Future of Data Warehousing and Analytics

EMC/Greenplum Driving the Future of Data Warehousing and Analytics EMC/Greenplum Driving the Future of Data Warehousing and Analytics EMC 2010 Forum Series 1 Greenplum Becomes the Foundation of EMC s Data Computing Division E M C A CQ U I R E S G R E E N P L U M Greenplum,

More information

Master Data Management Architecture

Master Data Management Architecture Master Data Management Architecture Version Draft 1.0 TRIM file number - Short description Relevant to Authority Responsible officer Responsible office Date introduced April 2012 Date(s) modified Describes

More information

Week 13: Data Warehousing. Warehousing

Week 13: Data Warehousing. Warehousing 1 Week 13: Data Warehousing Warehousing Growing industry: $8 billion in 1998 Range from desktop to huge: Walmart: 900-CPU, 2,700 disk, 23TB Teradata system Lots of buzzwords, hype slice & dice, rollup,

More information

Service Oriented Data Management

Service Oriented Data Management Service Oriented Management Nabin Bilas Integration Architect Integration & SOA: Agenda Integration Overview 5 Reasons Why Is Critical to SOA Oracle Integration Solution Integration

More information

Facebook s Petabyte Scale Data Warehouse using Hive and Hadoop

Facebook s Petabyte Scale Data Warehouse using Hive and Hadoop Facebook s Petabyte Scale Data Warehouse using Hive and Hadoop Why Another Data Warehousing System? Data, data and more data 200GB per day in March 2008 12+TB(compressed) raw data per day today Trends

More information

Before You Buy: A Checklist for Evaluating Your Analytics Vendor

Before You Buy: A Checklist for Evaluating Your Analytics Vendor Executive Report Before You Buy: A Checklist for Evaluating Your Analytics Vendor By Dale Sanders Sr. Vice President Health Catalyst Embarking on an assessment with the knowledge of key, general criteria

More information

Metadata Application Understanding Software Migration

Metadata Application Understanding Software Migration Metadata Application Understanding Software Migration Jens-Uwe Richter Mgr. of Development Agenda The Rochade Metadata Landscape Governance, Compliancy, Regulation The Art to Master it About Sharing Information

More information

What's New in SAS Data Management

What's New in SAS Data Management Paper SAS034-2014 What's New in SAS Data Management Nancy Rausch, SAS Institute Inc., Cary, NC; Mike Frost, SAS Institute Inc., Cary, NC, Mike Ames, SAS Institute Inc., Cary ABSTRACT The latest releases

More information

iway Roadmap Michael Corcoran Sr. VP Corporate Marketing

iway Roadmap Michael Corcoran Sr. VP Corporate Marketing 16.06.2015 iway Roadmap Michael Corcoran Sr. VP Corporate Marketing iway 7 Products 1 iway 7 Products iway 7 Products 360 Viewer Remediation Sentinel Portal Golden Record Search and View Omni Patient Data

More information

A McKnight Associates, Inc. White Paper: Effective Data Warehouse Organizational Roles and Responsibilities

A McKnight Associates, Inc. White Paper: Effective Data Warehouse Organizational Roles and Responsibilities A McKnight Associates, Inc. White Paper: Effective Data Warehouse Organizational Roles and Responsibilities Numerous roles and responsibilities will need to be acceded to in order to make data warehouse

More information

Why You Still Need to Master Your Data Before You Master Your Business (Intelligence) Business Imperatives Addressed By Reliable, Integrated View

Why You Still Need to Master Your Data Before You Master Your Business (Intelligence) Business Imperatives Addressed By Reliable, Integrated View Why You Still Need to Master Your Data Before You Master Your Business (Intelligence) Business Imperatives Addressed By Reliable, Integrated View David Jordan Data Management Product Specialist 1 2 A simple

More information

Existing Technologies and Data Governance

Existing Technologies and Data Governance Existing Technologies and Data Governance Adriaan Veldhuisen Product Manager Privacy & Security Teradata, a Division of NCR 10 June, 2004 San Francisco, CA 6/10/04 1 My Assumptions for Data Governance

More information

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014 BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014 Ralph Kimball Associates 2014 The Data Warehouse Mission Identify all possible enterprise data assets Select those assets

More information

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence Appliances and DW Architectures John O Brien President and Executive Architect Zukeran Technologies 1 TDWI 1 Agenda What

More information

Whitepaper Data Governance Roadmap for IT Executives Valeh Nazemoff

Whitepaper Data Governance Roadmap for IT Executives Valeh Nazemoff Whitepaper Data Governance Roadmap for IT Executives Valeh Nazemoff The Challenge IT Executives are challenged with issues around data, compliancy, regulation and making confident decisions on their business

More information

DISCIPLINE DATA GOVERNANCE GOVERN PLAN IMPLEMENT

DISCIPLINE DATA GOVERNANCE GOVERN PLAN IMPLEMENT DATA GOVERNANCE DISCIPLINE Whenever the people are well-informed, they can be trusted with their own government. Thomas Jefferson PLAN GOVERN IMPLEMENT 1 DATA GOVERNANCE Plan Strategy & Approach Data Ownership

More information

DATA GOVERNANCE AND DATA QUALITY

DATA GOVERNANCE AND DATA QUALITY DATA GOVERNANCE AND DATA QUALITY Kevin Lewis Partner Enterprise Management COE Barb Swartz Account Manager Teradata Government Systems Objectives of the Presentation Show that Governance and Quality are

More information

Real World Big Data Architecture - Splunk, Hadoop, RDBMS

Real World Big Data Architecture - Splunk, Hadoop, RDBMS Copyright 2015 Splunk Inc. Real World Big Data Architecture - Splunk, Hadoop, RDBMS Raanan Dagan, Big Data Specialist, Splunk Disclaimer During the course of this presentagon, we may make forward looking

More information

Oracle BI Application: Demonstrating the Functionality & Ease of use. Geoffrey Francis Naailah Gora

Oracle BI Application: Demonstrating the Functionality & Ease of use. Geoffrey Francis Naailah Gora Oracle BI Application: Demonstrating the Functionality & Ease of use Geoffrey Francis Naailah Gora Agenda Oracle BI & BI Apps Overview Demo: Procurement & Spend Analytics Creating a ad-hoc report Copyright

More information

NoSQL for SQL Professionals William McKnight

NoSQL for SQL Professionals William McKnight NoSQL for SQL Professionals William McKnight Session Code BD03 About your Speaker, William McKnight President, McKnight Consulting Group Frequent keynote speaker and trainer internationally Consulted to

More information

Getting Started Practical Input For Your Roadmap

Getting Started Practical Input For Your Roadmap Getting Started Practical Input For Your Roadmap Mike Ferguson Managing Director, Intelligent Business Strategies BA4ALL Big Data & Analytics Insight Conference Stockholm, May 2015 About Mike Ferguson

More information

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics

More information

BENEFITS OF AUTOMATING DATA WAREHOUSING

BENEFITS OF AUTOMATING DATA WAREHOUSING BENEFITS OF AUTOMATING DATA WAREHOUSING Introduction...2 The Process...2 The Problem...2 The Solution...2 Benefits...2 Background...3 Automating the Data Warehouse with UC4 Workload Automation Suite...3

More information

Cloud Ready Data: Speeding Your Journey to the Cloud

Cloud Ready Data: Speeding Your Journey to the Cloud Cloud Ready Data: Speeding Your Journey to the Cloud Hybrid Cloud first Born to the cloud 3 Am I part of a Cloud First organization? Am I part of a Cloud First agency? The cloud applications questions

More information

THOMAS RAVN PRACTICE DIRECTOR TRA@PLATON.NET. An Effective Approach to Master Data Management. March 4 th 2010, Reykjavik WWW.PLATON.

THOMAS RAVN PRACTICE DIRECTOR TRA@PLATON.NET. An Effective Approach to Master Data Management. March 4 th 2010, Reykjavik WWW.PLATON. An Effective Approach to Master Management THOMAS RAVN PRACTICE DIRECTOR TRA@PLATON.NET March 4 th 2010, Reykjavik WWW.PLATON.NET Agenda Introduction to MDM The aspects of an effective MDM program How

More information

Adopting the DMBOK. Mike Beauchamp Member of the TELUS team Enterprise Data World 16 March 2010

Adopting the DMBOK. Mike Beauchamp Member of the TELUS team Enterprise Data World 16 March 2010 Adopting the DMBOK Mike Beauchamp Member of the TELUS team Enterprise Data World 16 March 2010 Agenda The Birth of a DMO at TELUS TELUS DMO Functions DMO Guidance DMBOK functions and TELUS Priorities Adoption

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Chapter 5 Foundations of Business Intelligence: Databases and Information Management 5.1 Copyright 2011 Pearson Education, Inc. Student Learning Objectives How does a relational database organize data,

More information

US Department of Education Federal Student Aid Integration Leadership Support Contractor January 25, 2007

US Department of Education Federal Student Aid Integration Leadership Support Contractor January 25, 2007 US Department of Education Federal Student Aid Integration Leadership Support Contractor January 25, 2007 Task 18 - Enterprise Data Management 18.002 Enterprise Data Management Concept of Operations i

More information

The Data Reservoir as an enabler of differentiating Analytics initiatives

The Data Reservoir as an enabler of differentiating Analytics initiatives Mandy Chessell CBE FREng CEng FBCS Distinguished Engineer, Master Inventor Chief Architect, Solutions The Reservoir as an enabler of differentiating Analytics initiatives 3 rd March 2015 Agenda Changing

More information

W H I T E P A P E R B u s i n e s s I n t e l l i g e n c e S o lutions from the Microsoft and Teradata Partnership

W H I T E P A P E R B u s i n e s s I n t e l l i g e n c e S o lutions from the Microsoft and Teradata Partnership W H I T E P A P E R B u s i n e s s I n t e l l i g e n c e S o lutions from the Microsoft and Teradata Partnership Sponsored by: Microsoft and Teradata Dan Vesset October 2008 Brian McDonough Global Headquarters:

More information

Applied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA

Applied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA Applied Business Intelligence Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA Agenda Business Drivers and Perspectives Technology & Analytical Applications Trends Challenges

More information

JOURNAL OF OBJECT TECHNOLOGY

JOURNAL OF OBJECT TECHNOLOGY JOURNAL OF OBJECT TECHNOLOGY Online at www.jot.fm. Published by ETH Zurich, Chair of Software Engineering JOT, 2008 Vol. 7, No. 8, November-December 2008 What s Your Information Agenda? Mahesh H. Dodani,

More information

NOS for Data Analysis (802) September 2014 V1.3

NOS for Data Analysis (802) September 2014 V1.3 NOS for Data Analysis (802) September 2014 V1.3 NOS Reference ESKITP802301 ESKITP802401 ESKITP802501 ESKITP802601 NOS Title Assist in Delivering Routine Data Analysis Studies Design and Implement Data

More information

IRMAC SAS INFORMATION MANAGEMENT, TRANSFORMING AN ANALYTICS CULTURE. Copyright 2012, SAS Institute Inc. All rights reserved.

IRMAC SAS INFORMATION MANAGEMENT, TRANSFORMING AN ANALYTICS CULTURE. Copyright 2012, SAS Institute Inc. All rights reserved. IRMAC SAS INFORMATION MANAGEMENT, TRANSFORMING AN ANALYTICS CULTURE ABOUT THE PRESENTER Marc has been with SAS for 10 years and leads the information management practice for canada. Marc s area of specialty

More information

What happens when Big Data and Master Data come together?

What happens when Big Data and Master Data come together? What happens when Big Data and Master Data come together? Jeremy Pritchard Master Data Management fgdd 1 What is Master Data? Master data is data that is shared by multiple computer systems. The Information

More information

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success Developing an MDM Strategy Key Components for Success WHITE PAPER Table of Contents Introduction... 2 Process Considerations... 3 Architecture Considerations... 5 Conclusion... 9 About Knowledgent... 10

More information

BIG DATA COURSE 1 DATA QUALITY STRATEGIES - CUSTOMIZED TRAINING OUTLINE. Prepared by:

BIG DATA COURSE 1 DATA QUALITY STRATEGIES - CUSTOMIZED TRAINING OUTLINE. Prepared by: BIG DATA COURSE 1 DATA QUALITY STRATEGIES - CUSTOMIZED TRAINING OUTLINE Cerulium Corporation has provided quality education and consulting expertise for over six years. We offer customized solutions to

More information

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren News and trends in Data Warehouse Automation, Big Data and BI Johan Hendrickx & Dirk Vermeiren Extreme Agility from Source to Analysis DWH Appliances & DWH Automation Typical Architecture 3 What Business

More information

Customer Case Studies on MDM Driving Real Business Value

Customer Case Studies on MDM Driving Real Business Value Customer Case Studies on MDM Driving Real Business Value Dan Gage Oracle Master Data Management Master Data has Domain Specific Requirements CDI (Customer, Supplier, Vendor) PIM (Product, Service) Financial

More information

Enterprise Data Management in an In-Memory World

Enterprise Data Management in an In-Memory World Enterprise Data Management in an In-Memory World Tactics for Loading SAS High-Performance Analytics Server and SAS Visual Analytics WHITE PAPER SAS White Paper Table of Contents Executive Summary.... 1

More information

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya Chapter 6 Basics of Data Integration Fundamentals of Business Analytics Learning Objectives and Learning Outcomes Learning Objectives 1. Concepts of data integration 2. Needs and advantages of using data

More information

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture Apps and data source extensions with APIs Future white label, embed or integrate Power BI Deploy Intelligent

More information

IBM Big Data in Government

IBM Big Data in Government IBM Big in Government Turning big data into smarter decisions Deepak Mohapatra Sr. Consultant Government IBM Software Group dmohapatra@us.ibm.com The Big Paradigm Shift 2 Big Creates A Challenge And an

More information

appmdmtm MASTER DATA MANAGEMENT

appmdmtm MASTER DATA MANAGEMENT appmdmtm MASTER DATA MANAGEMENT Chain-Sys Platform Multiple Domain Hubs ETL with No Programming MASTER DATA MANAGEMENT A Single Source System is designated as the System of Record for Master Data. appmdm

More information

TECHNOLOGY TRANSFER PRESENTS OCTOBER 16 2012 OCTOBER 17 2012 RESIDENZA DI RIPETTA - VIA DI RIPETTA, 231 ROME (ITALY)

TECHNOLOGY TRANSFER PRESENTS OCTOBER 16 2012 OCTOBER 17 2012 RESIDENZA DI RIPETTA - VIA DI RIPETTA, 231 ROME (ITALY) TECHNOLOGY TRANSFER PRESENTS RICK VAN DER LANS Data Virtualization for Agile Business Intelligence Systems New Database Technology for Data Warehousing OCTOBER 16 2012 OCTOBER 17 2012 RESIDENZA DI RIPETTA

More information

What s New with Informatica Data Services & PowerCenter Data Virtualization Edition

What s New with Informatica Data Services & PowerCenter Data Virtualization Edition 1 What s New with Informatica Data Services & PowerCenter Data Virtualization Edition Kevin Brady, Integration Team Lead Bonneville Power Wei Zheng, Product Management Informatica Ash Parikh, Product Marketing

More information

www.ijreat.org Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 28

www.ijreat.org Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 28 Data Warehousing - Essential Element To Support Decision- Making Process In Industries Ashima Bhasin 1, Mr Manoj Kumar 2 1 Computer Science Engineering Department, 2 Associate Professor, CSE Abstract SGT

More information

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server Extending Hyperion BI with the Oracle BI Server Mark Ostroff Sr. BI Solutions Consultant Agenda Hyperion BI versus Hyperion BI with OBI Server Benefits of using Hyperion BI with the

More information

Data Warehouse Overview. Srini Rengarajan

Data Warehouse Overview. Srini Rengarajan Data Warehouse Overview Srini Rengarajan Please mute Your cell! Agenda Data Warehouse Architecture Approaches to build a Data Warehouse Top Down Approach Bottom Up Approach Best Practices Case Example

More information

NEWLY EMERGING BEST PRACTICES FOR BIG DATA

NEWLY EMERGING BEST PRACTICES FOR BIG DATA 2000-2012 Kimball Group. All rights reserved. Page 1 NEWLY EMERGING BEST PRACTICES FOR BIG DATA Ralph Kimball Informatica October 2012 Ralph Kimball Big is Being Monetized Big data is the second era of

More information

Big Data and Trusted Information

Big Data and Trusted Information Dr. Oliver Adamczak Big Data and Trusted Information CAS Single Point of Truth 7. Mai 2012 The Hype Big Data: The next frontier for innovation, competition and productivity McKinsey Global Institute 2012

More information

<Insert Picture Here> Master Data Management

<Insert Picture Here> Master Data Management Master Data Management 김대준, 상무 Master Data Management Team MDM Issues Lack of Enterprise-level data code standard Region / Business Function Lack of data integrity/accuracy Difficulty

More information

COURSE 20463C: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

COURSE 20463C: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER Page 1 of 8 ABOUT THIS COURSE This 5 day course describes how to implement a data warehouse platform to support a BI solution. Students will learn how to create a data warehouse with Microsoft SQL Server

More information

Implementing a Data Warehouse with Microsoft SQL Server

Implementing a Data Warehouse with Microsoft SQL Server Page 1 of 7 Overview This course describes how to implement a data warehouse platform to support a BI solution. Students will learn how to create a data warehouse with Microsoft SQL 2014, implement ETL

More information

Data Governance for Regulated Industries

Data Governance for Regulated Industries Data Governance for Regulated Industries Amir Halfon CTO, Worldwide Financial Service Agenda Components of Data Governance Challenges Solutions and Case Studies Q&A SLIDE: 2 Data Governance Considerations

More information

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Datenverwaltung im Wandel - Building an Enterprise Data Hub with Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees

More information

Disparate Data, Disparate Systems, Disparate User Groups (How to Architect The Enterprise Business Needs) Robert Schork, General Dynamics IT

Disparate Data, Disparate Systems, Disparate User Groups (How to Architect The Enterprise Business Needs) Robert Schork, General Dynamics IT Disparate Data, Disparate Systems, Disparate User Groups (How to Architect The Enterprise Business Needs) Robert Schork, General Dynamics IT April 27, 2011 2011 Waters North American Trading Architecture

More information

iway Roadmap Michael Corcoran Sr. VP Corporate Marketing

iway Roadmap Michael Corcoran Sr. VP Corporate Marketing iway Roadmap Michael Corcoran Sr. VP Corporate Marketing iway 7 Products iway 7 Products iway 7 Products 360 Viewer Remediation Sentinel Portal Golden Record Search and View Omni-Patient Data Exception

More information

Petabyte Scale Data at Facebook. Dhruba Borthakur, Engineer at Facebook, SIGMOD, New York, June 2013

Petabyte Scale Data at Facebook. Dhruba Borthakur, Engineer at Facebook, SIGMOD, New York, June 2013 Petabyte Scale Data at Facebook Dhruba Borthakur, Engineer at Facebook, SIGMOD, New York, June 2013 Agenda 1 Types of Data 2 Data Model and API for Facebook Graph Data 3 SLTP (Semi-OLTP) and Analytics

More information

Class News. Basic Elements of the Data Warehouse" 1/22/13. CSPP 53017: Data Warehousing Winter 2013" Lecture 2" Svetlozar Nestorov" "

Class News. Basic Elements of the Data Warehouse 1/22/13. CSPP 53017: Data Warehousing Winter 2013 Lecture 2 Svetlozar Nestorov CSPP 53017: Data Warehousing Winter 2013 Lecture 2 Svetlozar Nestorov Class News Class web page: http://bit.ly/wtwxv9 Subscribe to the mailing list Homework 1 is out now; due by 1:59am on Tue, Jan 29.

More information

A very short talk about Apache Kylin Business Intelligence meets Big Data. Fabian Wilckens EMEA Solutions Architect

A very short talk about Apache Kylin Business Intelligence meets Big Data. Fabian Wilckens EMEA Solutions Architect A very short talk about Apache Kylin Business Intelligence meets Big Data Fabian Wilckens EMEA Solutions Architect 1 The challenge today 2 Very quickly: OLAP Online Analytical Processing How many beers

More information

The Business in Business Intelligence. Bryan Eargle Database Development and Administration IT Services Division

The Business in Business Intelligence. Bryan Eargle Database Development and Administration IT Services Division The Business in Business Intelligence Bryan Eargle Database Development and Administration IT Services Division Defining Business Intelligence (BI) Agenda Goals Identify data assets Transform data and

More information

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here Data Virtualization for Agile Business Intelligence Systems and Virtual MDM To View This Presentation as a Video Click Here Agenda Data Virtualization New Capabilities New Challenges in Data Integration

More information