Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006



Similar documents
Enterprise Solutions. Data Warehouse & Business Intelligence Chapter-8

IST722 Data Warehousing

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

Fluency With Information Technology CSE100/IMT100

Lection 3-4 WAREHOUSING

Data Mart/Warehouse: Progress and Vision

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

Data Warehousing: A Technology Review and Update Vernon Hoffner, Ph.D., CCP EntreSoft Resouces, Inc.

Introduction to Data Warehousing. Ms Swapnil Shrivastava

Data Warehousing Systems: Foundations and Architectures

Data Warehousing and OLAP Technology for Knowledge Discovery

DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP ( 28

Getting it Right: How to Find the Right BI Package for the Right Situation Norma Waugh. RMOUG Training Days February 15-17, 2011

Turkish Journal of Engineering, Science and Technology

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

HYPERION MASTER DATA MANAGEMENT SOLUTIONS FOR IT

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies

Part 22. Data Warehousing

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

Analance Data Integration Technical Whitepaper

The Oracle Enterprise Data Warehouse (EDW)

MDM and Data Warehousing Complement Each Other

Analance Data Integration Technical Whitepaper

By Makesh Kannaiyan 8/27/2011 1

Data Warehousing. Overview, Terminology, and Research Issues. Joachim Hammer. Joachim Hammer

Sterling Business Intelligence

OLAP Theory-English version

Oracle Business Intelligence Suite Enterprise Edition

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server

A Service-oriented Architecture for Business Intelligence

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of

Technology-Driven Demand and e- Customer Relationship Management e-crm

Application Of Business Intelligence In Agriculture 2020 System to Improve Efficiency And Support Decision Making in Investments.

A Model-based Software Architecture for XML Data and Metadata Integration in Data Warehouse Systems

Data warehouse and Business Intelligence Collateral

Week 13: Data Warehousing. Warehousing

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers

Data W a Ware r house house and and OLAP Week 5 1

Business Intelligence Solutions. Cognos BI 8. by Adis Terzić

DATA MINING AND WAREHOUSING CONCEPTS

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

Oracle BI Application: Demonstrating the Functionality & Ease of use. Geoffrey Francis Naailah Gora

Oracle Business Intelligence 11g Business Dashboard Management

Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina

Business Intelligence at the University of Minnesota

Concepts of Database Management Seventh Edition. Chapter 9 Database Management Approaches

CHAPTER SIX DATA. Business Intelligence The McGraw-Hill Companies, All Rights Reserved

Designing a Dimensional Model

Overview. DW Source Integration, Tools, and Architecture. End User Applications (EUA) EUA Concepts. DW Front End Tools. Source Integration

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

Week 3 lecture slides

Business Intelligence and Healthcare

14. Data Warehousing & Data Mining

Microsoft Implementing Data Models and Reports with Microsoft SQL Server

Business Intelligence, Analytics & Reporting: Glossary of Terms

When to consider OLAP?

SAS BI Course Content; Introduction to DWH / BI Concepts

Microsoft Services Exceed your business with Microsoft SharePoint Server 2010

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Design of Electricity & Energy Review Dashboard Using Business Intelligence and Data Warehouse

Data Warehousing and Data Mining in Business Applications

DATA WAREHOUSING AND OLAP TECHNOLOGY

Data Warehousing and Data Mining

OLAP. Business Intelligence OLAP definition & application Multidimensional data representation

Anwendersoftware Anwendungssoftwares a. Data-Warehouse-, Data-Mining- and OLAP-Technologies. Online Analytic Processing

TRANSFORMING YOUR BUSINESS

Applied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA

Data Warehousing and Data Mining Introduction

HROUG. The future of Business Intelligence & Enterprise Performance Management. Rovinj October 18, 2007

BENEFITS OF AUTOMATING DATA WAREHOUSING

Service Oriented Data Management

The Benefits of Data Modeling in Data Warehousing

The Role of the BI Competency Center in Maximizing Organizational Performance

Data Warehouse Design

PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions. A Technical Whitepaper from Sybase, Inc.

SQL Server 2012 Business Intelligence Boot Camp

European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project

Data Mining for Successful Healthcare Organizations

BUSINESS ANALYTICS AND DATA VISUALIZATION. ITM-761 Business Intelligence ดร. สล ล บ ญพราหมณ

Understanding Data Warehousing. [by Alex Kriegel]

VisionWaves : Delivering next generation BI by combining BI and PM in an Intelligent Performance Management Framework

Introduction to Datawarehousing

A Knowledge Management Framework Using Business Intelligence Solutions

Building an Advancement Data Warehouse. created every year according to a study. Data Facilitates all Advancement Activities

INTERACTIVE DECISION SUPPORT SYSTEM BASED ON ANALYSIS AND SYNTHESIS OF DATA - DATA WAREHOUSE

LEARNING SOLUTIONS website milner.com/learning phone

IBM Cognos Performance Management Solutions for Oracle

Oracle Business Intelligence Foundation Suite 11g Essentials Exam Study Guide

Module 1: Introduction to Data Warehousing and OLAP

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage

Transcription:

Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006

What is a Data Warehouse? A data warehouse is a subject-oriented, integrated, time-varying, non-volatile collection of data in support of the management's decision-making process." ---Bill Inmon

What is a Data Warehouse? A Practitioners Viewpoint EDW UC Berkeley A data warehouse is simply a single, complete, and consistent store of data obtained from a variety of sources and made available to end users in a way they can understand and use it in a business context. -- Barry Devlin, IBM Consultant

Data Warehouse V s V s OLTP Enterprise Data Warehouse Integrated Data Current/Historical Data Organized by Subject Non-Volatile Data Denormalized Data Descriptive Data Detailed/Summarized Data Knowledge user (Manager) Traditional Database Application-specific Data Current Data Organized for Data Entry Updated Data Normalized Data Encoded Data Raw Data Clerical User

What does a data warehouse do? Integrate divergent information from various systems which enable users to quickly produce powerful ad-hoc queries and perform complex analysis Create an infrastructure for reusing the data in numerous ways Create an open systems environment to make useful information easily accessible to authorized users Help managers make informed decisions

What are the users saying Data should be integrated across the enterprise Summary data had a real value to the organization Historical data held the key to understanding data over time What-if capabilities are required

Heterogeneous Information Sources Heterogeneities are everywhere Administrative Systems Scientific Databases World Wide Web Different interfaces Digital Libraries Different data representations Duplicate and inconsistent information Vertical fragmentation of informational systems (vertical stove pipes) Result of application (user)-driven development of operational systems

Goal: Unified Access to Data Integration System World Wide Web Digital Libraries Scientific Databases Personal Databases Collects and combines information Provides integrated view, uniform user interface Supports sharing

Key Concepts Data Warehouse vs. Data Mart Data Mart Single subject area Data Warehouse Integrated Data Marts Star Schema Dimensions Hierarchies Descriptive Attributes Fact Tables Metadata

Star Schema Architecture Dimension tables Textual descriptions of the dimensions of data Represents the data that you slice and dice Fact tables Place where numerical measurements of business are stored

Star Schema Data Model

Reporting Business intelligence tools: These are software applications that simplify the process of development and production of business reports based on data warehouse data. Executive information systems (known more widely as Dashboard (business): These are software applications that are used to display complex business metrics and information in a graphical way to allow rapid understanding. OLAP Tools: OLAP tools form data into logical multi-dimensional structures and allow users to select which dimensions to view data by. Data Mining: Data mining tools are software that allow users to perform detailed mathematical and statistical calculations on detailed data warehouse data to detect trends, identify patterns and analyze data.

Strengths of OLAP It is a powerful visualization paradigm. It provides fast, interactive response times. It is good for analyzing time series. It can be useful to find some clusters and outliners.

Typical OLAP Query Write a multi-table join to compare Budget for each Org YTD this year vs. last year. Repeat the above process to find the top 5 Donors to Funds. Repeat the above process to find the Funds of a Org to new vs. existing Donor.

Nature of OLAP Analysis Aggregation Comparison Budget vs. Expenses Ranking Access to detailed and aggregate data Complex criteria specification Visualization

Reporting EDW UC Berkeley

INTERACTIVE REPORTING

ANALYTICS Gain rich insight into your business.

Dashboard EDW UC Berkeley

Planning & Modeling

Scorecard EDW UC Berkeley

EDW Today EDW UC Berkeley BAIRSII Financial (Operational reporting) Actuals Tempbudg Permbudg Payroll BIS Financial (Analytical reporting) Actuals Tempbudg Permbudg Payroll HRMS (Operational reporting) Workforce Administration HRMS (Analytical reporting) Workforce Summary Foundation Student Pilot (In development)

Technical Environment Oracle 9i ETL Tool Informatica Reporting Tool Hyperion Brio Data Modeling Tool - ERwin

Enterprise Data Warehouse Report Request Data request, returned, formatted and presented

Informatica PowerCenter 7 Architecture

ETL (Extract, Transform, and Load) Data Extract Get Data from source Data Transformation Data Cleansing - Data Quality Assurance Data Scrubbing - Removing errors and inconsistencies Processing Calculations Applying Business Rules Changing Data Types Making the Data More Readable Replacing Codes with Actual Values Summarizing the Data Data Load Load data into Warehouse

Data Quality The reality Tempting to think that all that is there to creating a data warehouse is extracting operational data and entering into a data warehouse. Warehouse data comes from disparate questionable sources. Legacy systems no longer documented. Outside sources with questionable quality procedures. Production systems with no built in integrity checks and no integration.

EDW moving forward Hub and Spoke Architecture Streamline the existing process Standardize ETL processes Standardize Table/View design s Standardize Naming conventions Address meta data requirements

Data Warehouse to Data Mart

Integrated Data Subject Area Workforce analysis Financial analysis of operations. Financial analysis research and projects Curriculum and enrollment analysis Student pipeline analysis (pre-registration registration to alumni) Graduate support analysis Private support analysis (donors and gifts) Purchasing analysis Recharge analysis Classroom and facilities utilization analysis Absenteeism analysis Enterprise risk assessment

Warehouse Success Criteria Cross-function, campus-wide support Consistency and integration across subject areas. Integration mechanism should be explicit. Design should embody subject-matter expertise. Standardized data definitions. Security; access control Access limited by role and context. Controlled by data owners. Support for operational users Standard reports, customizable as needed. Good documentation at the point of use. Support flexible access for decision-makers Slice and dice many different ways. Tool support for this. Documentation of available data. Enough history for trend analysis. Reconcile alternative views of same information; e.g., as-reported vs. final.

Metadata Data about Data Business Meta Data Technical Meta Data Operational Meta Data Project Meta Data

Some of the challenges Governance and Prioritization Decentralized reporting systems Meta Data management Resource availability Central vs. local reporting needs Funding BAI vs. Campus enterprise