Part 22. Data Warehousing



Similar documents
Fluency With Information Technology CSE100/IMT100

IST722 Data Warehousing

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

14. Data Warehousing & Data Mining

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

Data Warehousing and OLAP Technology for Knowledge Discovery

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP ( 28

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES

DATA WAREHOUSING - OLAP

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

Data Warehousing. Overview, Terminology, and Research Issues. Joachim Hammer. Joachim Hammer

Data Warehousing: Data Models and OLAP operations. By Kishore Jaladi

Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina

DATA WAREHOUSING AND OLAP TECHNOLOGY

SAS BI Course Content; Introduction to DWH / BI Concepts

Turkish Journal of Engineering, Science and Technology

Week 13: Data Warehousing. Warehousing

INFO 321, Database Systems, Semester

Introduction to Data Warehousing. Ms Swapnil Shrivastava

A Critical Review of Data Warehouse

Data Warehousing Systems: Foundations and Architectures

OLAP & DATA MINING CS561-SPRING 2012 WPI, MOHAMED ELTABAKH

Data W a Ware r house house and and OLAP Week 5 1

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of

Data Warehouse: Introduction

Lection 3-4 WAREHOUSING

2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000

Hybrid Support Systems: a Business Intelligence Approach

DATA WAREHOUSING APPLICATIONS: AN ANALYTICAL TOOL FOR DECISION SUPPORT SYSTEM

Business Intelligence Solutions. Cognos BI 8. by Adis Terzić

Week 3 lecture slides

DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies

DATA CUBES E Jayant Haritsa Computer Science and Automation Indian Institute of Science. JAN 2014 Slide 1 DATA CUBES

Data Warehousing and OLAP

CHAPTER 4 Data Warehouse Architecture

Data Warehousing, OLAP, and Data Mining

This tutorial will help computer science graduates to understand the basic-toadvanced concepts related to data warehousing.

Outline. Data Warehousing. What is a Warehouse? What is a Warehouse?

A Design and implementation of a data warehouse for research administration universities

Data Warehousing. Outline. From OLTP to the Data Warehouse. Overview of data warehousing Dimensional Modeling Online Analytical Processing

Data Warehousing and Online Analytical Processing

B.Sc (Computer Science) Database Management Systems UNIT-V

CS2032 Data warehousing and Data Mining Unit II Page 1

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

CHAPTER 3. Data Warehouses and OLAP

Overview of Data Warehousing and OLAP

Technology-Driven Demand and e- Customer Relationship Management e-crm

M Designing and Implementing OLAP Solutions Using Microsoft SQL Server Day Course

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Data Warehousing and Data Mining

OLAP and Data Warehousing! Introduction!

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing

IT0457 Data Warehousing. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

Chapter 3, Data Warehouse and OLAP Operations

Data Warehousing: A Technology Review and Update Vernon Hoffner, Ph.D., CCP EntreSoft Resouces, Inc.

PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions. A Technical Whitepaper from Sybase, Inc.

Data Warehouse Database Design Student Guide

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

BUSINESS ANALYTICS AND DATA VISUALIZATION. ITM-761 Business Intelligence ดร. สล ล บ ญพราหมณ

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

Data Warehousing Concepts

When to consider OLAP?

Data W a Ware r house house and and OLAP II Week 6 1

Data Warehousing and Data Mining in Business Applications

Overview. DW Source Integration, Tools, and Architecture. End User Applications (EUA) EUA Concepts. DW Front End Tools. Source Integration

The Role of Data Warehousing Concept for Improved Organizations Performance and Decision Making

The Design and the Implementation of an HEALTH CARE STATISTICS DATA WAREHOUSE Dr. Sreèko Natek, assistant professor, Nova Vizija,

Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006

Data Warehousing and Decision Support. Introduction. Three Complementary Trends. Chapter 23, Part A

What is Management Reporting from a Data Warehouse and What Does It Have to Do with Institutional Research?

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2009 Lecture 15 - Data Warehousing: Cubes

Overview. Data Warehousing and Decision Support. Introduction. Three Complementary Trends. Data Warehousing. An Example: The Store (e.g.

Monitoring Genebanks using Datamarts based in an Open Source Tool

Business Intelligence, Analytics & Reporting: Glossary of Terms

Data Warehousing (DW) Online Analytical Processing (OLAP) Data Mining

Data warehousing. Han, J. and M. Kamber. Data Mining: Concepts and Techniques Morgan Kaufmann.

Database Applications. Advanced Querying. Transaction Processing. Transaction Processing. Data Warehouse. Decision Support. Transaction processing

Data Warehousing, Data Mining, OLAP and OLTP Technologies Are Essential Elements to Support Decision-Making Process in Industries

Data Warehousing Fundamentals Student Guide

Understanding Data Warehousing. [by Alex Kriegel]

Dimensional Modeling for Data Warehouse

Presented by: Jose Chinchilla, MCITP


Data Mart/Warehouse: Progress and Vision

Transcription:

Part 22 Data Warehousing

The Decision Support System (DSS) Tools to assist decision-making Used at all levels in the organization Sometimes focused on a single area Sometimes focused on a single problem Interactive Needs ad hoc query tools Copyright 1971-2002 Thomas P. Sturm Data Warehousing Part 22, Page 2

Components of a DSS Data Store Business Data (internal and external) Business Model Data (generated from algorithms or "mined") Data Extraction Data Filtering End-user Query Tool (ad hoc query tool) End-user Presentation Tool Copyright 1971-2002 Thomas P. Sturm Data Warehousing Part 22, Page 3

Data Collection Conversion From manual records From machine-readable records Via CORBA Data Purification In a large database anything that can occur will Data must not contain anomalies Data could be in read and append only format Copyright 1971-2002 Thomas P. Sturm Data Warehousing Part 22, Page 4

Characteristics of DSS Data Time span Not just the current data, but covers a long time span Granularity Not every detail of every transaction (necessarily), but totals and summaries and derived data Dimensionality Data relationships in as many ways as might be relevant to the application area or problem Copyright 1971-2002 Thomas P. Sturm Data Warehousing Part 22, Page 5

Differences Between Operational Data and DSS Data Attribute Operational Data DSS Data Alternate Name On-line transaction processing On-line analytical processing Acronym OLTP OLAP Characteristic Operational processing Informational processing Orientation Transaction Analysis Timeframe Current Historical Update On-line Batch Level of Detail Low Summarized Normalization Full Not required Transactions Updates Queries Query scope Narrow Broad Data volume Gigabyte Terabyte Users Clerks, database professionals Knowledge workers Copyright 1971-2002 Thomas P. Sturm Data Warehousing Part 22, Page 6

Data Warehouse Integrated Centralized Consolidated Standardized Subject-Oriented Organized by topic Summarized by topic Multiple subjects of interest Historical or Time-Variant Time is a variable Multiple values with different time stamps Non-Volatile Data added, but never removed Always growing Batch update via appending Summaries may change Copyright 1971-2002 Thomas P. Sturm Data Warehousing Part 22, Page 7

Building the Data Warehouse Data Extraction and Collection From existing operational data and external sources Data Filtering and Reduction To remove extraneous fields (such as SSN) To collect a sample when not all instances needed Data Cleaning and Scrubbing Consistent units of measure Consistent intervals of time Consistent accounting methods Consistent definitions Data Transformation and Coding Code to numerical from categorical Categorize numerical ranges Everything should ideally reduce to numbers Aggregation and Summarization Generate subtotals and totals Generate across various dimensions Copyright 1971-2002 Thomas P. Sturm Data Warehousing Part 22, Page 8

Twelve Rules of Data Warehousing (Inmon and Kelley) 1. Data Warehouse separate from operational data 2. Data Warehouse is integrated 3. Data Warehouse contains historical data 4. Data Warehouse time components are a series of snapshots 5. Data Warehouse is subject-oriented 6. Data Warehouse is read-only except for periodic batch updates 7. Data Warehouse development is data driven 8. Data Warehouse contains multiple levels of detail, from operational detail to highly summarized 9. Data Warehouse transactions are read-only against large data sets 10. Data Warehouse traces data from source through transformations 11. Data Warehouse contains metadata 12. Data Warehouse has charge-back Copyright 1971-2002 Thomas P. Sturm Data Warehousing Part 22, Page 9

Data Warehouse Architectures Multidimensional Data Model Data cube Star Snowflake Constellation Implementation ROLAP (relational) MOLAP (multidimensional) HOLAP (hybrid) Copyright 1971-2002 Thomas P. Sturm Data Warehousing Part 22, Page 10