Fluency With Information Technology CSE100/IMT100



Similar documents
OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

Part 22. Data Warehousing

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

B.Sc (Computer Science) Database Management Systems UNIT-V

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

Data Warehousing. Yeow Wei Choong Anne Laurent

14. Data Warehousing & Data Mining

Lection 3-4 WAREHOUSING

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP ( 28

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

Data Warehousing and Data Mining in Business Applications

Data Warehousing and OLAP Technology for Knowledge Discovery

DATA WAREHOUSING AND OLAP TECHNOLOGY

IST722 Data Warehousing

Foundations of Business Intelligence: Databases and Information Management

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies

Foundations of Business Intelligence: Databases and Information Management

Turkish Journal of Engineering, Science and Technology

DATA WAREHOUSING APPLICATIONS: AN ANALYTICAL TOOL FOR DECISION SUPPORT SYSTEM

DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS

OLAP Theory-English version

PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions. A Technical Whitepaper from Sybase, Inc.

Course MIS. Foundations of Business Intelligence

Data Mining for Successful Healthcare Organizations

(Week 10) A04. Information System for CRM. Electronic Commerce Marketing

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

Data Mart/Warehouse: Progress and Vision

Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006

An Overview of Database management System, Data warehousing and Data Mining

INFO 321, Database Systems, Semester

Mario Guarracino. Data warehousing

Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT Learning Objectives

Module Title: Business Intelligence

When to consider OLAP?

Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of

IT0457 Data Warehousing. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

Data Mining + Business Intelligence. Integration, Design and Implementation

2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000

Importance or the Role of Data Warehousing and Data Mining in Business Applications

OLAP. Business Intelligence OLAP definition & application Multidimensional data representation

Technology-Driven Demand and e- Customer Relationship Management e-crm

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data

Database Marketing, Business Intelligence and Knowledge Discovery

Lecture Data Warehouse Systems

DSS based on Data Warehouse

CHAPTER 4 Data Warehouse Architecture

Business Intelligence Solutions. Cognos BI 8. by Adis Terzić

SQL Server 2005 Features Comparison

Data Warehousing, OLAP, and Data Mining

Framework for Data warehouse architectural components

Designing a Dimensional Model

DATA WAREHOUSE E KNOWLEDGE DISCOVERY

Data Warehousing Systems: Foundations and Architectures

Data Warehouse: Introduction

Delivering Business Intelligence With Microsoft SQL Server 2005 or 2008 HDT922 Five Days

Foundations of Business Intelligence: Databases and Information Management

Data Warehousing: Data Models and OLAP operations. By Kishore Jaladi

Data Warehouses & OLAP

Data Warehouse Overview. Srini Rengarajan

Building a Data Warehouse

Data Warehousing and Data Mining

Chapter 6 - Enhancing Business Intelligence Using Information Systems

Data Warehousing Concepts

LEARNING SOLUTIONS website milner.com/learning phone

INTERACTIVE DECISION SUPPORT SYSTEM BASED ON ANALYSIS AND SYNTHESIS OF DATA - DATA WAREHOUSE

Data Warehouses and Business Intelligence ITP 487 (3 Units) Fall Objective

Distance Learning and Examining Systems

Business Intelligence, Analytics & Reporting: Glossary of Terms

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2

BENEFITS OF AUTOMATING DATA WAREHOUSING

1. What are the uses of statistics in data mining? Statistics is used to Estimate the complexity of a data mining problem. Suggest which data mining

SQL Server 2012 Business Intelligence Boot Camp

OLAP & DATA MINING CS561-SPRING 2012 WPI, MOHAMED ELTABAKH

Week 13: Data Warehousing. Warehousing

BUSINESS ANALYTICS AND DATA VISUALIZATION. ITM-761 Business Intelligence ดร. สล ล บ ญพราหมณ

Business Intelligence: Using Data for More Than Analytics

Data Mining: Concepts and Techniques. Jiawei Han. Micheline Kamber. Simon Fräser University К MORGAN KAUFMANN PUBLISHERS. AN IMPRINT OF Elsevier

Introduction to Data Mining

A Review of Data Mining Techniques

Data Warehousing. Overview, Terminology, and Research Issues. Joachim Hammer. Joachim Hammer

Methodology Framework for Analysis and Design of Business Intelligence Systems

Data Warehousing: A Technology Review and Update Vernon Hoffner, Ph.D., CCP EntreSoft Resouces, Inc.

Foundations of Business Intelligence: Databases and Information Management

BUILDING OLAP TOOLS OVER LARGE DATABASES

3/17/2009. Knowledge Management BIKM eclassifier Integrated BIKM Tools

IMPLEMENTATION OF DATA WAREHOUSE SAP BW IN THE PRODUCTION COMPANY. Maria Kowal, Galina Setlak

Decision Support and Business Intelligence Systems. Chapter 1: Decision Support Systems and Business Intelligence

A Review of Data Warehousing and Business Intelligence in different perspective

Chapter 6. Foundations of Business Intelligence: Databases and Information Management

Week 3 lecture slides

Data Warehouse & Business Intelligence Glossary

The Role of Data Warehousing Concept for Improved Organizations Performance and Decision Making

Federico Rajola. Customer Relationship. Management in the. Financial Industry. Organizational Processes and. Technology Innovation.

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

Republic Polytechnic School of Information and Communications Technology C355 Business Intelligence. Module Curriculum

European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project

Transcription:

Fluency With Information Technology CSE100/IMT100 ),7 Larry Snyder & Mel Oyler, Instructors Ariel Kemp, Isaac Kunen, Gerome Miklau & Sean Squires, Teaching Assistants University of Washington, Autumn 1999

Enterprise Database Systems Learning Objectives Understand needs and concepts of decision support systems Understand concepts and issues of the Data Warehouse Understand concepts of On-Line Analytical Processing (OLAP) Understand concepts of data mining

Data Warehousing Overview The Need for Data Analysis Decision Support Systems The Data Warehouse On-Line Analytical Processing (OLAP) Data Mining

The Need for Data Analysis Executives compete at decision making Constant pressure from external and internal forces requires prompt tactical and strategic decisions. The decision-making cycle time is reduced, while problems are increasingly complex with a growing number of internal and external variables. Managers need support systems for facilitating quick decision making in a complex environment. Decision support systems (DSS).

Decision Support Systems (DSS) Decision Support is a methodology (or a series of methodologies) designed to extract information from data and to use such information as a basis for decision making. A decision support system (DSS) is an arrangement of computerized tools used to assist managerial decision making within a business. A DSS usually requires extensive data massaging to produce information. The DSS is used at all levels within an organization and is often tailored to focus on specific business area or problems. The DSS is interactive and provides ad hoc query tools to retrieve data and to display data in different formats.

Decision Support Systems Four Components of a DSS The data store component is a basically a DSS database. The data extraction and filtering component is used to extract and validate the data taken from the operational database and the external data sources. The end user query tool is used by the data analyst to create the queries that access the database. The end user presentation tool is used by the data analyst to organize and present the data.

Main Components of a DSS

Decision Support Systems Three Main Areas in Which DSS Data Differ from Operational Data Time span Operational data represent current (atomic) transactions. DSS data tend to cover a longer time frame. Granularity Operational data represent specific transactions that occur at a given time. DSS data must be presented at different levels of aggregation. Dimensionality Operational data focuses on representing atomic transactions. DSS data can be analyzed from multiple dimensions.

Decision Support Systems End-User Analytical Interface The DSS DBMS must support advanced data modeling and data presentation tools, data analysis tools, and query generation and optimization components. The end user analytical interface is one of the most critical components. Database Size Requirements The DBMS must be capable of supporting very large databases (VLDB).

The Data Warehouse The Data Warehouse is an integrated, subjectoriented, time-variant, non-volatile database that provides support for decision making. Integrated The Data Warehouse is a centralized, consolidated database that integrates data retrieved from the entire organization. Subject-Oriented The Data Warehouse data is arranged and optimized to provide answers to questions coming from diverse functional areas within a company.

The Data Warehouse Time Variant The Warehouse data represent the flow of data through time. It can even contain projected data. Non-Volatile Once data enter the Data Warehouse, they are never removed. The Data Warehouse is always growing.

Creating a Data Warehouse

The Data Mart Data Mart A Data Mart is a small, single-subject Data Warehouse subset that provides decision support to a small group of people. Data Marts can serve as a test vehicle for companies exploring the potential benefits of Data Warehouses. Data Marts address local or departmental problems, while a Data Warehouse involves a company-wide effort to support decision making at all levels in the organization.

The Data Warehouse Twelve Rules That Define a Data Warehouse 1. The Data Warehouse and operational environments are separated. 2. The Data Warehouse data are integrated. 3. The Data Warehouse contains historical data over a long time horizon. 4. The Data Warehouse data are snapshot data captured at a given point in time. 5. The Data Warehouse data are subject-oriented. 6. The Data Warehouse data are mainly read-only with periodic batch updates from operational data. No online updates are allowed. 7. The Data Warehouse development life cycle differs from classical systems development. The Data Warehouse development is data driven; the classical approach is process driven.

The Data Warehouse 8. The Data Warehouse contains data with several levels of detail; current detail data, old detail data, lightly summarized, and highly summarized data. 9. The Data Warehouse environment is characterized by read-only transactions to very large data sets. The operational environment is characterized by numerous update transactions to a few data entities at the time. 10. The Data Warehouse environment has a system that traces data resources, transformation, and storage. 11. The Data Warehouse s metadata are a critical component of this environment. The metadata identify and define all data elements. The metadata provide the source, transformation, integration, storage, usage, relationships, and history of each data element. 12. The Data Warehouse contains a charge-back mechanism for resource usage that enforces optimal use of the data by end users.

On-Line Analytical Processing On-Line Analytical Processing (OLAP) is an advanced data analysis environment that supports decision making, business modeling, and operations research activities. Four Main Characteristics of OLAP Use multidimensional data analysis techniques. Provide advanced database support. Provide easy-to-use end user interfaces. Support client/server architecture.

On-Line Analytical Processing Additional Functions of Multidimensional Data Analysis Techniques Advanced data presentation functions Advanced data aggregation, consolidation, and classification functions Customer segment analysis on ecommerce web sites Advanced computational functions Advanced data modeling functions

On-Line Analytical Processing (OLAP) OLAP Architecture Three Main Modules OLAP Graphical User Interface (GUI) OLAP Analytical Processing Logic OLAP Data Processing Logic OLAP systems are designed to use both operational and Data Warehouse data.

OLAP Client/Server Architecture

OLAP Server Arrangement

OLAP Server with Local Data Marts

Star Schema Dimensions Dimensions are qualifying characteristics that provide additional perspectives to a given fact. Dimensions are stored in dimension tables.

A Simple Star Schema

A Three-Dimensional View of Sales

Slice and Dice View of Sales

Star Schema Attribute Hierarchies Attributes within dimensions can be ordered in a well-defined attribute hierarchy. The attribute hierarchy provides a top-down data organization that is used for two main purposes: Aggregation. Drill-down/roll-up data analysis.

A Location Attribute Hierarchy

Attribute Hierarchies in Multidimensional Analysis

Star Schema Performance-Improving Techniques Normalization of dimensional tables Multiple fact tables representing different aggregation levels Denormalization of fact tables Table partitioning and replication

Data Warehouse Implementation The Data Warehouse as an Active Decision Support Network A Data Warehouse is a dynamic support framework. Implementation of Data Warehouse is part of a complete databased-system-development infrastructure for companywide decision support. Its design and implementation must be examined in the light of the entire infrastructure.

Data Mining In contrast to the traditional (reactive) DSS tools, the data mining premise is proactive. Data mining tools automatically search the data for anomalies and possible relationships, thereby identifying problems that have not yet been identified by the end user. Data mining tools -- based on algorithms that form the building blocks for artificial intelligence, neural networks, inductive rules, and predicate logic -- initiate analyses to create knowledge.

Data Mining Four Phases of Data Mining 1. Data Preparation Identify and cleanse data sets. Data Warehouse is usually used for data mining operations. 2. Data Analysis and Classification Identify common data characteristics or patterns using Data groupings, classifications, clusters, or sequences. Data dependencies, links, or relationships. Data patterns, trends, and deviations.

Data Mining 3. Knowledge Acquisition Select the appropriate modeling or knowledge acquisition algorithms. Examples: neural networks, decision trees, rules induction, genetic algorithms, classification and regression tree, memory-based reasoning, or nearest neighbor and data visualization). 4. Prognosis Predict future behavior and forecast business outcomes using the data mining findings.

Data Mining