OLAP. Business Intelligence OLAP definition & application Multidimensional data representation



Similar documents
OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

The Benefits of Data Modeling in Business Intelligence

DATA WAREHOUSING - OLAP

Learning Objectives. Definition of OLAP Data cubes OLAP operations MDX OLAP servers

How To Model Data For Business Intelligence (Bi)

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

Data W a Ware r house house and and OLAP II Week 6 1

CS2032 Data warehousing and Data Mining Unit II Page 1

DATA WAREHOUSING AND OLAP TECHNOLOGY

BUSINESS ANALYTICS AND DATA VISUALIZATION. ITM-761 Business Intelligence ดร. สล ล บ ญพราหมณ

Anwendersoftware Anwendungssoftwares a. Data-Warehouse-, Data-Mining- and OLAP-Technologies. Online Analytic Processing

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing

14. Data Warehousing & Data Mining

Data Warehousing. Outline. From OLTP to the Data Warehouse. Overview of data warehousing Dimensional Modeling Online Analytical Processing

Analytics with Excel and ARQUERY for Oracle OLAP

Fluency With Information Technology CSE100/IMT100

Data Warehouses & OLAP

FEATURES TO CONSIDER IN A DATA WAREHOUSING SYSTEM

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2009 Lecture 15 - Data Warehousing: Cubes

IAF Business Intelligence Solutions Make the Most of Your Business Intelligence. White Paper November 2002

Basics of Dimensional Modeling

2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000

DATA WAREHOUSE E KNOWLEDGE DISCOVERY

Why Business Intelligence

Week 3 lecture slides

OLAP Theory-English version

Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining

Unit -3. Learning Objective. Demand for Online analytical processing Major features and functions OLAP models and implementation considerations

A Brief Tutorial on Database Queries, Data Mining, and OLAP

Implementing Data Models and Reports with Microsoft SQL Server

Business Intelligence & Product Analytics

Business Intelligence

OLAP and Data Warehousing! Introduction!

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

Microsoft Data Warehouse in Depth

VisionWaves : Delivering next generation BI by combining BI and PM in an Intelligent Performance Management Framework

Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina

8. Business Intelligence Reference Architectures and Patterns

COM CO P 5318 Da t Da a t Explora Explor t a ion and Analysis y Chapte Chapt r e 3

Data Warehouse Snowflake Design and Performance Considerations in Business Analytics

Data Warehousing & OLAP

BUILDING OLAP TOOLS OVER LARGE DATABASES

OLAP & DATA MINING CS561-SPRING 2012 WPI, MOHAMED ELTABAKH

B.Sc (Computer Science) Database Management Systems UNIT-V

DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS

M Designing and Implementing OLAP Solutions Using Microsoft SQL Server Day Course

Data Warehouse: Introduction

MS 50511A The Microsoft Business Intelligence 2010 Stack

Business Intelligence, Analytics & Reporting: Glossary of Terms

Data Warehousing and Decision Support. Introduction. Three Complementary Trends. Chapter 23, Part A

Data Warehousing and OLAP Technology for Knowledge Discovery

<Insert Picture Here> Enhancing the Performance and Analytic Content of the Data Warehouse Using Oracle OLAP Option

Building Cubes and Analyzing Data using Oracle OLAP 11g

Part 22. Data Warehousing

CHAPTER 4 Data Warehouse Architecture

Designing a Dimensional Model

Essbase Calculations: A Visual Approach

Decision Support and Business Intelligence Systems. Chapter 1: Decision Support Systems and Business Intelligence

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES

New Approach of Computing Data Cubes in Data Warehousing

Justice Data Warehousing and Court Business Intelligence. Technical Introduction. Harris County Courts

OLAP Systems and Multidimensional Expressions I

SQL Server 2012 End-to-End Business Intelligence Workshop

End to End Microsoft BI with SQL 2008 R2 and SharePoint 2010

UNIT-3 OLAP in Data Warehouse

Database Applications. Advanced Querying. Transaction Processing. Transaction Processing. Data Warehouse. Decision Support. Transaction processing

3/17/2009. Knowledge Management BIKM eclassifier Integrated BIKM Tools

Module 1: Introduction to Data Warehousing and OLAP

Turning your Warehouse Data into Business Intelligence: Reporting Trends and Visibility Michael Armanious; Vice President Sales and Marketing Datex,

University of Gaziantep, Department of Business Administration

Multi-dimensional index structures Part I: motivation

With business intelligence, we create a learning organization that adapts quickly to market changes and stays one step ahead of the competition.

The Microsoft Business Intelligence 2010 Stack Course 50511A; 5 Days, Instructor-led

Adobe Insight, powered by Omniture

IST722 Data Warehousing

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP ( 28

Web Log Data Sparsity Analysis and Performance Evaluation for OLAP

A Technical Review on On-Line Analytical Processing (OLAP)

Overview of Data Warehousing and OLAP

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Data Warehousing and OLAP

A model for Business Intelligence Systems Development

Data Warehousing and Decision Support. Torben Bach Pedersen Department of Computer Science Aalborg University

Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006

CASE PROJECTS IN DATA WAREHOUSING AND DATA MINING

Data Warehouse design

LEARNING SOLUTIONS website milner.com/learning phone

Advanced Data Management Technologies

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Delivering Business Intelligence With Microsoft SQL Server 2005 or 2008 HDT922 Five Days

Transcription:

OLAP Business Intelligence OLAP definition & application Multidimensional data representation 1

Business Intelligence Accompanying the growth in data warehousing is an ever-increasing demand by users for more powerful access tools that provide advanced analytical capabilities. There are two main types of access tools available to meet this demand, namely Online Analytical Processing (OLAP) and data mining. 2

Business Intelligence OLAP and Data Mining differ in what they offer the user and because of this they are complementary technologies. An environment that includes a data warehouse (or more commonly one or more data marts) together with tools such as OLAP and /or data mining are collectively referred to as Business Intelligence (BI) technologies. BI = DW + OLAP BI = DW + DM 3

OLAP definition & application The dynamic synthesis, analysis, and consolidation of large volumes of multidimensional data, Codd (1993). Describes a technology that uses a multi-dimensional view of aggregate data to provide quick access to strategic information for the purposes of advanced analysis. 4

OLAP definition & application Can easily answer who? and what? questions, however, ability to answer what if? and why? type questions distinguishes OLAP from generalpurpose query tools. Types of analysis ranges from basic navigation and browsing (slicing and dicing) to calculations, to more complex analyses such as time series and complex modeling. 5

OLAP definition & application Examples of OLAP applications 6

OLAP definition & application Although OLAP applications are found in widely divergent functional areas, they all have the following key features: multi-dimensional views of data support for complex calculations time intelligence 7

OLAP - multi-dimensional views of data Core requirement of building a realistic business model. Provides basis for analytical processing through flexible access to corporate data. The underlying database design that provides the multi-dimensional view of data should treat all dimensions equally. 8

OLAP - support for complex calculations Must provide a range of powerful computational methods such as that required by sales forecasting, which uses trend algorithms such as moving averages and percentage growth. Mechanisms for implementing computational methods should be clear and non-procedural. 9

OLAP time intelligence Key feature of almost any analytical application as performance is almost always judged over time. Time hierarchy is not always used in the same manner as other hierarchies. Concepts such as year-to-date and period-over-period comparisons should be easily defined. 10

Representation of Multidimensional Data Example of two-dimensional query. What is the total revenue generated by property sales in each city, in each quarter of 2004? Choice of representation is based on types of queries end-user may ask. Compare representation - three-field relational table versus twodimensional matrix. 11

Multi-dimensional Data as Three-field table versus Twodimensional Matrix Pearson Education Limited 1995, 2005 12

Representation of Multidimensional Data Example of three-dimensional query. What is the total revenue generated by property sales for each type of property (Flat or House) in each city, in each quarter of 2004? Compare representation - fourfield relational table versus three-dimensional cube. 13

Multi-dimensional Data as Four-field Table versus Threedimensional Cube Pearson Education Limited 1995, 2005 14

Representation of Multidimensional Data A sample data cube 15

Representation of Multidimensional Data Cube represents data as cells in an array. Relational table only represents multidimensional data in two dimensions. Multi-dimensional structures are best visualized as cubes of data, and cubes within cubes of data. Each side of a cube is a dimension. A cube can be expanded to include other dimensions. 16

Representation of Multidimensional Data As number of dimensions increases, number of the cube s cells increases exponentially. However, majority of multidimensional queries use summarized, high-level data. Solution is to pre-aggregate (consolidate) all logical subtotals and totals along all dimensions. 17

Representation of Multidimensional Data Pre-aggregation is valuable, as typical dimensions are hierarchical in nature. (e.g. Time dimension hierarchy - years, quarters, months, weeks, and days) Predefined hierarchy allows logical preaggregation and, conversely, allows for a logical drill-down. 18

Representation of Multidimensional Data Supports common analytical operations Consolidation Drill-down Slicing and dicing 19

Representation of Multidimensional Data Pivoting select two dimensions used to aggregate a measure e.g., the Sales for a Product in a particular City Product City Date Sales Juice NY 10 NY CA MN Cola CA 20 Juice 40 x x Juice NY 30 Cola 10 20 x Cola NY 10 Cream x x 10 Cream MN 10 20

Representation of Multidimensional Data Rollup taking a current data and group-by further on another dimension e.g., rollup the total sales on city by product Drill-down 21

22

23

Representation of Multidimensional Data Slice_and_dice reducing the dimensionality of data the table (city, date) of a specific product 24

25

26

Representation of Multidimensional Data Consolidation - aggregation of data such as simple roll-ups or complex expressions involving inter-related data. Drill-Down - is the reverse of consolidation and involves displaying the detailed data that comprises the consolidated data. 27

Representation of Multidimensional Data Slicing and Dicing - (also called pivoting) refers to the ability to look at the data from different viewpoints. 28

Representation of Multidimensional Data Can store data in a compressed form by dynamically selecting physical storage organizations and compression techniques that maximize space utilization. Dense data (that is, data that exists for a high percentage of cells) can be stored separately from sparse data (that is, a significant percentage of cells are empty). 29

Representation of Multidimensional Data Ability to omit empty or repetitive cells can greatly reduce the size of the cube and the amount of processing. Allows analysis of exceptionally large amounts of data. 30

Representation of Multidimensional Data In summary, pre-aggregation, dimensional hierarchy, and sparse data management can significantly reduce the size of the cube and the need to calculate values onthe-fly. Removes need for multi-table joins and provides quick and direct access to arrays of data, thus significantly speeding up execution of multi-dimensional queries. 31