Online Analytic Processing OLAP

Similar documents
OLAP & DATA MINING CS561-SPRING 2012 WPI, MOHAMED ELTABAKH

DATA WAREHOUSING - OLAP

Introduction to Data Warehousing. Ms Swapnil Shrivastava

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

Overview of Data Warehousing and OLAP

Part 22. Data Warehousing

When to consider OLAP?

Learning Objectives. Definition of OLAP Data cubes OLAP operations MDX OLAP servers

Week 13: Data Warehousing. Warehousing

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

Data Warehousing: Data Models and OLAP operations. By Kishore Jaladi

Monitoring Genebanks using Datamarts based in an Open Source Tool

IST722 Data Warehousing

Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP ( 28

Week 3 lecture slides

CHAPTER 4 Data Warehouse Architecture

OLAP OLAP. Data Warehouse. OLAP Data Model: the Data Cube S e s s io n

Data W a Ware r house house and and OLAP II Week 6 1

Outline. Data Warehousing. What is a Warehouse? What is a Warehouse?

BUSINESS ANALYTICS AND DATA VISUALIZATION. ITM-761 Business Intelligence ดร. สล ล บ ญพราหมณ

Data Warehousing & OLAP

Data Warehousing and OLAP

Turkish Journal of Engineering, Science and Technology

Anwendersoftware Anwendungssoftwares a. Data-Warehouse-, Data-Mining- and OLAP-Technologies. Online Analytic Processing

CS2032 Data warehousing and Data Mining Unit II Page 1

DATA WAREHOUSING AND OLAP TECHNOLOGY

Data Warehousing and OLAP Technology for Knowledge Discovery

Business Intelligence & Product Analytics

(Week 10) A04. Information System for CRM. Electronic Commerce Marketing

OLAP Theory-English version

Data Warehousing. Overview, Terminology, and Research Issues. Joachim Hammer. Joachim Hammer

Multi-dimensional index structures Part I: motivation

2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000

14. Data Warehousing & Data Mining

Module 1: Introduction to Data Warehousing and OLAP

Database Applications. Advanced Querying. Transaction Processing. Transaction Processing. Data Warehouse. Decision Support. Transaction processing

Data Warehousing. Outline. From OLTP to the Data Warehouse. Overview of data warehousing Dimensional Modeling Online Analytical Processing

Data Warehouse: Introduction

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES

A Technical Review on On-Line Analytical Processing (OLAP)

Lecture Data Warehouse Systems

UNIT-3 OLAP in Data Warehouse

Building Data Cubes and Mining Them. Jelena Jovanovic

OLAP Systems and Multidimensional Expressions I

New Approach of Computing Data Cubes in Data Warehousing

Knowledge Discovery in Databases. Databases. date name surname street city account no. payment balance

Mario Guarracino. Data warehousing

On-Line Application Processing. Warehousing Data Cubes Data Mining

II. OLAP(ONLINE ANALYTICAL PROCESSING)

Unit -3. Learning Objective. Demand for Online analytical processing Major features and functions OLAP models and implementation considerations

Designing a Dimensional Model

Data Warehousing. Paper

1960s 1970s 1980s 1990s. Slow access to

This tutorial will help computer science graduates to understand the basic-toadvanced concepts related to data warehousing.

Optimizing Your Data Warehouse Design for Superior Performance

ROLAP with Column Store Index Deep Dive. Alexei Khalyako SQL CAT Program Manager

Data Warehousing, OLAP, and Data Mining

Lecture 2: Introduction to Business Intelligence. Introduction to Business Intelligence

Business Intelligence Solutions. Cognos BI 8. by Adis Terzić

DATA CUBES E Jayant Haritsa Computer Science and Automation Indian Institute of Science. JAN 2014 Slide 1 DATA CUBES

Data Warehouse design

Namrata 1, Dr. Saket Bihari Singh 2 Research scholar (PhD), Professor Computer Science, Magadh University, Gaya, Bihar

Presented by: Jose Chinchilla, MCITP

University of Gaziantep, Department of Business Administration

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

DATA WAREHOUSING APPLICATIONS: AN ANALYTICAL TOOL FOR DECISION SUPPORT SYSTEM

Data Warehousing Systems: Foundations and Architectures

Apache Kylin Introduction Dec 8,

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

CHAPTER 5: BUSINESS ANALYTICS

Hybrid OLAP, An Introduction

The strategic importance of OLAP and multidimensional analysis A COGNOS WHITE PAPER

A web-based contact center performance analytics application that gathers information from across your customer interaction technologies and provides

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage

M Designing and Implementing OLAP Solutions Using Microsoft SQL Server Day Course

Data Warehousing OLAP

Ramesh Bhashyam Teradata Fellow Teradata Corporation

Overview. Data Warehousing and Decision Support. Introduction. Three Complementary Trends. Data Warehousing. An Example: The Store (e.g.

OLAP and Data Warehousing! Introduction!

A very short talk about Apache Kylin Business Intelligence meets Big Data. Fabian Wilckens EMEA Solutions Architect

OLAP Systems and Multidimensional Queries II

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

Oracle OLAP What's All This About?

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

OLAP. Business Intelligence OLAP definition & application Multidimensional data representation

Why Business Intelligence

Integrating GIS and BI: a Powerful Way to Unlock Geospatial Data for Decision-Making

Introducing Oracle Exalytics In-Memory Machine

Data Warehousing & OLAP

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies

Integrate multiple, heterogeneous data sources. Data cleaning and data integration techniques are applied

CHAPTER 4: BUSINESS ANALYTICS

A Design and implementation of a data warehouse for research administration universities

70-467: Designing Business Intelligence Solutions with Microsoft SQL Server

Business Intelligence, Analytics & Reporting: Glossary of Terms

Search and Data Mining Techniques. OLAP Anna Yarygina Boris Novikov

Decision Support. Chapter 23. Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke 1

Transcription:

OLAP QUERIES 1

Online Analytic Processing OLAP 2

OLAP OLAP: Online Analytic Processing OLAP queries are complex queries that Touch large amounts of data Discover patterns and trends in the data Typically expensive queries that take long time Also called decision-support queries In contrast to OLAP: OLTP: Online Transaction Processing Select salary From Emp Where ID = 100; OLTP queries are simple queries, e.g., over banking or airline systems OLTP queries touch small amount of data for fast transactions 3

OLTP vs. OLAP On-Line Transaction Processing (OLTP): technology used to perform updates on operational or transactional systems (e.g., point of sale systems) On-Line Analytical Processing (OLAP): technology used to perform complex analysis of the data in a data warehouse OLAP is a category of software technology that enables analysts, managers, and executives to gain insight into data through fast, consistent, interactive access to a wide variety of possible views of information that has been transformed from raw data to reflect the dimensionality of the enterprise as understood by the user. [source: OLAP Council: www.olapcouncil.org] 4

OLAP AND DATA WAREHOUSE OLAP Server OLAP Internal Sources Reports Operational DBs Data Integration Component Data Warehouse Query and Analysis Component Data Mining External Sources Meta data Client Tools 5

OLAP AND DATA WAREHOUSE Typically, OLAP queries are executed over a separate copy of the working data Over data warehouse Data warehouse is periodically updated, e.g., overnight OLAP queries tolerate such out-of-date gaps Why run OLAP queries over data warehouse?? Warehouse collects and combines data from multiple sources Warehouse may organize the data in certain formats to support OLAP queries OLAP queries are complex and touch large amounts of data They may lock the database for long periods of time Negatively affects all other OLTP transactions 6

OLAP ARCHITECTURE 7

EXAMPLE OLAP APPLICATIONS Market Analysis Find which items are frequently sold over the summer but not over winter? Credit Card Companies Given a new applicant, does (s)he a credit-worthy? Need to check other similar applicants (age, gender, income, etc ) and observe how they perform, then do prediction for new applicant OLAP queries are also called decisionsupport queries 8

MULTI-DIMENSIONAL VIEW Location Data is typically viewed as points in multi-dimensional space bread Items NY MA CA 10 Raw data cubes (raw level without aggregation) Orange juice Milk 2%fat Milk 1%fat 47 30 12 3/1 3/2 3/3 3/4 Time Typical OLAP applications have many dimensions 9

"#'&#* ANOTHER EXAMPLE!"#!$$%&#'() 10

APPROACHES FOR OLAP Relational OLAP (ROLAP) Multi-dimensional OLAP (MOLAP) Hybrid OLAP (HOLAP) = ROLAP + MOLAP 11

RELATIONAL OLAP: ROLAP Data are stored in relational model (tables) Special schema called Star Schema One relation is the fact table, all the others are dimension tables Large table Product Model Type Color Channel Facts Product Region Time Channel Revenue Expenses Units Region Nation District Dealer Time Week Year Small tables 12

CUBE vs. STAR SCHEMA bread Items Dimension tables describe the dimensions NY MA CA 10 Location Product Model Type Color Channel Facts Product Region Time Channel Revenue Expenses Units Region Nation District Dealer Time Week Year Orange juice Milk 2%fat Milk 1%fat 47 30 12 Data inside the cube are the fact records 3/1 3/2 3/3 3/4 Time 13

ROLAP: EXTENSIONS TO DBMS Schema design Specialized scan, indexing and join techniques Handling of aggregate views (querying and materialization) Supporting query language extensions beyond SQL Complex query processing and optimization Data partitioning and parallelism 14

SLICING & DICING Dicing how each dimension in the cube is divided Different granularities When building the data cube Slicing Selecting slices of the data cube to answer the OLAP query When answering a query bread Orange juice Items Milk 2%fat Milk 1%fat NY MA CA 10 47 30 12 Dicing Location by state Location 3/1 3/2 3/3 3/4 Time Dicing Time by day 15

SLICING & DICING: EXAMPLE 1 Dicing Slicing Slicing operation in ROLAP is basically: -- Selection conditions on some attributes (WHERE clause) + -- Group by and aggregation 16

SLICING & DICING: EXAMPLE 2 17

SLICING & DICING: EXAMPLE 3 18

DRILL-DOWN & ROLL-UP Drill-down (Group by Nation) Region Sales variance Africa 105% Asia 57% Europe 122% North America 97% Pacific 85% South America 163% Roll-up (group by Region) Nation Sales variance China 123% Japan 52% India 87% Singapore 95% 19

ROLAP: DRILL-DOWN & ROLL-UP Drill-down Roll-up 20

MOLAP Unlike ROLAP, in MOLAP data are stored in special structures called Data Cubes (Array-bases storage) Data cubes pre-compute and aggregate the data Possibly several data cubes with different granularities Data cubes are aggregated materialized views over the data As long as the data does not change frequently, the overhead of data cubes is manageable Sales 1996 1997 Red blob Blue blob Every day, every item, every city Every week, every item category, every city 21

MOLAP: CUBE OPERATOR Aggregation over the Z axis Aggregation over the X,Y Aggregation over the Y axis Raw-data (fact table) Aggregation over the X axis 22

MOLAP & ROLAP Commercial offerings of both types are available In general, MOLAP is good for smaller warehouses and is optimized for canned queries In general, ROLAP is more flexible and leverages relational technology ROLAP May pay a performance penalty to realize flexibility 23

OLTP vs. OLAP OLTP OLAP User Function DB Design Data View Usage Unit of work Access Operations # Records accessed #Users Db size Metric Clerk, IT Professional Day to day operations Application-oriented (E-R based) Current, Isolated Detailed, Flat relational Structured, Repetitive Short, Simple transaction Read/write Index/hash on prim. Key Tens Thousands 100 MB-GB Trans. throughput Knowledge worker Decision support Subject-oriented (Star, snowflake) Historical, Consolidated Summarized, Multidimensional Ad hoc Complex query Read Mostly Lots of Scans Millions Hundreds 100GB-TB Query throughput, response Source: Datta, GT 24

OLAP: SUMMARY OLAP stands for Online Analytic Processing and used in decision support systems Usually runs on data warehouse In contrast to OLTP, OLAP queries are complex, touch large amounts of data, try to discover patterns or trends in the data OLAP Models Relational (ROLAP): uses relational star schema Multidimensional (MOLAP): uses data cubes 25