Using ORACLE tools to generate Multidimensional Model in Warehouse.



Similar documents
Distance Learning and Examining Systems

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

SQL Server 2012 Business Intelligence Boot Camp

Implementing Data Models and Reports with Microsoft SQL Server 20466C; 5 Days

MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

Implementing Data Models and Reports with Microsoft SQL Server

LEARNING SOLUTIONS website milner.com/learning phone

Microsoft Implementing Data Models and Reports with Microsoft SQL Server

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Course 20467A; 5 Days

Implementing Data Models and Reports with Microsoft SQL Server

Business Intelligence, Analytics & Reporting: Glossary of Terms

2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000

PREFACE INTRODUCTION MULTI-DIMENSIONAL MODEL. Chris Claterbos, Vlamis Software Solutions, Inc.

Implementing Data Models and Reports with Microsoft SQL Server 2012 MOC 10778

CHAPTER 5: BUSINESS ANALYTICS

Data Integration and ETL with Oracle Warehouse Builder NEW

CS2032 Data warehousing and Data Mining Unit II Page 1

CHAPTER 4: BUSINESS ANALYTICS

Oracle OLAP What's All This About?

8902 How to Generate Universes from SAP Sybase PowerDesigner. Revision:

IBM Cognos Training: Course Brochure. Simpson Associates: SERVICE associates.co.uk

A Critical Review of Data Warehouse

Migrating a Discoverer System to Oracle Business Intelligence Enterprise Edition

Data W a Ware r house house and and OLAP II Week 6 1

Oracle OLAP 11g and Oracle Essbase

DATA WAREHOUSING - OLAP

Course: SAS BI(business intelligence) and DI(Data integration)training - Training Duration: 30 + Days. Take Away:

When to consider OLAP?

Microsoft Services Exceed your business with Microsoft SharePoint Server 2010

Delivering Business Intelligence With Microsoft SQL Server 2005 or 2008 HDT922 Five Days

Turkish Journal of Engineering, Science and Technology

Building Cubes and Analyzing Data using Oracle OLAP 11g

Turning your Warehouse Data into Business Intelligence: Reporting Trends and Visibility Michael Armanious; Vice President Sales and Marketing Datex,

Business Intelligence for SUPRA. WHITE PAPER Cincom In-depth Analysis and Review

SQL SERVER BUSINESS INTELLIGENCE (BI) - INTRODUCTION

Monitoring Genebanks using Datamarts based in an Open Source Tool

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole

SAP BO Course Details

Data Integration and ETL with Oracle Warehouse Builder: Part 1

ORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process

Data Warehousing: Data Models and OLAP operations. By Kishore Jaladi

OLAP Systems and Multidimensional Expressions I

Implementing a Data Warehouse with Microsoft SQL Server

Data Warehousing. Paper

SAS BI Course Content; Introduction to DWH / BI Concepts

Oracle Warehouse Builder 11gR2: Getting Started

Designing Business Intelligence Solutions with Microsoft SQL Server 2012

Budgeting and Planning with Microsoft Excel and Oracle OLAP

Part 22. Data Warehousing

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

COURSE 20463C: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

Dimensional Modeling for Data Warehouse

Oracle Warehouse Builder 10g

Sage 200 Business Intelligence Datasheet

Implementing a Data Warehouse with Microsoft SQL Server

M Designing and Implementing OLAP Solutions Using Microsoft SQL Server Day Course

MS 50511A The Microsoft Business Intelligence 2010 Stack

A Design and implementation of a data warehouse for research administration universities

Implementing a Data Warehouse with Microsoft SQL Server MOC 20463

COURSE OUTLINE MOC 20463: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

ReportPortal Web Reporting for Microsoft SQL Server Analysis Services

Creating BI solutions with BISM Tabular. Written By: Dan Clark

Deductive Data Warehouses and Aggregate (Derived) Tables

New Approach of Computing Data Cubes in Data Warehousing

Participant Guide RP301: Ad Hoc Business Intelligence Reporting

Analytics with Excel and ARQUERY for Oracle OLAP

End to End Microsoft BI with SQL 2008 R2 and SharePoint 2010

Designing Self-Service Business Intelligence and Big Data Solutions

IST722 Data Warehousing

SQL Server Administrator Introduction - 3 Days Objectives

Sage 200 Business Intelligence Datasheet

Business Benefits From Microsoft SQL Server Business Intelligence Solutions How Can Business Intelligence Help You? PTR Associates Limited

Business Intelligence & Product Analytics

DATA CUBES E Jayant Haritsa Computer Science and Automation Indian Institute of Science. JAN 2014 Slide 1 DATA CUBES

The difference between. BI and CPM. A white paper prepared by Prophix Software

Implement a Data Warehouse with Microsoft SQL Server 20463C; 5 days

Optimizing the Performance of the Oracle BI Applications using Oracle Datawarehousing Features and Oracle DAC

College of Engineering, Technology, and Computer Science

SAP BO 4.1 COURSE CONTENT

SBI2013: Building BI Solutions using Excel and SharePoint 2013

Sterling Business Intelligence

Microsoft Consulting Services. PerformancePoint Services for Project Server 2010

The Microsoft Business Intelligence 2010 Stack Course 50511A; 5 Days, Instructor-led

Microsoft. Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server

GEHC IT Solutions. Centricity Practice Solution. Centricity Analytics 3.0

SQL Server Analysis Services Complete Practical & Real-time Training

CHAPTER 6: ANALYZE MICROSOFT DYNAMICS NAV 5.0 DATA IN MICROSOFT EXCEL

Designing a Dimensional Model

SQL SERVER TRAINING CURRICULUM

IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance

Course Outline. Module 1: Introduction to Data Warehousing

The Jet Reports Suite of Products

DATA WAREHOUSING AND OLAP TECHNOLOGY

COURSE SYLLABUS COURSE TITLE:

IAF Business Intelligence Solutions Make the Most of Your Business Intelligence. White Paper November 2002

BI4Dynamics provides rich business intelligence capabilities to companies of all sizes and industries. From the first day on you can analyse your

Transcription:

Sławomir WIAK, Paweł DRZYMAŁA, Henryk WELFLE Technical University of Lodz Using ORACLE tools to generate Multidimensional Model in Warehouse. Abstract. The paper presents modern techniques of data modelling and processing, collected by the company. It presents the process of multidimensional data modelling (include the transformation of logical structure to relational structure and next transformation to Multidimensional Model of warehouse in star or snowflake schema). It also shows the ETL process and methods of creating OLAP cubes by use of ORACLE tools to support decision making by business analysts. An approach based on data mining techniques allows analysts to capture certain features in customers, to offer dedicated products for the customer groups. Based on customer behaviour can be concluded about his tendencies to their certain behaviours and preferences. Streszczenie. W pracy zaprezentowano współczesne techniki modelowania i przetwarzania danych pozyskiwanych przez instytucje. Artykuł przedstawia proces modelowania wielowymiarowych danych oraz proces ETL do momentu ich prezentacji z użyciem narzędzi firmy ORACLE. Przedstawiono metody tworzenia kostek OLAP wspomagające proces podejmowania decyzji przez analityków biznesowych. Podejście oparte na technikach data mining pozwala analitykom uchwycić pewne cechy wspólne dla klientów, w celu złożenia oferty produktów dedykowanych dla konkretnych grup. Na podstawie zachowań klientów można wnioskować o ich skłonnościach do pewnych zachowań i preferencji. (Modelowanie wielowymiarowych danych do ich prezentacji z użyciem narzędzi firmy ORACLE) Keywords: Database in Management, Business Intelligence, Data Warehouse, OLAP. Słowa kluczowe: Bazy Danych w Zarządzaniu, Business Intelligence Hurtownie Danych, OLAP. Introduction Today's companies need tools to support sales, production, management. The high costs of carrying out a marketing campaign and its preparation time will need a tool to facilitate this task. The overall aim of such activities is to maximize profits at the lowest cost. There are many ways to reach customers. The development of information technology has enabled the analysis of data for company profile. Clients of institutions that use BI tools also received new offers dedicated to their expectations. New technologies based on an analysis of data will save the time needed to complete tasks. Companies gain a competitive advantage in the market by the more fit the profile of their activities to the customers' expectations. Capture trends in business such as the existing terms of the dynamics of growing sales of certain types of products it is now possible with the use of data analysis techniques. Each product has its own characteristics and cannot be addressed to all customers. This situation requires tools to support corporate control of production processes, sales and quality management. An approach based on data mining techniques and expertise allows analysts to capture certain features in common customers, to offer dedicated products for the customer groups. Based on customer behavior can be concluded about his tendencies to certain products. The fulfillment of conditions provides that there is some probability that a customer buys a product. Multidimensional data analysis techniques based on OLAP cubes and the construction of warehouse and data modeling are presented in this article using Oracle tools. Modeling the Business and multidimensional structures OLAP consists objects that are a part of dimensional model. The dimensional data model (include: dimensions, attributes, levels, hierarchies, measures and cubes) is highly structured and implies rules that govern the relationships among the data and control how the data can be queried. The fact table is referred to a cube, and the columns (in table) are referred to measures. The cube has edges, which are referred to dimensions. The fact table include measures that are linked to a dimensions. Each dimension is a grouping of related columns from one or more tables. Analysts know which business measures they are interested in examining. Dimensions and attributes make the data meaningful, and how the dimensions of their business are organized into levels and hierarchies. Fig. 1. Diagram of a Relational schema. Fig. 2. Multi-dimensional schema. PRZEGLĄD ELEKTROTECHNICZNY (Electrical Review), ISSN 0033-2097, R. 88 NR 1a/2012 257

Cubes are the physical implementation of the dimensional model and are highly optimized for dimensional queries. The OLAP engine that underlies cubes is based on an indexed multidimensional array model, which provides direct cell access. Diagram of a relational and multidimensional schema are presented on Fig1 and Fig.2. The OLAP engine manipulates dimensional objects and are optimized for analysis, analytic and row functions can be calculated much faster in OLAP than in SQL. Overview of the Multidimensional Data Model The dimensional objects include cubes, measures, dimensions, attributes, levels, and hierarchies (Fig.3). - defined the range of date stored in the time dimension as Fig. 5. - choose the levels in the hierarchy (Fig.6.) Cube Dimension 1 Dimension 2 Attributes Hierarchies Levels Attributes Hierarchies Levels Fig. 5. Parameters window for the dimension of time. Measure Dimension 3 Measure Measure Fig. 3. Diagram of the OLAP Dimensional Model Attributes Hierarchies Levels Dimensions: Dimensions contain a set of unique values that identify and categorize data, they form the edges of a cube. Baselevel dimension values correspond to the unique keys of a fact table. Fig. 6. Time hierarchy and levels window for the dimension of time. The time dimension, giving levels, hierarchies, attributes are presented in Fig. 7. Fig. 8. presents the mapping of the logical structure to the physical structure of time dimension, which is stored in a database. Fig. 4. OLAP multidimensional model Time Dimensions: Time dimension require additional information about the members in terms of each member s time span and enddate. With these additional attributes multidimensional model is able to provide additional features compared to other dimensions such as: - Prior Value, - Difference from Prior Period, - Percent Difference from Prior Period, - Future Value, - Moving Average, - Moving Maximum, - Moving Minimum, - Moving Total, - Year to Date. It is possible to automatically define the time dimension using OWB environment and fill it with data from a given range of years. To do this, perform the following steps using the Create Time Dimension Wizard: - provide a name and description (e.g. time_dim) - set the storage type (MOLAP multidimensional storage, ROLAP relational storage). If you have high volumes of data combined with high refresh rates or if you have detailed high volume data. Choose MOLAP to store aggregated data that is used for analysis. Fig. 7. Dimensional Time - Structure 258 PRZEGLĄD ELEKTROTECHNICZNY (Electrical Review), ISSN 0033-2097, R. 88 NR 1a/2012

Prior/Future Comparison Time Frame Percent Markup Percent Variance Rank Share Variance Prior Value Difference from Prior Period Percent Difference from Prior Period Future Value Moving Average Moving Maximum Moving Minimum Moving Total Year to Date Fig. 8. Dimensional Time - Mapping Hierarchies and Levels: A hierarchy (Fig.9.) is a way to organize data at different levels of aggregation. In viewing data, analysts use dimension hierarchies to recognize trends at one level, drill down to lower levels to identify reasons for these trends, and roll up to higher levels to see what affect these trends have on a larger sector of the business. Defining OLAP objects The following Oracle tools to define the analytical workspace (AW) are available: Oracle Analytic Workspace Manager (AWM), Oracle Warehouse Bulider (OWB), Oracle SQL Developer Data Modeler and other package. All objects can be created in a dedicated and most powerful tool, which is Oracle Warehouse Bulider (including the model of ROLAP and MOLAP warehouse in the structure of a star or snowflake). OWB can perform the Extraction Transformation and Loading (ETL) engine using Oracle Workflow in order to fill the data warehouse. It allows both logical and physical design of the warehouse. Region Subregion Country Customer Fig. 9. Levels in a Dimension Hierarchy Attributes. An attribute provides additional information about the data. Some attributes are used for display. You might also have attributes like package, sizes etc. Time attributes can provide information about the Time dimension that may be useful in some types of analysis, such as number of days in each time period. Each attribute corresponds to a column in dimension table. Cubes Cubes provide a means of organizing measures that have the same shape, they have the exact same dimensions. A cube corresponds to a single fact table. Measures Measures populate the cells of a cube with the facts collected about business operations. Measures are organized by dimensions, which typically include a Time dimension. An analytic database contains snapshots of historical data, derived from data in a transactional database, legacy system, syndicated sources, or other data sources. Three years of historical data is generally considered to be appropriate for analytic applications. Table 1. Calculated measures Group Name Calculation Name Basic Arithmetic Addition Subtraction Multiplication Division Ratio Advanced Arithmetic Cumulative total Index Fig. 10. Warehouse logical schema in the form of a snowflake. The modeling process by using Oracle SQL Developer Data Modeler Oracle SQL Developer Data Modeler allows modeling the logical structure of warehouse in star or snowflake schema. Entity tab allows to select (on object definition) the PRZEGLĄD ELEKTROTECHNICZNY (Electrical Review), ISSN 0033-2097, R. 88 NR 1a/2012 259

type classification as Dimension or Fact. The next step is to set the definition of the Entity Attributes and Data Type. Each object in the form of such Dimension or Cube should also be set to a Unique Identifiers. Cube object usually has not unique identifier key. It is also necessary to carry out the mapping process, which is performed by selecting one type of relation 1: N relation, relation 1:1, M: N relation. The logical model is transformed to the relational schema (Engineer to Relational Model). In the process, you can choose which objects will be involved in transformation (Fig. 11). Fig. 11. Engineer to Relational Model process. Fig. 13. Multidimensional Model. Fig. 12. Relational schema after transformation process of the logical model. The next step is to create Multidimensional Model (MD) from the Relational Model (RD) in which is defined Physical Models based on the: attributes, levels and hierarchies. Such defined structure, for example, can be exported to Analytical Workspace (Oracle AW) and also be viewed in the AWM editor or can be modified as objects of OLAP. Designing ETL When source and target objects is defined in Oracle SQL Developer Data Modeler then the process of data movement and transformation logic should be started (ETL). ETL stands for Extract, Transform, and Load. ETL is designed in Warehouse Builder (OWB) and is used to create mappings to extract data from sources, transform it, and load it into targets. ETL process flows logic that interrelate mappings sources with targets and other activities can be designed using the OWB Process Flow Editor. To build ETL process logic the Workflow Server should be started. The Mapping Editor (Fig.15.) interface in Warehouse Builder is used to design a mapping. You select operators from the Mapping Editor Palette panel and place them on the canvas. Fig. 14. Multidimensional Model. Fig. 15. The process of mapping sources with targets objects. When mappings is done (that define the operations for moving data from sources to targets) the process flows can be created and defined. Process flows interrelate mappings and activities external to Warehouse Builder. In OWB, a 260 PRZEGLĄD ELEKTROTECHNICZNY (Electrical Review), ISSN 0033-2097, R. 88 NR 1a/2012

process flow is used to manage the execution of mappings and related activities. The activities editor of process flow includes a variety of activities that can be added and then connected with transitions to designing of a flow. Transitions indicate the sequence and conditions in which to launch activities, an activity represents units of work in a process flow and these units can involve components internal or external to Warehouse Builder. The AWM application can present data in tabular or graphically form. Such applications are used by business analysts to make decisions about functioning of the company and its development strategy. The analysts can manipulate cube objects with use of drag and drop methods. They may also limit the scope of the presented data using filters that limit data on individual dimensions, hierarchies and levels. The OWB has a similar methods of data presentation. Another way is to build own application with an interface to the data warehouse structure, e.g. using JDBC OCI8. Fig. 19. presents a method to build reports based on the Excel worksheet in which the imported data retrieved from the materialized views created in the process of maintenance OLAP cubes. All necessary data is stored in a xls file. Reports can be created in a dynamic manner depending on the needs and presented graphically. The xls file consists all imported data from Cube. Fig. 17. Sample data from OLAP cubes (AWM or OWB application) Fig. 16. Design a Process Flow in OWB using ORACLE Workflow Server In the window Measure Data Viewer, data can be presented in the form of many types of charts (eg bar, line, pie). Fig. 18. shows the possibility to present data in a pie chart. Fig. 18. Sample data from OLAP cubes (pie chart) PRZEGLĄD ELEKTROTECHNICZNY (Electrical Review), ISSN 0033-2097, R. 88 NR 1a/2012 261

Fig. 19. Sample data from OLAP cubes in Microsoft Excel. Summary The paper presents modern techniques of data modeling and processing, collected by the company. It presents the process of multidimensional data modeling (include the transformation of logical structure to relational structure and next transformation to Multidimensional Model of warehouse in star or snowflake schema). It also shows the ETL process and methods of creating OLAP cubes by use of ORACLE tools to support decision making by business analysts. An approach based on data mining techniques allows analysts to capture certain features in customers, to offer dedicated products for the customer groups. Based on customer behavior can be concluded about his tendencies to their certain behaviors and preferences. REFERENCES [1] Oracle OLAP - User's Guide 11g. Release 1 (11.1). B28124-03 [2] Oracle Database - Data Warehousing Guide 11g. Release 1 (11.1). B28313-02 [3] Oracle Warehouse Builder - User's Guide 11g. Release 1 (11.1). B31278-06 [4] Oracle SQL Developer - User s Guide Release 3.0. E17472-05 [5] Oracle SQL Developer - Data Modeler User s Guide. Release 3.0. E15802-03 [6] Chuck Ballard, Dirk Herreman, Don Schau, Rhonda Bell, Eunsaeng Kim, Ann Valencic. Data Modeling Techniques for Data Warehousing. IBM - International Technical Support Organization - 1998. http://www.redbooks.ibm.com [7] Smarter Planet IBM: http://www.ibm.com/smarterplanet/us/en/overview/ideas/ [8] Mendrala D., Szeliga M. Serwer SQL 2008. Usługi biznesowe. Analiza i eksploracja danych, Helion 2009, Gliwice. [9] Chuck Ballard, Daniel M. Farrell, Amit Gupta, Carlos Mazuela, Stanislav Vohnik. Dimensional Modeling: In a Business Intelligence Environment. IBM - International Technical Support Organization - 2006. http://www.redbooks.ibm.com Authors: prof. dr hab. inż. Sławomir Wiak, Technical University of Lodz, Institute of Mechatronics and Information Systems, ul. Stefanowskiego 18/22, 90-924 Łódź, E-mail: slawomir.wiak@p.lodz.pl; dr inż. Paweł Drzymała, Technical University of Lodz, Institute of Mechatronics and Information Systems, Stefanowskiego 18/22, 90-924 Łódź, E-mail: pawel.drzymala@p.lodz.pl; dr inż. Henryk Welfle, Technical University of Lodz, Institute of Mechatronics and Information Systems, Stefanowskiego 18/22, 90-924 Łódź, E-mail: henryk.welfle@p.lodz.pl. 262 PRZEGLĄD ELEKTROTECHNICZNY (Electrical Review), ISSN 0033-2097, R. 88 NR 1a/2012