Data Warehouse Snowflake Design and Performance Considerations in Business Analytics
|
|
|
- Daniel Gervase Roberts
- 9 years ago
- Views:
Transcription
1 Journal of Advances in Information Technology Vol. 6, No. 4, November 2015 Data Warehouse Snowflake Design and Performance Considerations in Business Analytics Jiangping Wang and Janet L. Kourik Walker School of Business and Technology, Webster University, St. Louis, Missouri, USA {wang, Abstract Snowflake is a data warehouse schema design where dimension tables are normalized on top of a star schema design. Snowflake schema is generally not recommended due to its performance overhead in joining the normalized dimension tables. However, the Snowflake schema can be extended in a way to improve performance for business analysis activities. In business analytics paradigm, two distinct environments are complementary and work together to provide effective business analytics. Firstly, the data warehouse environment transforms operational data into information. Secondly, the analytical environment delivers information to end users for further data analysis and decision making. The snowflake schema bridges the gap between the two environments. Snowflake schema facilitates the mapping of wide dimension structures with many dimension attributes to analytical processing hierarchies. The snowflake schema makes navigation along hierarchies easier and supports flexible analysis such as drilldown and rollup. This paper examines the two complementary business intelligence environments, roles played by the snowflake design in mapping from data warehouse to analytics, and performance considerations in snowflake design with case studies. Index Terms data warehouse, snowflake design, business intelligence, business analytics I. INTRODUCTION Business intelligence (BI) is a paradigm where enterprises integrate their operational data to make decisions based on data analytics. The goal of business intelligence is to present information to the end user in a way that supports decision making [1]. Decisions based on data can greatly enhance enterprise knowledge management and customer relationship management. Analytics and business intelligence capabilities have become competitive differentiators for many enterprises [2] and [3]. Before data can be used effectively for BI query processing and presentation, operational data must be transformed into decision support data. To prepare for the data warehouse environment transaction data must be extracted and integrated from multiple sources. Further, the operational data must be transformed into information Manuscript received on March 2, 20 15; revised on August 25, J. Adv. Inf. Technol. doi: /jait so that necessary business understanding can be achieved and knowledge can be extracted from the data [4]. Data warehouses are fundamental to the business intelligence environment, encompass data from the entire enterprise, and focus on enterprise-wide business processes. The need for more intensive and complex analytics is the motivation for another environment to support online analytical processing (OLAP). OLAP makes the contents of the data warehousing environment available to users in an optimized structure, including strategic information, for decision making. OLAP supports multidimensional data analysis that focuses on analytical process and the relationship among business subject areas. OLAP environment maps data elements from the data warehouse to its own data structure in order to deliver information in an advanced, yet easy to use, interface. The design of BI systems, that are important in facilitating the decision-making process, differs significantly from the customary operational or transaction database. Transaction databases are designed for running day-to-day business activities and supporting the operational needs of staff; therefore data manipulation is crucial to the design of transaction systems. In transaction systems, data are stored in detail and require users to execute queries to summarize or aggregate data for reporting. Accuracy is essential in order to manage data at the granularity of a logical business transaction and is achieved in part by normalizing the data storage mechanism often referred to as tables. Normalization is a design process for transaction databases designed to minimize data redundancy and reduce data anomalies during data manipulation. Data anomalies occur when inserting, updating, or deleting data and may introduce unintended errors in the database. Such anomalies are a barrier to the data integrity (accuracy) required to serve employees on the front line of interaction with customers. Essentially, normalization is an abstract data design approach that increases data integrity in the database. Operational database design reflects data integrity concerns and tends to be highly normalized to maintain efficient daily transaction support. Having summarized traits of operational databases and the context for BI and data warehousing, the next section will examine the two database environments used in BI.
2 The third section looks at the star schema design used to build many data warehouses. The fourth section depicts the snowflake schema as an alternative and its role in BI environments. The fifth section examines the benefits in terms of performance as well as the drawbacks of the snowflake design. The last section summarizes the findings. II. DISTINCTIVE ENVIRONMENTS IN BUSINESS INTELLIGENCE Business intelligence is a comprehensive and integrated environment to capture, integrate, and analyze data with the purpose of generating and presenting information to create intelligence about a business. To support business decision making, business intelligence in general encompasses two distinctive environments, namely data warehousing and analytical. Both environments are needed to provide a comprehensive solution in tools and techniques. Data warehousing environment transforms data to information and analytical environment transforms information to knowledge. Data warehouse provides an integrated, subject-oriented, non-volatile, and time variant environment to collect and store data. To put data into data warehouse, data from operational databases are integrated, cleansed and transformed into usable information that is ready to support complex data representation. To provide information for decision making, data warehouse has to be designed in subject oriented schema structure, where data is collected and organized by business areas to better answer business questions. Therefore, data warehouse is generally implement under star schema that consists of a fact table representing business measures and surrounding dimension tables representing business subject areas. With tremendous complexities and performance demands in business intelligence, it is almost impossible for top management to retrieve data warehousing data and answer business questions without a sophisticated analytical environment. The analytical environment provides users advanced tools to access information content and transform information into knowledge. It concentrates on transforming information from data warehouse into knowledge for decision makers to reach timely and accurate strategic decision in their business. As part of analytical environment, OLAP plays key roles in managing data, transforming data, analyzing data, and presenting data to end user [5]. The two environments work together complementarily in business intelligence to empower users to make sound business decisions based on the accumulated knowledge of the business as reflected on historic operational data, as depicted in Fig. 1. Each of the two environments has distinct characteristics and plays roles in the process of business intelligence. The two environments have to work together coherently. Data warehouse design has to take into consideration how the data in data warehouse will be processed and analyzed by end users. In turn, OLAP hierarchies for drilling down and rolling up have to map to underlying data warehouse dimension hierarchies at various granularity levels. Therefore, data warehouse design directly affects how the data will be used in the analytical environment. III. Figure 1. Data, information, and knowledge. STAR SCHEMA IN DATA WAREHOUSE DESIGN The star schema is used in data warehouse design because the existing normalized operational database does not yield a structure that serves advance data analysis requirements well. The star schema is a dimensional modeling technique for data warehouse where critical business measures, such as sales and revenue, are captured and examined from perspectives of multiple dimensions. Star schema focuses on dimensions and facts, mapping business subject areas with business measures. Star schema benefits include simple implementation and an intuitive process for user. Dimensions provide views by subject areas, such as customer, region, and product. A dimensional table is generally flat and wide, consisting of many attributes that describe the dimension. Since table joins are computationally expensive, it is generally recommended to denormalize dimension tables to reduce possible joins between tables so performance of queries can be satisfied. Denormalized dimension tables connect to the fact table by foreign keys, which is intuitive to business users simplifying business measures analysis. For example, as depicted in the Fig. 2, the business measures are units, sales, and cost, which can be analyzed by aggregating in subject areas such as sales channel, customer, and product. One of the features of star schema is its wide dimension table. A dimension is wide and flat with many textual descriptive attributes so measures can be fully described during analysis. For instance, the TIME dimension, in addition to the key for months, MONTH_ID, may contain many other attributes. If the table is laid out with columns and rows, the table is extended horizontally. Another feature of a wide dimension in star schema design is its multiple hierarchies. Dimension tables often consist of multiple hierarchies so that analysis along any hierarchy can be performed. In the example of above TIME dimension, there exist hierarchies such as calendar year and fiscal year. Analysis can be performed by drilling down along the calendar year hierarchy from all years, to calendar year, calendar quarter, and month. Similarly, it can also be performed along the fiscal year hierarchy from all year, to fiscal year, fiscal quarter, and month J. Adv. Inf. Technol. 213
3 With their flat and wide structure, dimension tables in star schema are not normalized. For optimized query performance, this denormalized design allows attributes in dimensions to participate in queries by relating directly to fact table without incurring any extra overhead of joining normalized tables. However, for many database professionals, this denormalized nature may cause problems in not only wasted storage space but also table updating and maintenance. For example, dimensional tables will be subject to changes under situations of slowly changing dimensions (SCD). If the attribute for the update in the dimension table is duplicated across multiple records, update anomalies may occur. year hierarchy, since analysis will be performed heavily along these two paths. Figure 3. Snowflake schema. IV. Figure 2. Sample star schema. SNOWFLAKE SCHEMA DESIGN Snowflake schema is a variation of star schema in data warehouse design. Typical snowflake schema can be achieved by normalizing a dimensional table to reach semantic simplicity. In snowflake schema, each hierarchical level is stored in a separate dimension table. Since levels of dimension hierarchy relate closely to path of analysis, snowflake design facilitates data filtering operations along the dimension hierarchies and simplifies user navigation through dimension tables. With original star schema fact table in the center, if all hierarchies in all dimension tables are normalized, the design resembles the intricate arrangements of a snowflake. There typically exist many potential hierarchies in a wide dimension for a given business subject area. However, not all hierarchies need be normalized in order to maintain a simple and easy to understand design. To better support decision making and map objects in the OLAP environment from data warehousing environment, the snowflake schema needs be designed keeping in mind the hierarchies that will participate in analytical processes. If a potential hierarchy will not be required when mapping to OLAP layer, it should be ignored in the normalization process to minimize design complexity and performance overhead. This approach balances between a pure star schema and a pure snowflake schema and produces an optimal design. A sample snowflake design is shown in Fig. 3, where TIME dimension is normalized along two hierarchies, calendar year hierarchy and fiscal OLAP hierarchies are mapped to data warehouse design. Fig. 4 lists typical mappings between OLAP objects to data warehouse objects. The OLAP cube maps to data warehouse fact table or a view that is based on a fact table. Measures from fact table directly map to cube measures. OLAP dimensions and hierarchies map to subdimensions in data warehouse. For instance, on the data warehouse side, there is only one flattened out dimension table TIME that has embedded hierarchies, such as calendar year and fiscal year. On the OLAP side, the mapping based on the snowflake schema will generate multiple hierarchical views. This is beneficial to the mapping process and easy in maintenance. Obviously, with the snowflake design where all the sub-dimensions are in their own normalized tables, the mapping process will be more simplified compared to the mapping performed in the original star schema. Figure 4. OLAP to data warehouse mappings. Snowflake schema bridges data warehousing environment to analytical environment by mapping hierarchies and multi-dimensions easily. Snowflake schema reflects how users view the data in their organizations. The normalized multiple dimension tables represent levels in the dimensional hierarchy. It is intuitive to understand since it matches business subject areas and the relationship among them. With snowflake design, existing alignment between data processes such as drilldown and rollup, and dimension hierarchies makes transformation from data to information and from information to knowledge easier, and consequently enhances business decision making. V. PERFORMANCE CONSIDERATIONS If data warehouses were used directly for data analysis under either snowflake or star schema design, aggregate 2015 J. Adv. Inf. Technol. 214
4 calculations would be done on the fly at any level above the base level in each dimension taking a significant amount of time in query processing. OLAP system enables quick and easy information retrieval by mapping data to underlying data warehouse. OLAP environment focuses on cubes that aggregate business measures for each unique combination of dimension hierarchies. In viewing data, analysts use dimension hierarchies to recognize trends at one level, drill down to lower levels to identify reasons for these trends, and roll up to higher level to see what effect these trends have on a larger sector of the business. The snowflake schema is really useful when a data warehouse maintains multiple fact tables representing different aggregation levels. The aggregate fact tables are pre-calculated and work with sub-dimension in hierarchies to speed up query operations. For example, the TIME dimension can be snowflaked to Year-Quarter- Month-Day. To speed up query operation for aggregated values for roll-up operations, multiple fact tables related to each level (year, quarter, and month) of aggregation in the location dimension can be created. Each fact table matches a level along the hierarchical structure within the dimension, as demonstrated in Fig. 5. The aggregate tables are pre-computed at the data-loading phase rather than at run time. The purpose of this technique is to save processor cycles at run time, thereby speeding up data analysis. Figure 6. Query 1 with aggregate function returned in 0.10 seconds. Figure 5. Multiple fact tables. With this design, aggregate tables contain all of the aggregate data so a query against them just selects the aggregated data, instead of performing calculations to generate the value. Queries select the aggregates directly from the fact table by applying the appropriate filter to each dimension, which will significantly improve the performance, as the sample queries and results compared in Fig. 6 and Fig. 7. Fig. 6 Sample Query 1 shows a query with aggregate function SUM() and GROUP BY clause that is executed against data of close to 300,000 records in a sample fact table where calculation is performed on the fly and result is retrieved in 0.10 seconds. In contrast, Fig. 7 Sample Query 2 shows another query that fetches the exact same results from aggregate table using filters, instead of using any aggregate function, which retrieves the same result in 0.01 second 10 times faster. Figure 7. Query 2 without aggregate function returned in 0.01 seconds. The normalization process may bring some other benefits into the picture. Dimension tables in the data warehouse are usually very large containing multiple sets of attributes at different granularities. For example demographic or geographic information in the customer dimension table may be separated as sub-dimensions. By implementing snowflake design, normalization is performed on the large and very wide dimension tables, 2015 J. Adv. Inf. Technol. 215
5 which makes navigation along hierarchies easier. It may also help optimize complex queries by implementing a heuristic-based query rewriting technique. With the snowflake design, structures in a data warehouse are easier to update and maintain. Normalization reduces data redundancies and, in turn, reduces data anomalies. With a large number of attributes in a dimension table, it is possible that a set of related attributes are updated less frequently than others. Having multiple dimension tables for sub-dimensions allows for queries to work with fewer records. The chances of data anomalies are greatly reduced. Normalization allows long text fields in dimension tables to be eliminated. Normalization reduces storage space requirements on holding many attributes of dimension tables, especially those involving long text fields that are repeated. If the data is sparse, where a large number of attributes are empty for each dimension record, those attributes that are rarely populated could be in their own sub-dimension table. The savings in space could be generous in many cases. However, since query performance in data analysis is commonly more important than storage efficiency, a fully normalized snowflake structure may not be the best approach. In many cases it may be appropriate to normalize certain dimensions that are directly involved in analysis processes and create partial snowflake structures in order to achieve significant storage savings at the price of an insignificant decrease in query efficiency. VI. CONCLUSIONS Data analytics in business intelligence requires robust data warehouse design to support flexible querying across multiple complex dimension relationships. Snowflake schema is a method of normalizing the dimension tables in a star schema and creating sub-dimensions for hierarchical levels. The snowflake schema is suitable for mapping flattened out dimension structure to OLAP hierarchies. It bridges the gap between the data warehouse environment and data analytics environment in business intelligence and facilitates the mapping between the two. Snowflake schema makes navigation along hierarchies easier and analysis such as drilldown and rollup possible. It works well with multiple aggregate fact tables where performance of aggregation analysis will be greatly enhanced. Lastly the snowflake schema saves processor cycles at runtime with appropriate filters on dimensions applied on aggregates directly in the fact table. With the snowflake design, structures in a data warehouse are easier to update and maintain. Effective business decision making requires better information delivery. The snowflake schema in data warehouse design plays important roles in supporting business analytics. REFERENCES [1] Cody, W. F. Kreulen, and J. T. Krishna, V. Spangler, and W. S., The integration of business intelligence and knowledge management, IBM Systems Journal, vol. 41, no. 4, pp , 2002 [2] M. J. Liberatore and W. Luo, The analytics movement: Implications for operations research, Interfaces, vol. 40, no. 4, pp , 2010 [3] A. McAfee and E. Brynjolfsson, Big data: The management revolution, Harvard Business Review, vol. 90, no. 12, pp , October, 2012 [4] A. Sen and A. P. Sinha, A comparison of data warehouse development methodologies, Communications of the Association of Computing Machinery (ACM), vol. 48, no. 3, pp , [5] J. Wang, J. L. Kourik, and P. E. Maher, Identifying characteristics and roles of OLAP in business decision support systems, Journal of Business and Educational Leadership, vol. 3, no. 1, Fall, pp , 2011 Jiangping Wang is an associate professor of computer science at Webster University. He has a B.A. from Chongqing University, China, an M.S. from the University of Leeds, United Kingdom and a Ph.D. from the Missouri University of Science and Technology, Rolla, Missouri, USA. Dr. Wang's areas of teaching include database design, database applications, data warehousing, web databases, database in web services, and distributed application development. His areas of research include database management systems, decision support systems, business intelligence, e-commerce data processing, and software project management. Janet L. Kourik is a Professor in the Mathematics and Computer Science Department of Webster University in St. Louis, Missouri, US. She has a B.S.C.S. from Webster University, an M.A. from Webster University and a Ph.D. from Nova Southeastern University. Dr. Kourik's areas of teaching include database concepts and applications, information systems, operating systems, and distributed systems. Her areas of research include databases and analytics, agile methods, informatics, and computer science education J. Adv. Inf. Technol. 216
When to consider OLAP?
When to consider OLAP? Author: Prakash Kewalramani Organization: Evaltech, Inc. Evaltech Research Group, Data Warehousing Practice. Date: 03/10/08 Email: [email protected] Abstract: Do you need an OLAP
1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing
1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing 2. What is a Data warehouse a. A database application
Basics of Dimensional Modeling
Basics of Dimensional Modeling Data warehouse and OLAP tools are based on a dimensional data model. A dimensional model is based on dimensions, facts, cubes, and schemas such as star and snowflake. Dimensional
OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA
OLAP and OLTP AMIT KUMAR BINDAL Associate Professor Databases Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age Information, which is created by data,
Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing
Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1
Slide 29-1 Chapter 29 Overview of Data Warehousing and OLAP Chapter 29 Outline Purpose of Data Warehousing Introduction, Definitions, and Terminology Comparison with Traditional Databases Characteristics
3/17/2009. Knowledge Management BIKM eclassifier Integrated BIKM Tools
Paper by W. F. Cody J. T. Kreulen V. Krishna W. S. Spangler Presentation by Dylan Chi Discussion by Debojit Dhar THE INTEGRATION OF BUSINESS INTELLIGENCE AND KNOWLEDGE MANAGEMENT BUSINESS INTELLIGENCE
New Approach of Computing Data Cubes in Data Warehousing
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 14 (2014), pp. 1411-1417 International Research Publications House http://www. irphouse.com New Approach of
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Sterling Business Intelligence
Sterling Business Intelligence Concepts Guide Release 9.0 March 2010 Copyright 2009 Sterling Commerce, Inc. All rights reserved. Additional copyright information is located on the documentation library:
www.ijreat.org Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 28
Data Warehousing - Essential Element To Support Decision- Making Process In Industries Ashima Bhasin 1, Mr Manoj Kumar 2 1 Computer Science Engineering Department, 2 Associate Professor, CSE Abstract SGT
Chapter 5. Warehousing, Data Acquisition, Data. Visualization
Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives
DATA WAREHOUSING - OLAP
http://www.tutorialspoint.com/dwh/dwh_olap.htm DATA WAREHOUSING - OLAP Copyright tutorialspoint.com Online Analytical Processing Server OLAP is based on the multidimensional data model. It allows managers,
IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance
Data Sheet IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance Overview Multidimensional analysis is a powerful means of extracting maximum value from your corporate
OLAP. Business Intelligence OLAP definition & application Multidimensional data representation
OLAP Business Intelligence OLAP definition & application Multidimensional data representation 1 Business Intelligence Accompanying the growth in data warehousing is an ever-increasing demand by users for
Turkish Journal of Engineering, Science and Technology
Turkish Journal of Engineering, Science and Technology 03 (2014) 106-110 Turkish Journal of Engineering, Science and Technology journal homepage: www.tujest.com Integrating Data Warehouse with OLAP Server
Deductive Data Warehouses and Aggregate (Derived) Tables
Deductive Data Warehouses and Aggregate (Derived) Tables Kornelije Rabuzin, Mirko Malekovic, Mirko Cubrilo Faculty of Organization and Informatics University of Zagreb Varazdin, Croatia {kornelije.rabuzin,
University of Gaziantep, Department of Business Administration
University of Gaziantep, Department of Business Administration The extensive use of information technology enables organizations to collect huge amounts of data about almost every aspect of their businesses.
Dimensional Modeling for Data Warehouse
Modeling for Data Warehouse Umashanker Sharma, Anjana Gosain GGS, Indraprastha University, Delhi Abstract Many surveys indicate that a significant percentage of DWs fail to meet business objectives or
Fluency With Information Technology CSE100/IMT100
Fluency With Information Technology CSE100/IMT100 ),7 Larry Snyder & Mel Oyler, Instructors Ariel Kemp, Isaac Kunen, Gerome Miklau & Sean Squires, Teaching Assistants University of Washington, Autumn 1999
DATA WAREHOUSING AND OLAP TECHNOLOGY
DATA WAREHOUSING AND OLAP TECHNOLOGY Manya Sethi MCA Final Year Amity University, Uttar Pradesh Under Guidance of Ms. Shruti Nagpal Abstract DATA WAREHOUSING and Online Analytical Processing (OLAP) are
Data Warehousing Systems: Foundations and Architectures
Data Warehousing Systems: Foundations and Architectures Il-Yeol Song Drexel University, http://www.ischool.drexel.edu/faculty/song/ SYNONYMS None DEFINITION A data warehouse (DW) is an integrated repository
Data Warehousing: Data Models and OLAP operations. By Kishore Jaladi [email protected]
Data Warehousing: Data Models and OLAP operations By Kishore Jaladi [email protected] Topics Covered 1. Understanding the term Data Warehousing 2. Three-tier Decision Support Systems 3. Approaches
PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions. A Technical Whitepaper from Sybase, Inc.
PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions A Technical Whitepaper from Sybase, Inc. Table of Contents Section I: The Need for Data Warehouse Modeling.....................................4
CHAPTER 5: BUSINESS ANALYTICS
Chapter 5: Business Analytics CHAPTER 5: BUSINESS ANALYTICS Objectives The objectives are: Describe Business Analytics. Explain the terminology associated with Business Analytics. Describe the data warehouse
14. Data Warehousing & Data Mining
14. Data Warehousing & Data Mining Data Warehousing Concepts Decision support is key for companies wanting to turn their organizational data into an information asset Data Warehouse "A subject-oriented,
Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole
Paper BB-01 Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole ABSTRACT Stephen Overton, Overton Technologies, LLC, Raleigh, NC Business information can be consumed many
Designing a Dimensional Model
Designing a Dimensional Model Erik Veerman Atlanta MDF member SQL Server MVP, Microsoft MCT Mentor, Solid Quality Learning Definitions Data Warehousing A subject-oriented, integrated, time-variant, and
OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP
Data Warehousing and End-User Access Tools OLAP and Data Mining Accompanying growth in data warehouses is increasing demands for more powerful access tools providing advanced analytical capabilities. Key
CHAPTER SIX DATA. Business Intelligence. 2011 The McGraw-Hill Companies, All Rights Reserved
CHAPTER SIX DATA Business Intelligence 2011 The McGraw-Hill Companies, All Rights Reserved 2 CHAPTER OVERVIEW SECTION 6.1 Data, Information, Databases The Business Benefits of High-Quality Information
SQL Server 2012 Business Intelligence Boot Camp
SQL Server 2012 Business Intelligence Boot Camp Length: 5 Days Technology: Microsoft SQL Server 2012 Delivery Method: Instructor-led (classroom) About this Course Data warehousing is a solution organizations
Part 22. Data Warehousing
Part 22 Data Warehousing The Decision Support System (DSS) Tools to assist decision-making Used at all levels in the organization Sometimes focused on a single area Sometimes focused on a single problem
Optimizing Your Data Warehouse Design for Superior Performance
Optimizing Your Data Warehouse Design for Superior Performance Lester Knutsen, President and Principal Database Consultant Advanced DataTools Corporation Session 2100A The Problem The database is too complex
Business Intelligence: Effective Decision Making
Business Intelligence: Effective Decision Making Bellevue College Linda Rumans IT Instructor, Business Division Bellevue College [email protected] Current Status What do I do??? How do I increase
Week 3 lecture slides
Week 3 lecture slides Topics Data Warehouses Online Analytical Processing Introduction to Data Cubes Textbook reference: Chapter 3 Data Warehouses A data warehouse is a collection of data specifically
The strategic importance of OLAP and multidimensional analysis A COGNOS WHITE PAPER
The strategic importance of OLAP and multidimensional analysis A COGNOS WHITE PAPER While every attempt has been made to ensure that the information in this document is accurate and complete, some typographical
Data Warehouse design
Data Warehouse design Design of Enterprise Systems University of Pavia 21/11/2013-1- Data Warehouse design DATA PRESENTATION - 2- BI Reporting Success Factors BI platform success factors include: Performance
DATA CUBES E0 261. Jayant Haritsa Computer Science and Automation Indian Institute of Science. JAN 2014 Slide 1 DATA CUBES
E0 261 Jayant Haritsa Computer Science and Automation Indian Institute of Science JAN 2014 Slide 1 Introduction Increasingly, organizations are analyzing historical data to identify useful patterns and
www.ducenit.com Self-Service Business Intelligence: The hunt for real insights in hidden knowledge Whitepaper
Self-Service Business Intelligence: The hunt for real insights in hidden knowledge Whitepaper Shift in BI usage In this fast paced business environment, organizations need to make smarter and faster decisions
CHAPTER 3. Data Warehouses and OLAP
CHAPTER 3 Data Warehouses and OLAP 3.1 Data Warehouse 3.2 Differences between Operational Systems and Data Warehouses 3.3 A Multidimensional Data Model 3.4Stars, snowflakes and Fact Constellations: 3.5
B.Sc (Computer Science) Database Management Systems UNIT-V
1 B.Sc (Computer Science) Database Management Systems UNIT-V Business Intelligence? Business intelligence is a term used to describe a comprehensive cohesive and integrated set of tools and process used
The Benefits of Data Modeling in Business Intelligence
WHITE PAPER: THE BENEFITS OF DATA MODELING IN BUSINESS INTELLIGENCE The Benefits of Data Modeling in Business Intelligence DECEMBER 2008 Table of Contents Executive Summary 1 SECTION 1 2 Introduction 2
HYPERION MASTER DATA MANAGEMENT SOLUTIONS FOR IT
HYPERION MASTER DATA MANAGEMENT SOLUTIONS FOR IT POINT-AND-SYNC MASTER DATA MANAGEMENT 04.2005 Hyperion s new master data management solution provides a centralized, transparent process for managing critical
CHAPTER 4: BUSINESS ANALYTICS
Chapter 4: Business Analytics CHAPTER 4: BUSINESS ANALYTICS Objectives Introduction The objectives are: Describe Business Analytics Explain the terminology associated with Business Analytics Describe the
Monitoring Genebanks using Datamarts based in an Open Source Tool
Monitoring Genebanks using Datamarts based in an Open Source Tool April 10 th, 2008 Edwin Rojas Research Informatics Unit (RIU) International Potato Center (CIP) GPG2 Workshop 2008 Datamarts Motivation
Databases in Organizations
The following is an excerpt from a draft chapter of a new enterprise architecture text book that is currently under development entitled Enterprise Architecture: Principles and Practice by Brian Cameron
Presented by: Jose Chinchilla, MCITP
Presented by: Jose Chinchilla, MCITP Jose Chinchilla MCITP: Database Administrator, SQL Server 2008 MCITP: Business Intelligence SQL Server 2008 Customers & Partners Current Positions: President, Agile
Business Intelligence, Analytics & Reporting: Glossary of Terms
Business Intelligence, Analytics & Reporting: Glossary of Terms A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Ad-hoc analytics Ad-hoc analytics is the process by which a user can create a new report
Dimodelo Solutions Data Warehousing and Business Intelligence Concepts
Dimodelo Solutions Data Warehousing and Business Intelligence Concepts Copyright Dimodelo Solutions 2010. All Rights Reserved. No part of this document may be reproduced without written consent from the
Data Warehousing and Data Mining
Data Warehousing and Data Mining Part I: Data Warehousing Gao Cong [email protected] Slides adapted from Man Lung Yiu and Torben Bach Pedersen Course Structure Business intelligence: Extract knowledge
Business Intelligence for SUPRA. WHITE PAPER Cincom In-depth Analysis and Review
Business Intelligence for A Technical Overview WHITE PAPER Cincom In-depth Analysis and Review SIMPLIFICATION THROUGH INNOVATION Business Intelligence for A Technical Overview Table of Contents Complete
A Brief Tutorial on Database Queries, Data Mining, and OLAP
A Brief Tutorial on Database Queries, Data Mining, and OLAP Lutz Hamel Department of Computer Science and Statistics University of Rhode Island Tyler Hall Kingston, RI 02881 Tel: (401) 480-9499 Fax: (401)
<Insert Picture Here> Enhancing the Performance and Analytic Content of the Data Warehouse Using Oracle OLAP Option
Enhancing the Performance and Analytic Content of the Data Warehouse Using Oracle OLAP Option The following is intended to outline our general product direction. It is intended for
An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of
An Introduction to Data Warehousing An organization manages information in two dominant forms: operational systems of record and data warehouses. Operational systems are designed to support online transaction
Trivadis White Paper. Comparison of Data Modeling Methods for a Core Data Warehouse. Dani Schnider Adriano Martino Maren Eschermann
Trivadis White Paper Comparison of Data Modeling Methods for a Core Data Warehouse Dani Schnider Adriano Martino Maren Eschermann June 2014 Table of Contents 1. Introduction... 3 2. Aspects of Data Warehouse
CASE PROJECTS IN DATA WAREHOUSING AND DATA MINING
CASE PROJECTS IN DATA WAREHOUSING AND DATA MINING Mohammad A. Rob, University of Houston-Clear Lake, [email protected] Michael E. Ellis, University of Houston-Clear Lake, [email protected] ABSTRACT This paper
Unlock your data for fast insights: dimensionless modeling with in-memory column store. By Vadim Orlov
Unlock your data for fast insights: dimensionless modeling with in-memory column store By Vadim Orlov I. DIMENSIONAL MODEL Dimensional modeling (also known as star or snowflake schema) was pioneered by
Data Warehousing and OLAP Technology for Knowledge Discovery
542 Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories
Tutorials for Project on Building a Business Analytic Model Using Data Mining Tool and Data Warehouse and OLAP Cubes IST 734
Cleveland State University Tutorials for Project on Building a Business Analytic Model Using Data Mining Tool and Data Warehouse and OLAP Cubes IST 734 SS Chung 14 Build a Data Mining Model using Data
Information Package Design
Information Package Design an excerpt from the book Data Warehousing on the Internet: Accessing the Corporate Knowledgebase ISBN #1-8250-32857-9 by Tom Hammergren The following excerpt is provided to assist
CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University
CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University Given today s business environment, at times a corporate executive
The Oracle Enterprise Data Warehouse (EDW)
The Oracle Enterprise Data Warehouse (EDW) Daniel Tkach Introduction: Data Warehousing Today In today s information era, the volume of data in an enterprise grows rapidly. The decreasing costs of processing
Learning Objectives. Definition of OLAP Data cubes OLAP operations MDX OLAP servers
OLAP Learning Objectives Definition of OLAP Data cubes OLAP operations MDX OLAP servers 2 What is OLAP? OLAP has two immediate consequences: online part requires the answers of queries to be fast, the
How To Model Data For Business Intelligence (Bi)
WHITE PAPER: THE BENEFITS OF DATA MODELING IN BUSINESS INTELLIGENCE The Benefits of Data Modeling in Business Intelligence DECEMBER 2008 Table of Contents Executive Summary 1 SECTION 1 2 Introduction 2
Data Warehouse design
Data Warehouse design Design of Enterprise Systems University of Pavia 11/11/2013-1- Data Warehouse design DATA MODELLING - 2- Data Modelling Important premise Data warehouses typically reside on a RDBMS
Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers
60 Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative
CS2032 Data warehousing and Data Mining Unit II Page 1
UNIT II BUSINESS ANALYSIS Reporting Query tools and Applications The data warehouse is accessed using an end-user query and reporting tool from Business Objects. Business Objects provides several tools
Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data
INFO 1500 Introduction to IT Fundamentals 5. Database Systems and Managing Data Resources Learning Objectives 1. Describe how the problems of managing data resources in a traditional file environment are
OLAP and Data Warehousing! Introduction!
The image cannot be displayed. Your computer may not have enough memory to open the image, or the image may have been corrupted. Restart your computer, and then open the file again. If the red x still
Analytics with Excel and ARQUERY for Oracle OLAP
Analytics with Excel and ARQUERY for Oracle OLAP Data analytics gives you a powerful advantage in the business industry. Companies use expensive and complex Business Intelligence tools to analyze their
Data Warehousing Concepts
Data Warehousing Concepts JB Software and Consulting Inc 1333 McDermott Drive, Suite 200 Allen, TX 75013. [[[[[ DATA WAREHOUSING What is a Data Warehouse? Decision Support Systems (DSS), provides an analysis
Big Data Analytics with IBM Cognos BI Dynamic Query IBM Redbooks Solution Guide
Big Data Analytics with IBM Cognos BI Dynamic Query IBM Redbooks Solution Guide IBM Cognos Business Intelligence (BI) helps you make better and smarter business decisions faster. Advanced visualization
The Design and the Implementation of an HEALTH CARE STATISTICS DATA WAREHOUSE Dr. Sreèko Natek, assistant professor, Nova Vizija, srecko@vizija.
The Design and the Implementation of an HEALTH CARE STATISTICS DATA WAREHOUSE Dr. Sreèko Natek, assistant professor, Nova Vizija, [email protected] ABSTRACT Health Care Statistics on a state level is a
Tracking System for GPS Devices and Mining of Spatial Data
Tracking System for GPS Devices and Mining of Spatial Data AIDA ALISPAHIC, DZENANA DONKO Department for Computer Science and Informatics Faculty of Electrical Engineering, University of Sarajevo Zmaja
BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT
BUILDING BLOCKS OF DATAWAREHOUSE G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT 1 Data Warehouse Subject Oriented Organized around major subjects, such as customer, product, sales. Focusing on
Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach
2006 ISMA Conference 1 Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach Priya Lobo CFPS Satyam Computer Services Ltd. 69, Railway Parallel Road, Kumarapark West, Bangalore 560020,
MDM and Data Warehousing Complement Each Other
Master Management MDM and Warehousing Complement Each Other Greater business value from both 2011 IBM Corporation Executive Summary Master Management (MDM) and Warehousing (DW) complement each other There
THE TECHNOLOGY OF USING A DATA WAREHOUSE TO SUPPORT DECISION-MAKING IN HEALTH CARE
THE TECHNOLOGY OF USING A DATA WAREHOUSE TO SUPPORT DECISION-MAKING IN HEALTH CARE Dr. Osama E.Sheta 1 and Ahmed Nour Eldeen 2 1,2 Department of Mathematics (Computer Science) Faculty of Science, Zagazig
Data Warehousing and Decision Support. Introduction. Three Complementary Trends. Chapter 23, Part A
Data Warehousing and Decision Support Chapter 23, Part A Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke 1 Introduction Increasingly, organizations are analyzing current and historical
PREFACE INTRODUCTION MULTI-DIMENSIONAL MODEL. Chris Claterbos, Vlamis Software Solutions, Inc. [email protected]
BUILDING CUBES AND ANALYZING DATA USING ORACLE OLAP 11G Chris Claterbos, Vlamis Software Solutions, Inc. [email protected] PREFACE As of this writing, Oracle Business Intelligence and Oracle OLAP are
A Design and implementation of a data warehouse for research administration universities
A Design and implementation of a data warehouse for research administration universities André Flory 1, Pierre Soupirot 2, and Anne Tchounikine 3 1 CRI : Centre de Ressources Informatiques INSA de Lyon
Data Warehousing. Paper 133-25
Paper 133-25 The Power of Hybrid OLAP in a Multidimensional World Ann Weinberger, SAS Institute Inc., Cary, NC Matthias Ender, SAS Institute Inc., Cary, NC ABSTRACT Version 8 of the SAS System brings powerful
SAS BI Course Content; Introduction to DWH / BI Concepts
SAS BI Course Content; Introduction to DWH / BI Concepts SAS Web Report Studio 4.2 SAS EG 4.2 SAS Information Delivery Portal 4.2 SAS Data Integration Studio 4.2 SAS BI Dashboard 4.2 SAS Management Console
Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina
Data Warehousing Read chapter 13 of Riguzzi et al Sistemi Informativi Slides derived from those by Hector Garcia-Molina What is a Warehouse? Collection of diverse data subject oriented aimed at executive,
Data W a Ware r house house and and OLAP Week 5 1
Data Warehouse and OLAP Week 5 1 Midterm I Friday, March 4 Scope Homework assignments 1 4 Open book Team Homework Assignment #7 Read pp. 121 139, 146 150 of the text book. Do Examples 3.8, 3.10 and Exercise
<no narration for this slide>
1 2 The standard narration text is : After completing this lesson, you will be able to: < > SAP Visual Intelligence is our latest innovation
Developing Business Intelligence and Data Visualization Applications with Web Maps
Developing Business Intelligence and Data Visualization Applications with Web Maps Introduction Business Intelligence (BI) means different things to different organizations and users. BI often refers to
A Service-oriented Architecture for Business Intelligence
A Service-oriented Architecture for Business Intelligence Liya Wu 1, Gilad Barash 1, Claudio Bartolini 2 1 HP Software 2 HP Laboratories {[email protected]} Abstract Business intelligence is a business
ORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process
ORACLE OLAP KEY FEATURES AND BENEFITS FAST ANSWERS TO TOUGH QUESTIONS EASILY KEY FEATURES & BENEFITS World class analytic engine Superior query performance Simple SQL access to advanced analytics Enhanced
BUILDING OLAP TOOLS OVER LARGE DATABASES
BUILDING OLAP TOOLS OVER LARGE DATABASES Rui Oliveira, Jorge Bernardino ISEC Instituto Superior de Engenharia de Coimbra, Polytechnic Institute of Coimbra Quinta da Nora, Rua Pedro Nunes, P-3030-199 Coimbra,
The Benefits of Data Modeling in Data Warehousing
WHITE PAPER: THE BENEFITS OF DATA MODELING IN DATA WAREHOUSING The Benefits of Data Modeling in Data Warehousing NOVEMBER 2008 Table of Contents Executive Summary 1 SECTION 1 2 Introduction 2 SECTION 2
The IBM Cognos Platform
The IBM Cognos Platform Deliver complete, consistent, timely information to all your users, with cost-effective scale Highlights Reach all your information reliably and quickly Deliver a complete, consistent
Speeding ETL Processing in Data Warehouses White Paper
Speeding ETL Processing in Data Warehouses White Paper 020607dmxwpADM High-Performance Aggregations and Joins for Faster Data Warehouse Processing Data Processing Challenges... 1 Joins and Aggregates are
Business Intelligence Tutorial
IBM DB2 Universal Database Business Intelligence Tutorial Version 7 IBM DB2 Universal Database Business Intelligence Tutorial Version 7 Before using this information and the product it supports, be sure
