Chapter 3, Data Warehouse and OLAP Operations
|
|
- Cory Washington
- 8 years ago
- Views:
Transcription
1 CSI 4352, Introduction to Data Mining Chapter 3, Data Warehouse and OLAP Operations Young-Rae Cho Associate Professor Department of Computer Science Baylor University CSI 4352, Introduction to Data Mining Lecture 3, Data Warehouse & OLAP Operations Basic Concept of Data Warehouse Data Warehouse Modeling Data Warehouse Architecture Data Warehouse Implementation From Data Warehousing to Data Mining 1
2 What is Data Warehouse? Data Warehouse ( defined in many different ways ) A decision support database that is maintained separately from the organization s operational database The support of information processing by providing a solid platform of consolidated, historical data for analysis A data warehouse is a (1) subject-oriented, (2) integrated, (3) time-variant, and (4) nonvolatile collection of data in support of management s decision-making process. W. H. Inmon Data Warehousing The process of constructing and using data warehouses Data Warehouse Subject-Oriented Organized around major subjects e.g., customers, products, sales Focusing on the modeling and analysis of data for decision makers, not on daily operations or transaction processing Provide a simple and concise view around particular subject issues Excluding data that are not useful in the decision support process 2
3 Data Warehouse Integrated Integrating multiple, heterogeneous data sources Relational databases, flat files, on-line transaction records Apply data cleaning and data integration techniques Ensures consistency in naming conventions, encoding structures, attribute measures, etc. among different data sources Data Warehouse Time Variant The time horizon of data warehouses is significantly longer than that of operational systems Operational databases have current data values Data warehouses provide information from a historical perspective (e.g., years) Time is a key structure in data warehouses Contain the attribute of time (explicitly or implicitly) 3
4 Data Warehouse Nonvolatile A physically separate storage of data transformed from operational databases Operational update of data does not occur Not require transaction processing, recovery, and concurrency control mechanisms Require only two operations, initial loading of data and access of data Data Integration Methods Methods (1) Process to provide uniform interface to multiple data sources Tradition Database Integration (2) Process to combine multiple data sources into coherent storage Data warehousing Traditional DB Integration A query-driven approach Wrappers / mediators on top of heterogeneous data sources Data Warehousing An update-driven approach Combined the heterogeneous data sources in advance Stored them in a warehouse for direct query and analysis 4
5 OLTP vs. OLAP OLTP (on-line transaction processing) Major task of traditional relational DBMS Day-to-day operations: e.g., purchasing, inventory, manufacturing, banking, payroll, registration, accounting, etc. OLAP (on-line analytical processing) Major task of data warehouse system Data analysis and decision making Distinct Features (OLTP vs. OLAP) User and system orientation (customers vs. market analysts) Data contents (current, detailed vs. historical, consolidated) Database design (ER + application vs. star + subject) View (current, local vs. evolutionary, integrated) OLTP vs. OLAP Feature OLTP OLAP Characteristic operational processing information processing Orientation transaction analysis Users clerk, DBA knowledge worker (CEO, analyst) Function day-to-day operations decision support DB Design application-oriented subject-oriented Data current, up-to-date historical, integrated, summarized Unit of work short, simple transaction complex query Access read/write/update read-only (lots of scans) 5
6 Why Data Warehouse? Performance Issue DBMS: tuned for OLTP e.g., access methods, indexing, concurrency control, recovery Data warehouse: tuned for OLAP e.g., complex queries, multidimensional view, consolidation Data Issue Decision support requires historical data, consolidated and summarized data, consistent data CSI 4352, Introduction to Data Mining Lecture 3, Data Warehouse & OLAP Operations Basic Concept of Data Warehouse Data Warehouse Modeling Data Warehouse Architecture Data Warehouse Implementation From Data Warehousing to Data Mining 6
7 Data Format for Warehouse Dimensions Multi-dimensional data model Data are stored in the form of a data cube Data Cube A view of multi-dimensions Dimension tables, such as item (item_name, brand, type), or time (day, week, month, quarter, year) Fact table contains measures (such as dollars_sold) and keys to each of the related dimension tables Cuboid Each combination of dimensional spaces in a data cube 0-D cuboid, 1-D cuboid, 2-D cuboid,, n-d cuboid The Lattice of Cuboids all 0-D (apex) cuboid time item location supplier 1-D cuboids time,item time,location item,location location,supplier time,supplier item,supplier 2-D cuboids time,item,location time,location,supplier time,item,supplier item,location,supplier time, item, location, supplier 3-D cuboids 4-D (base) cuboid 7
8 Conceptual Modeling Key of Modeling Data Warehouses Handling dimensions & measures Examples Star schema: A fact table in the middle connected to a set of dimension tables Snowflake schema: A refinement of star schema where some dimensional hierarchy is normalized into a set of smaller dimension tables, forming a shape similar to snowflake Fact constellations: Multiple fact tables share dimension tables, viewed as a collection of stars, therefore called galaxy schema or fact constellation Example of Star Schema time time_key day day_of_the_week month quarter year branch branch_key branch_name branch_type Measures Sales Fact Table time_key item_key branch_key location_key units_sold dollars_sold avg_sales item item_key item_name brand type supplier_type location location_key street city state country 8
9 Example of Snowflake Schema time time_key day day_of_the_week month quarter year Sales Fact Table time_key item_key item item_key item_name brand type supplier_key supplier supplier_key supplier_type branch branch_key branch_name branch_type Measures branch_key location_key units_sold dollars_sold avg_sales location location_key street city_key city city_key city state country Example of Fact Constellation time time_key day day_of_the_week month quarter year Sales Fact Table time_key item_key item item_key item_name brand type supplier_type Shipping Fact Table time_key item_key shipper_key from_location branch branch_key branch_name branch_type Measures branch_key location_key units_sold dollars_sold avg_sales location location_key street city state country to_location dollars_cost units_shipped shipper shipper_key shipper_name location_key shipper_type 9
10 Cube Definition in DMQL Cube Definition (Fact Table) define cube <cube_name> [<dimension_list>]: <measure_list> Dimension Definition (Dimension Table) define dimension <dimension_name> as (<attribute_or_dimension_list>) Special Case (Shared Dimension Table) define dimension <dimension_name> as <dimension_name_first> in cube <cube_name_first> Star Schema Definition in DMQL Example define cube sales [time, item, branch, location]: dollars_sold = sum(sales_in_dollars), avg_sales = avg(sales_in_dollars), units_sold = count(*) define dimension time as (time_key, day, day_of_week, month, quarter, year) define dimension item as (item_key, item_name, brand, type, supplier_type) define dimension branch as (branch_key, branch_name, branch_type) define dimension location as (location_key, street, city, state, country) 10
11 Snowflake Schema Definition in DMQL Example define cube sales [time, item, branch, location]: dollars_sold = sum(sales_in_dollars), avg_sales = avg(sales_in_dollars), units_sold = count(*) define dimension time as (time_key, day, day_of_week, month, quarter, year) define dimension item as (item_key, item_name, brand, type, supplier(supplier_key, supplier_type)) define dimension branch as (branch_key, branch_name, branch_type) define dimension location as (location_key, street, city(city_key, province_or_state, country)) Fact Constellation Schema Definition in DMQL Example define cube sales [time, item, branch, location]: dollars_sold = sum(sales_in_dollars), avg_sales = avg(sales_in_dollars), units_sold = count(*) define dimension time as (time_key, day, day_of_week, month, quarter, year) define dimension item as (item_key, item_name, brand, type, supplier_type) define dimension branch as (branch_key, branch_name, branch_type) define dimension location as (location_key, street, city, state, country) define cube shipping [time, item, shipper, from_location, to_location]: dollar_cost = sum(cost_in_dollars), unit_shipped = count(*) define dimension time as time in cube sales define dimension item as item in cube sales define dimension shipper as (shipper_key, shipper_name, location as location in cube sales, shipper_type) define dimension from_location as location in cube sales define dimension to_location as location in cube sales 11
12 Measures Distributive If the result derived by applying the function to n aggregate values is the same as that derived by applying the function on all the data without partitioning e.g., count(), sum(), min(), max() Algebraic If it can be computed by an algebraic function with m arguments, each of which is obtained by applying a distributive aggregate function e.g., avg(), standard_deviation() Holistic If there is no constant bound on the storage size needed to describe a subaggregate e.g., median(), mode(), rank() Concept Hierarchy Schema Hierarchy e.g., street < city < state < country e.g., day < { month < quarter ; week } < year Year Quarter Week Month Day Set-group hierarchy e.g., { (0..100] ; ( ] } < (0..200] e.g., { (0..10] < lowprice ; { ( ] ; ( ] } < highprice } < allproducts allproducts (0..200] lowprice highprice (0..100] ( ] (0..10] ( ] ( ] 12
13 Three Components of Data Cube Example Measures: data values as a function of products, locations and time Dimensions: Hierarchies: product location time locations Company Category Product Country City Office Year Quarter Week Month Day time Example of Cuboid Cells time dimension product dimension all location dimension 0-D (apex) cuboid product quarter country product, quarter product, country quarter, country product, quarter, country 1-D cuboids 2-D cuboids 3-D (base) cuboid 13
14 Example of Data Cube TV PC VCR sum Quarter 1Qtr 2Qtr 3Qtr 4Qtr sum Total annual sales of TV in U.S.A. U.S.A Canada Mexico Country sum OLAP Operations Roll-up (drill-up) Summarizes (aggregates) data by climbing up hierarchy or by dimension reduction Drill-down (roll-down) Reverse of roll-up by stepping down to lower-level data or introducing new dimensions Slice Selecting data on one dimension Dice Selecting data on multi-dimensions Pivot (rotate) Reorienting the cube, or transforming 3-D data to a series of 2-D spaces 14
15 Roll-UP & Drill-Down location country city quarter time quarter country month Slice, Dice & Pivot location country supercom pc laptop mexico canada USA Q1 Q2 Q3 Q4 quarter time country country mexico pc laptop canada USA Q1 Q2 quarter canada USA Q1 Q2 Q3 Q4 quarter 15
16 Starnet Query Model Shipping Method air Orders contracts Customer customerid Time Location annually region country quarterly truck Each circle is called a footprint daily city order Promotion item sales-person division Product category division Organization CSI 4352, Introduction to Data Mining Lecture 3, Data Warehouse & OLAP Operations Basic Concept of Data Warehouse Data Warehouse Modeling Data Warehouse Architecture Data Warehouse Implementation From Data Warehousing to Data Mining 16
17 Data Warehouse Design Top-Down View Allows the selection of the relevant information necessary for the data warehouse Data Source View Exposes the information being captured, stored, and managed by operational systems Data Warehouse View Consists of fact tables and dimension tables Business Query View Shows the perspectives of data in the warehouse to end-users Data Warehouse Design Process Categories by Process Direction Top-down: Starts with overall design and planning (mature) Bottom-up: Starts with experiments and prototypes (rapid) Categories by Software Engineering View Waterfall: structured, systematic analysis at each step before proceeding to the next Spiral: rapid generation of functional systems, short turn around time Typical Data Warehouse Design Process Choose business processes for modeling, e.g., orders, invoices, etc Choose the grain (atomic level of data) of the business processes Choose the dimensions that will apply to each fact table Choose the measures that will populate each fact table 17
18 Data Warehouse Architecture Operational DBs Metadata Monitor & Integrator OLAP Server Other sources Extract Transform Load Refresh Data Warehouse Serve Query Analysis Reports Data Marts Data Sources Data Storage OLAP Engine Front-End Tools Three Data Warehouse Models Enterprise Warehouse A global view with all the information about subjects spanning the entire organization Data Mart A subset of corporate-wide data that is of value to a specific group of users Its scope is confined to specific, selected groups Independent vs. dependent (directly from warehouse) data mart Virtual Warehouse A set of views over operational databases Only some of possible summary views may be materialized 18
19 Development of Data Warehouse Multi-Tier Data Warehouse Distributed Data Marts Data Mart Data Mart Enterprise Data Warehouse Model Refinement Model Refinement Define a high-level corporate data model Utilities of Back-End Tools Data Extraction Get data from multiple, heterogeneous, and external sources Data Cleaning Detect errors in the data and rectify them when possible Data Transformation Convert data from the original format to the warehouse format Loading Sort, summarize, consolidate, compute views, check integrity, and build indices and partitions Refresh Propagate the updates from data sources to the warehouse 19
20 OLAP Server Architecture Relational OLAP (ROLAP) Uses relational or extended-relational DBMS to store and manage warehouse data and OLAP middle ware Includes optimization of DBMS back-end, implementation of aggregation navigation logic, and additional tools and services High scalability Multidimensional OLAP (MOLAP) Sparse array-based multidimensional storage engine Fast indexing to pre-computed summarized data Hybrid OLAP (HOLAP) Low-level: relational / high-level: array High flexibility Metadata Repository Definition of Metadata The data defining data warehouse objects Examples Description of the structure of the data warehouse, e.g., schema, view, dimensions, hierarchies, data definitions, data mart locations and contents Operational meta-data, e.g., history of migrated data, currency of data, warehouse usage statistics, error reports Algorithms used for summarization Mapping from operational environment to data warehouse Data related to system performance Business data, e.g., business terms and definitions, ownership of data, charging policies 20
21 CSI 4352, Introduction to Data Mining Lecture 3, Data Warehouse & OLAP Operations Basic Concept of Data Warehouse Data Warehouse Modeling Data Warehouse Architecture Data Warehouse Implementation From Data Warehousing to Data Mining Data Cube Computation View as a Lattice of Cuboids How many cuboids in an n-dimensional cube? n 2 How many cuboid cells in an n-dimensional cube with L i levels? product, n ( time L i 1) i 1 Materialization of Data Cube Full materialization (all cuboids), Partial materialization (some cuboids), No materialization (only base cuboid) Selection of cuboids to materialize Based on the size, sharing, access frequency, etc. all product time location product, location time, location product, time, location 21
22 Cube Operation Cube Definition and Computation in DMQL define cube sales [item, city, year]: sum (sales_in_dollars) compute cube sales Cube Definition and Computation in SQL select item, city, year, SUM (amount) from sales cube by item, city, year Internal Operations group by (item, city, year) group by (item, city), (item, year), (city, year) group by (item), (city), (year) group by () Iceberg Cube Iceberg Cube Computation Computing only the cuboid cells whose count or other aggregates satisfying the condition like HAVING COUNT(*) >= min_sup Motivation Only a small portion of cube cells may be above the water in a sparse cube Only calculate interesting cells data above certain threshold Avoid explosive growth of the cube 22
23 Indexing OLAP Data Bitmap Indexing Index on a particular column Each value in the column has a bit vector The length of bit vector is the number of records in the base table The i th bit is set if the i th row of the base table has the value Not suitable for high cardinality domains Base Table Index on Region Index on Type CusID Region Type RecID Asia Europe US RecID Retail Dealer C1 Asia Retail C2 Europe Dealer C3 Asia Dealer C4 US Retail C5 Europe Dealer Join Indexing Link the value of dimensions to rows in the fact table CSI 4352, Introduction to Data Mining Lecture 3, Data Warehouse & OLAP Operations Basic Concept of Data Warehouse Data Warehouse Modeling Data Warehouse Architecture Data Warehouse Implementation From Data Warehousing to Data Mining 23
24 Data Warehouse Usage Information Processing Supports querying, basic statistical analysis, and reporting using crosstabs, tables, charts and graphs Analytical Processing Supports OLAP operations in multi-dimensional space Data Mining Supports pattern discovery from warehouse data, and presenting the mining results using visualization tools From OLAP To OLAM On-Line Analytical Mining (OLAM) ( OLAP + Data Mining ) in data warehouse Why OLAM? High quality data Data warehouse contains integrated, consistent and cleaned data Information processing infrastructure ODBC/OLE DB connections, web accessing, service facilities OLAP-based exploratory data analysis Mining with drilling, dicing, pivoting, etc. On-line selection of data mining functions Integration and swapping of multiple data mining functions 24
25 OLAM System Architecture mining query User GUI API mining result Layer4 User Interface OLAM Engine OLAP Engine Layer3 OLAP/OLAM Data Cube API Data Warehouse Meta Data Layer2 MDDB Database API data cleaning data integration Layer1 Databases Data Repository Questions? References Gray, J., et al., Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab and Sub-Totals, Data Mining and Knowledge Discovery, Vol. 1 (1997) Lecture Slides are found on the Course Website, 25
Data Warehousing and OLAP Technology
Data Warehousing and OLAP Technology 1. Objectives... 3 2. What is Data Warehouse?... 4 2.1. Definitions... 4 2.2. Data Warehouse Subject-Oriented... 5 2.3. Data Warehouse Integrated... 5 2.4. Data Warehouse
More informationData Warehouse. MIT-652 Data Mining Applications. Thimaporn Phetkaew. School of Informatics, Walailak University. MIT-652: DM 2: Data Warehouse 1
Data Warehouse MIT-652 Data Mining Applications Thimaporn Phetkaew School of Informatics, Walailak University MIT-652: DM 2: Data Warehouse 1 Chapter 2: Data Warehousing and OLAP Technology for Data Mining
More informationData warehousing. Han, J. and M. Kamber. Data Mining: Concepts and Techniques. 2001. Morgan Kaufmann.
Data warehousing Han, J. and M. Kamber. Data Mining: Concepts and Techniques. 2001. Morgan Kaufmann. KDD process Application Pattern Evaluation Data Mining Task-relevant Data Data Warehouse Selection Data
More informationData W a Ware r house house and and OLAP Week 5 1
Data Warehouse and OLAP Week 5 1 Midterm I Friday, March 4 Scope Homework assignments 1 4 Open book Team Homework Assignment #7 Read pp. 121 139, 146 150 of the text book. Do Examples 3.8, 3.10 and Exercise
More informationData Warehousing & OLAP
Data Warehousing & OLAP Data Mining: Concepts and Techniques Chapter 3 Jiawei Han and An Introduction to Database Systems C.J.Date, Eighth Eddition, Addidon Wesley, 4 1 What is Data Warehousing? What is
More informationData Mining for Knowledge Management. Data Warehouses
1 Data Mining for Knowledge Management Data Warehouses Themis Palpanas University of Trento http://disi.unitn.eu/~themis Data Mining for Knowledge Management 1 Thanks for slides to: Jiawei Han Niarcas
More informationWWW.VIDYARTHIPLUS.COM
4.1 Data Warehousing Components What is Data Warehouse? - Defined in many different ways but mainly it is: o A decision support database that is maintained separately from the organization s operational
More informationDatabase Applications. Advanced Querying. Transaction Processing. Transaction Processing. Data Warehouse. Decision Support. Transaction processing
Database Applications Advanced Querying Transaction processing Online setting Supports day-to-day operation of business OLAP Data Warehousing Decision support Offline setting Strategic planning (statistics)
More informationLecture 2 Data warehousing
King Saud University College of Computer & Information Sciences IS 466 Decision Support Systems Lecture 2 Data warehousing Dr. Mourad YKHLEF The slides content is derived and adopted from many references
More informationOverview of Data Warehousing and OLAP
Overview of Data Warehousing and OLAP Chapter 28 March 24, 2008 ADBS: DW 1 Chapter Outline What is a data warehouse (DW) Conceptual structure of DW Why separate DW Data modeling for DW Online Analytical
More informationData Warehousing and Online Analytical Processing
Contents 4 Data Warehousing and Online Analytical Processing 3 4.1 Data Warehouse: Basic Concepts.................. 4 4.1.1 What is a Data Warehouse?................. 4 4.1.2 Differences between Operational
More informationDATA WAREHOUSING AND OLAP TECHNOLOGY
DATA WAREHOUSING AND OLAP TECHNOLOGY Manya Sethi MCA Final Year Amity University, Uttar Pradesh Under Guidance of Ms. Shruti Nagpal Abstract DATA WAREHOUSING and Online Analytical Processing (OLAP) are
More informationBUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT
BUILDING BLOCKS OF DATAWAREHOUSE G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT 1 Data Warehouse Subject Oriented Organized around major subjects, such as customer, product, sales. Focusing on
More informationTIES443. Lecture 3: Data Warehousing. Lecture 3. Data Warehousing. Course webpage: http://www.cs.jyu.fi/~mpechen/ties443.
TIES443 Lecture 3 Data Warehousing Mykola Pechenizkiy Course webpage: http://www.cs.jyu.fi/~mpechen/ties443 Department of Mathematical Information Technology University of Jyväskylä November 3, 2006 1
More information2 Data Warehouse and OLAP Technology for Data Mining 3. 2.1 What is a data warehouse?... 3. 2.2 Amultidimensional data model... 6
Contents 2 Data Warehouse and OLAP Technology for Data Mining 3 2.1 What is a data warehouse?... 3 2.2 Amultidimensional data model.... 6 2.2.1 From tables and spreadsheets to data cubes....... 6 2.2.2
More informationData Mining. Session 4 Main Theme Data Warehousing and OLAP. Dr. Jean-Claude Franchitti
Data Mining Session 4 Main Theme Data Warehousing and OLAP Dr. Jean-Claude Franchitti New York University Computer Science Department Courant Institute of Mathematical Sciences Adapted from course textbook
More informationThis tutorial will help computer science graduates to understand the basic-toadvanced concepts related to data warehousing.
About the Tutorial A data warehouse is constructed by integrating data from multiple heterogeneous sources. It supports analytical reporting, structured and/or ad hoc queries and decision making. This
More informationBuilding Data Cubes and Mining Them. Jelena Jovanovic Email: jeljov@fon.bg.ac.yu
Building Data Cubes and Mining Them Jelena Jovanovic Email: jeljov@fon.bg.ac.yu KDD Process KDD is an overall process of discovering useful knowledge from data. Data mining is a particular step in the
More informationDATA WAREHOUSING - OLAP
http://www.tutorialspoint.com/dwh/dwh_olap.htm DATA WAREHOUSING - OLAP Copyright tutorialspoint.com Online Analytical Processing Server OLAP is based on the multidimensional data model. It allows managers,
More informationData Warehousing & OLAP
Data Warehousing & OLAP Data Mining: Concepts and Techniques Chapter 3 Jiawei Han and An Introduction to Database Systems C.J.Date, Eighth Eddition, Addidon Wesley, 2004 1 What is Data Warehousing? What
More informationData Warehousing and elements of Data Mining
Data Warehousing and elements of Data Mining prof. e-mail: maurizio.pighin@uniud.it Dipartimento di Matematica e Informatica Università di Udine - Italy Motivation: Necessity is the Mother of Invention
More informationCopyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1
Slide 29-1 Chapter 29 Overview of Data Warehousing and OLAP Chapter 29 Outline Purpose of Data Warehousing Introduction, Definitions, and Terminology Comparison with Traditional Databases Characteristics
More informationData W a Ware r house house and and OLAP II Week 6 1
Data Warehouse and OLAP II Week 6 1 Team Homework Assignment #8 Using a data warehousing tool and a data set, play four OLAP operations (Roll up (drill up), Drill down (roll down), Slice and dice, Pivot
More informationLearning Objectives. Definition of OLAP Data cubes OLAP operations MDX OLAP servers
OLAP Learning Objectives Definition of OLAP Data cubes OLAP operations MDX OLAP servers 2 What is OLAP? OLAP has two immediate consequences: online part requires the answers of queries to be fast, the
More informationData Warehousing & OLAP
Data Warehousing & OLAP Motivation: Business Intelligence Customer information (customer-id, gender, age, homeaddress, occupation, income, family-size, ) Product information (Product-id, category, manufacturer,
More informationPart 22. Data Warehousing
Part 22 Data Warehousing The Decision Support System (DSS) Tools to assist decision-making Used at all levels in the organization Sometimes focused on a single area Sometimes focused on a single problem
More informationOLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA
OLAP and OLTP AMIT KUMAR BINDAL Associate Professor Databases Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age Information, which is created by data,
More informationData Warehousing and OLAP. t.calders@tue.nl
Data Warehousing and OLAP Toon Calders Toon Calders t.calders@tue.nl Motivation «Traditional» relational databases are geared towards online transaction processing: bank terminal flight reservations student
More informationWeek 13: Data Warehousing. Warehousing
1 Week 13: Data Warehousing Warehousing Growing industry: $8 billion in 1998 Range from desktop to huge: Walmart: 900-CPU, 2,700 disk, 23TB Teradata system Lots of buzzwords, hype slice & dice, rollup,
More informationData Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina
Data Warehousing Read chapter 13 of Riguzzi et al Sistemi Informativi Slides derived from those by Hector Garcia-Molina What is a Warehouse? Collection of diverse data subject oriented aimed at executive,
More informationCHAPTER 4 Data Warehouse Architecture
CHAPTER 4 Data Warehouse Architecture 4.1 Data Warehouse Architecture 4.2 Three-tier data warehouse architecture 4.3 Types of OLAP servers: ROLAP versus MOLAP versus HOLAP 4.4 Further development of Data
More informationOutline. Data Warehousing. What is a Warehouse? What is a Warehouse?
Outline Data Warehousing What is a data warehouse? Why a warehouse? Models & operations Implementing a warehouse 2 What is a Warehouse? Collection of diverse data subject oriented aimed at executive, decision
More informationIntroduction to Data Warehousing. Ms Swapnil Shrivastava swapnil@konark.ncst.ernet.in
Introduction to Data Warehousing Ms Swapnil Shrivastava swapnil@konark.ncst.ernet.in Necessity is the mother of invention Why Data Warehouse? Scenario 1 ABC Pvt Ltd is a company with branches at Mumbai,
More informationData Warehousing & OLAP
Data Warehousing & OLAP What is Data Warehouse? A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of data in support of management s decisionmaking process. W.
More informationWeek 3 lecture slides
Week 3 lecture slides Topics Data Warehouses Online Analytical Processing Introduction to Data Cubes Textbook reference: Chapter 3 Data Warehouses A data warehouse is a collection of data specifically
More information14. Data Warehousing & Data Mining
14. Data Warehousing & Data Mining Data Warehousing Concepts Decision support is key for companies wanting to turn their organizational data into an information asset Data Warehouse "A subject-oriented,
More informationA Technical Review on On-Line Analytical Processing (OLAP)
A Technical Review on On-Line Analytical Processing (OLAP) K. Jayapriya 1., E. Girija 2,III-M.C.A., R.Uma. 3,M.C.A.,M.Phil., Department of computer applications, Assit.Prof,Dept of M.C.A, Dhanalakshmi
More informationDATA WAREHOUSING APPLICATIONS: AN ANALYTICAL TOOL FOR DECISION SUPPORT SYSTEM
DATA WAREHOUSING APPLICATIONS: AN ANALYTICAL TOOL FOR DECISION SUPPORT SYSTEM MOHAMMED SHAFEEQ AHMED Guest Lecturer, Department of Computer Science, Gulbarga University, Gulbarga, Karnataka, India (e-mail:
More informationData Warehousing and OLAP
1 Data Warehousing and OLAP Hector Garcia-Molina Stanford University Warehousing Growing industry: $8 billion in 1998 Range from desktop to huge: Walmart: 900-CPU, 2,700 disk, 23TB Teradata system Lots
More informationLecture Data Warehouse Systems
Lecture Data Warehouse Systems Eva Zangerle SS 2013 PART A: Architecture Chapter 1: Motivation and Definitions Motivation Goal: to build an operational general view on a company to support decisions in
More informationCHAPTER 3. Data Warehouses and OLAP
CHAPTER 3 Data Warehouses and OLAP 3.1 Data Warehouse 3.2 Differences between Operational Systems and Data Warehouses 3.3 A Multidimensional Data Model 3.4Stars, snowflakes and Fact Constellations: 3.5
More informationwww.ijreat.org Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 28
Data Warehousing - Essential Element To Support Decision- Making Process In Industries Ashima Bhasin 1, Mr Manoj Kumar 2 1 Computer Science Engineering Department, 2 Associate Professor, CSE Abstract SGT
More informationData Warehousing and OLAP Technology for Knowledge Discovery
542 Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories
More informationOLAP & DATA MINING CS561-SPRING 2012 WPI, MOHAMED ELTABAKH
OLAP & DATA MINING CS561-SPRING 2012 WPI, MOHAMED ELTABAKH 1 Online Analytic Processing OLAP 2 OLAP OLAP: Online Analytic Processing OLAP queries are complex queries that Touch large amounts of data Discover
More informationCSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2009 Lecture 15 - Data Warehousing: Cubes
CSE 544 Principles of Database Management Systems Magdalena Balazinska Winter 2009 Lecture 15 - Data Warehousing: Cubes Final Exam Overview Open books and open notes No laptops and no other mobile devices
More information1960s 1970s 1980s 1990s. Slow access to
Principles of Knowledge Discovery in Fall 2002 Chapter 2: Warehousing and Dr. Osmar R. Zaïane University of Alberta Dr. Osmar R. Zaïane, 1999-2002 Principles of Knowledge Discovery in University of Alberta
More informationAnwendersoftware Anwendungssoftwares a. Data-Warehouse-, Data-Mining- and OLAP-Technologies. Online Analytic Processing
Anwendungssoftwares a Data-Warehouse-, Data-Mining- and OLAP-Technologies Online Analytic Processing Online Analytic Processing OLAP Online Analytic Processing Technologies and tools that support (ad-hoc)
More information1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing
1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing 2. What is a Data warehouse a. A database application
More informationOLAP Systems and Multidimensional Expressions I
OLAP Systems and Multidimensional Expressions I Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master
More informationCSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing
CSE 544 Principles of Database Management Systems Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing Class Projects Class projects are going very well! Project presentations: 15 minutes On Wednesday
More informationLection 3-4 WAREHOUSING
Lection 3-4 DATA WAREHOUSING Learning Objectives Understand d the basic definitions iti and concepts of data warehouses Understand data warehousing architectures Describe the processes used in developing
More informationData Warehousing Systems: Foundations and Architectures
Data Warehousing Systems: Foundations and Architectures Il-Yeol Song Drexel University, http://www.ischool.drexel.edu/faculty/song/ SYNONYMS None DEFINITION A data warehouse (DW) is an integrated repository
More informationMulti-dimensional index structures Part I: motivation
Multi-dimensional index structures Part I: motivation 144 Motivation: Data Warehouse A definition A data warehouse is a repository of integrated enterprise data. A data warehouse is used specifically for
More information(Week 10) A04. Information System for CRM. Electronic Commerce Marketing
(Week 10) A04. Information System for CRM Electronic Commerce Marketing Course Code: 166186-01 Course Name: Electronic Commerce Marketing Period: Autumn 2015 Lecturer: Prof. Dr. Sync Sangwon Lee Department:
More information2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000
2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000 Introduction This course provides students with the knowledge and skills necessary to design, implement, and deploy OLAP
More informationData Warehousing and Decision Support. Introduction. Three Complementary Trends. Chapter 23, Part A
Data Warehousing and Decision Support Chapter 23, Part A Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke 1 Introduction Increasingly, organizations are analyzing current and historical
More informationLecture 2: Introduction to Business Intelligence. Introduction to Business Intelligence
TIES443 Lecture 2 Introduction to Business Intelligence Mykola Pechenizkiy Course webpage: http://www.cs.jyu.fi/~mpechen/ties443 November 2, 2006 Department of Mathematical Information Technology University
More informationDATA CUBES E0 261. Jayant Haritsa Computer Science and Automation Indian Institute of Science. JAN 2014 Slide 1 DATA CUBES
E0 261 Jayant Haritsa Computer Science and Automation Indian Institute of Science JAN 2014 Slide 1 Introduction Increasingly, organizations are analyzing historical data to identify useful patterns and
More informationData Warehousing: Data Models and OLAP operations. By Kishore Jaladi kishorejaladi@yahoo.com
Data Warehousing: Data Models and OLAP operations By Kishore Jaladi kishorejaladi@yahoo.com Topics Covered 1. Understanding the term Data Warehousing 2. Three-tier Decision Support Systems 3. Approaches
More informationData Warehousing. Outline. From OLTP to the Data Warehouse. Overview of data warehousing Dimensional Modeling Online Analytical Processing
Data Warehousing Outline Overview of data warehousing Dimensional Modeling Online Analytical Processing From OLTP to the Data Warehouse Traditionally, database systems stored data relevant to current business
More informationFluency With Information Technology CSE100/IMT100
Fluency With Information Technology CSE100/IMT100 ),7 Larry Snyder & Mel Oyler, Instructors Ariel Kemp, Isaac Kunen, Gerome Miklau & Sean Squires, Teaching Assistants University of Washington, Autumn 1999
More informationIST722 Data Warehousing
IST722 Data Warehousing Components of the Data Warehouse Michael A. Fudge, Jr. Recall: Inmon s CIF The CIF is a reference architecture Understanding the Diagram The CIF is a reference architecture CIF
More informationOverview. Data Warehousing and Decision Support. Introduction. Three Complementary Trends. Data Warehousing. An Example: The Store (e.g.
Overview Data Warehousing and Decision Support Chapter 25 Why data warehousing and decision support Data warehousing and the so called star schema MOLAP versus ROLAP OLAP, ROLLUP AND CUBE queries Design
More informationAn Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies
An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies Ashish Gahlot, Manoj Yadav Dronacharya college of engineering Farrukhnagar, Gurgaon,Haryana Abstract- Data warehousing, Data Mining,
More informationA Critical Review of Data Warehouse
Global Journal of Business Management and Information Technology. Volume 1, Number 2 (2011), pp. 95-103 Research India Publications http://www.ripublication.com A Critical Review of Data Warehouse Sachin
More informationNew Approach of Computing Data Cubes in Data Warehousing
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 14 (2014), pp. 1411-1417 International Research Publications House http://www. irphouse.com New Approach of
More informationOLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP
Data Warehousing and End-User Access Tools OLAP and Data Mining Accompanying growth in data warehouses is increasing demands for more powerful access tools providing advanced analytical capabilities. Key
More informationBasics of Dimensional Modeling
Basics of Dimensional Modeling Data warehouse and OLAP tools are based on a dimensional data model. A dimensional model is based on dimensions, facts, cubes, and schemas such as star and snowflake. Dimensional
More informationOLAP. Business Intelligence OLAP definition & application Multidimensional data representation
OLAP Business Intelligence OLAP definition & application Multidimensional data representation 1 Business Intelligence Accompanying the growth in data warehousing is an ever-increasing demand by users for
More informationData Warehouse: Introduction
Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,
More informationIntegrate multiple, heterogeneous data sources. Data cleaning and data integration techniques are applied
Objectives Motivation: Why data warehouse? What is a data warehouse? Whyy separate p DW? Conceptual modeling of DW Data Mart Data Warehousing Architectures Data Warehouse Development Data Warehouse Vendors
More informationLITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES
LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES MUHAMMAD KHALEEL (0912125) SZABIST KARACHI CAMPUS Abstract. Data warehouse and online analytical processing (OLAP) both are core component for decision
More informationModule 1: Introduction to Data Warehousing and OLAP
Raw Data vs. Business Information Module 1: Introduction to Data Warehousing and OLAP Capturing Raw Data Gathering data recorded in everyday operations Deriving Business Information Deriving meaningful
More informationB.Sc (Computer Science) Database Management Systems UNIT-V
1 B.Sc (Computer Science) Database Management Systems UNIT-V Business Intelligence? Business intelligence is a term used to describe a comprehensive cohesive and integrated set of tools and process used
More informationChapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
More informationData Warehouse Technology And The MSD Databases
Data Warehouse Technology And The MSD Databases Philip McNeil Data Warehouses The MSD Databases Populating & using the Search Database Data Warehouses What is a Data Warehouse? A subject-oriented, integrated,
More informationDecision Support. Chapter 23. Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke 1
Decision Support Chapter 23 Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke 1 Introduction Increasingly, organizations are analyzing current and historical data to identify useful
More informationDesigning a Dimensional Model
Designing a Dimensional Model Erik Veerman Atlanta MDF member SQL Server MVP, Microsoft MCT Mentor, Solid Quality Learning Definitions Data Warehousing A subject-oriented, integrated, time-variant, and
More informationCourse 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing
More informationMario Guarracino. Data warehousing
Data warehousing Introduction Since the mid-nineties, it became clear that the databases for analysis and business intelligence need to be separate from operational. In this lecture we will review the
More informationDimensional Modeling for Data Warehouse
Modeling for Data Warehouse Umashanker Sharma, Anjana Gosain GGS, Indraprastha University, Delhi Abstract Many surveys indicate that a significant percentage of DWs fail to meet business objectives or
More informationUnit -3. Learning Objective. Demand for Online analytical processing Major features and functions OLAP models and implementation considerations
Unit -3 Learning Objective Demand for Online analytical processing Major features and functions OLAP models and implementation considerations Demand of On Line Analytical Processing Need for multidimensional
More informationBUSINESS ANALYTICS AND DATA VISUALIZATION. ITM-761 Business Intelligence ดร. สล ล บ ญพราหมณ
1 BUSINESS ANALYTICS AND DATA VISUALIZATION ITM-761 Business Intelligence ดร. สล ล บ ญพราหมณ 2 การท าความด น น ยากและเห นผลช า แต ก จ าเป นต องท า เพราะหาไม ความช วซ งท าได ง ายจะเข ามาแทนท และจะพอกพ นข
More informationData Warehousing. Paper 133-25
Paper 133-25 The Power of Hybrid OLAP in a Multidimensional World Ann Weinberger, SAS Institute Inc., Cary, NC Matthias Ender, SAS Institute Inc., Cary, NC ABSTRACT Version 8 of the SAS System brings powerful
More informationThe Study on Data Warehouse Design and Usage
International Journal of Scientific and Research Publications, Volume 3, Issue 3, March 2013 1 The Study on Data Warehouse Design and Usage Mr. Dishek Mankad 1, Mr. Preyash Dholakia 2 1 M.C.A., B.R.Patel
More informationPowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions. A Technical Whitepaper from Sybase, Inc.
PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions A Technical Whitepaper from Sybase, Inc. Table of Contents Section I: The Need for Data Warehouse Modeling.....................................4
More informationAdvanced Data Management Technologies
ADMT 2015/16 Unit 2 J. Gamper 1/44 Advanced Data Management Technologies Unit 2 Basic Concepts of BI and Data Warehousing J. Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Acknowledgements:
More informationChapter 5. Warehousing, Data Acquisition, Data. Visualization
Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives
More informationOLAP and Data Warehousing! Introduction!
The image cannot be displayed. Your computer may not have enough memory to open the image, or the image may have been corrupted. Restart your computer, and then open the file again. If the red x still
More informationWhen to consider OLAP?
When to consider OLAP? Author: Prakash Kewalramani Organization: Evaltech, Inc. Evaltech Research Group, Data Warehousing Practice. Date: 03/10/08 Email: erg@evaltech.com Abstract: Do you need an OLAP
More informationCS2032 Data warehousing and Data Mining Unit II Page 1
UNIT II BUSINESS ANALYSIS Reporting Query tools and Applications The data warehouse is accessed using an end-user query and reporting tool from Business Objects. Business Objects provides several tools
More informationData Warehousing, OLAP, and Data Mining
Data Warehousing, OLAP, and Marek Rychly mrychly@strathmore.edu Strathmore University, @ilabafrica & Brno University of Technology, Faculty of Information Technology Advanced Databases and Enterprise Systems
More informationChapter 7 Multidimensional Data Modeling (MDDM)
Chapter 7 Multidimensional Data Modeling (MDDM) Fundamentals of Business Analytics Learning Objectives and Learning Outcomes Learning Objectives 1. To assess the capabilities of OLTP and OLAP systems 2.
More informationIT0457 Data Warehousing. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT
IT0457 Data Warehousing G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT Outline What is data warehousing The benefit of data warehousing Differences between OLTP and data warehousing The architecture
More informationTurkish Journal of Engineering, Science and Technology
Turkish Journal of Engineering, Science and Technology 03 (2014) 106-110 Turkish Journal of Engineering, Science and Technology journal homepage: www.tujest.com Integrating Data Warehouse with OLAP Server
More informationDATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS
DATA WAREHOUSE CONCEPTS A fundamental concept of a data warehouse is the distinction between data and information. Data is composed of observable and recordable facts that are often found in operational
More informationUNIT-3 OLAP in Data Warehouse
UNIT-3 OLAP in Data Warehouse Bharati Vidyapeeth s Institute of Computer Applications and Management, New Delhi-63, by Dr.Deepali Kamthania U2.1 OLAP Demand for Online analytical processing Major features
More informationData Warehousing and Data Mining
Data Warehousing and Data Mining Part I: Data Warehousing Gao Cong gaocong@cs.aau.dk Slides adapted from Man Lung Yiu and Torben Bach Pedersen Course Structure Business intelligence: Extract knowledge
More informationDATA WAREHOUSE E KNOWLEDGE DISCOVERY
DATA WAREHOUSE E KNOWLEDGE DISCOVERY Prof. Fabio A. Schreiber Dipartimento di Elettronica e Informazione Politecnico di Milano DATA WAREHOUSE (DW) A TECHNIQUE FOR CORRECTLY ASSEMBLING AND MANAGING DATA
More informationM2074 - Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000 5 Day Course
Module 1: Introduction to Data Warehousing and OLAP Introducing Data Warehousing Defining OLAP Solutions Understanding Data Warehouse Design Understanding OLAP Models Applying OLAP Cubes At the end of
More information