Morteza Zaker ( Student ID : ) [email protected]
Data Warehouse Design Considerations

1. Introduction

A data warehouse (DW) is a database containing data from multiple operational systems that has been consolidated, integrated, aggregated and structured to support business analysis and the decision-making process [Inmon, 2005, Kimball, 2002]. Data warehousing is an art on the one hand and a science on the other. Designing a DW is like building a bridge, which should be designed and constructed following both civil engineering and architectural principles. The absence of either the science or the art behind this construction will result in the collapse of the bridge. Just as being a bricklayer does not necessarily mean that one can also be an architect, mere knowledge of database technology does not entail expertise in data warehousing. Indeed, a DW requires the simultaneous application of a number of technologies, combined through a body of principles called architecture. This is why there is consensus among experts that managing a Data Warehouse System (DWs) is an extremely challenging task [Rainardi, 2007, Shin et al, 2006].

Owing to the ever-increasing volume of data accumulated from the internet and other sources, and to the exponential growth of DWs in enterprises, companies face very large bodies of data, which makes it more and more vital for them to control their DW wisely. In such situations a sophisticated design is necessary to ensure high data integration, high performance and ease of maintenance. As such, issues related to designing DWs with high performance are gaining importance. Thus, the contention of the present research is that DW design is an art that should be based on the decisions and skills of an expert. Accordingly, an advanced database design such as a DW becomes more of a challenge as the applications of that database become more sophisticated and refined [March et al, 2007].
DWs have two significant architectures, namely the data flow architecture and the system architecture. The data flow architecture can play an important role in enhancing DW performance. It is a configuration of data stores, along with the arrangement of how the data flow from the source systems through these data stores to the applications used by the end users. This includes how the data are controlled, logged and monitored, as well as the mechanism to ensure the quality of the data in the data stores [Rainardi, 2007]. Data flow architecture consists of many parts, such as the data model, physical design, ETL, data quality and metadata, to name but a few. Indexing techniques (physical design) and data architecture (logical design) are two of its main components [Rainardi, 2007].

Data architecture is different from data flow architecture. The activity that produces the data architecture is known as data modeling; it covers how the data are arranged in each data store and how a data store is designed to reflect the business processes. In addition to data architecture, another component that can significantly improve the query and loading performance of a DW is indexing. Building indexes on a database has an important impact on query performance, especially in huge databases such as DWs, where the queries are usually very complex and ad hoc [Rainardi, 2007, Inmon, 2005, Kimball, 2002, O'Neil, 1997, O'Neil et al, 1995]. Inasmuch as the characteristics of a very large historical database are considerably
different from those of common transaction processing systems, the proper choice of index structures will enhance the query performance in DWs.

2. Problems and Objectives

In this section, data modeling and indexing techniques are discussed as two effective subcategories of the data flow architecture of a DW. These two components of DW design can optimize its performance, which in turn benefits DW analysts and designers alike.

2-1 Data Modeling

Normalization is a technique first described by Codd [Codd, 1972] and elaborated by Date [Date, 1997] for determining the optimal logical design. It simplifies the relational design of an integrated database by grouping entities so as to improve storage operations. Most implementations for business applications are based on the relational model, which has its own limitations, one of which is the complexity of access paths in the physical storage structures of a relational database [Dadashzadeh, 1989, 2005]. What is more, the relational model makes no reference to any physical storage structure; it provides a logical view that is fully independent of the physical organization of the data. Finally, although normal forms have many advantages and are considered the rule for relational database design, they can lead to low system performance [Date, 1997, 2003, Inmon, 1987; 1989].

Date [Date, 2003] supports the view that denormalization speeds up data retrieval. Nevertheless, denormalization is a disadvantage in systems that require frequent updates. Updates are unusual in DW applications, since data warehouses involve comparatively few data updates, and most transactions in a DW retrieve data [Kimball et al, 2002]. Consequently, denormalization strategies are well suited to a data warehouse system because updates are infrequent.
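The retrieval/update trade-off described above can be illustrated with a minimal sketch. It uses Python's built-in sqlite3 module rather than the Oracle environment of the study, and the table names (category, product, product_prejoined) and rows are hypothetical, chosen only for illustration. It shows that a pre-joined table answers the same question without a join, while a change to a repeated value touches many rows instead of one.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Normalized design: the category name is stored once and referenced by key.
cur.execute("CREATE TABLE category (cat_id INTEGER PRIMARY KEY, cat_name TEXT)")
cur.execute("CREATE TABLE product (prod_id INTEGER PRIMARY KEY, cat_id INTEGER, price INTEGER)")
# Denormalized (pre-joined) design: the category name is repeated on every row.
cur.execute("CREATE TABLE product_prejoined (prod_id INTEGER PRIMARY KEY, cat_name TEXT, price INTEGER)")

cur.executemany("INSERT INTO category VALUES (?, ?)", [(1, "Tools"), (2, "Toys")])
rows = [(i, 1 + i % 2, 10 * i) for i in range(1, 7)]  # six hypothetical products
cur.executemany("INSERT INTO product VALUES (?, ?, ?)", rows)
cur.execute(
    "INSERT INTO product_prejoined "
    "SELECT p.prod_id, c.cat_name, p.price FROM product p JOIN category c USING (cat_id)"
)

# Retrieval: the normalized query needs a join, the pre-joined one does not.
normalized_total = cur.execute(
    "SELECT SUM(p.price) FROM product p JOIN category c USING (cat_id) "
    "WHERE c.cat_name = 'Toys'"
).fetchone()[0]
prejoined_total = cur.execute(
    "SELECT SUM(price) FROM product_prejoined WHERE cat_name = 'Toys'"
).fetchone()[0]

# Update: renaming a category touches one row when normalized,
# but every matching row when denormalized.
normalized_updates = cur.execute(
    "UPDATE category SET cat_name = 'Games' WHERE cat_name = 'Toys'"
).rowcount
prejoined_updates = cur.execute(
    "UPDATE product_prejoined SET cat_name = 'Games' WHERE cat_name = 'Toys'"
).rowcount

print(normalized_total, prejoined_total, normalized_updates, prejoined_updates)
```

Both totals agree, and the update counts differ by the number of rows that repeat the value, which is why the cost of denormalization grows with update frequency.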
There are multiple ways to denormalize a database, such as Pre-Joined Tables, Report Tables, Mirror Tables, Split Tables, Combined Tables, Redundant Data, Repeating Groups, Derivable Data and Hierarchies [Mullins et al, 2002, Claudia et al, 2003]. Hierarchical denormalization is a structure that is supported by most relational database management systems, such as Oracle and DB2. Although designing, representing and traversing hierarchies is complex compared with a normalized relationship, the main approach to reducing query response time is to integrate and summarize the data [Joy Mundy, 2006, Claudia et al, 2003]. Hierarchical denormalization is particularly useful in dealing with the growth of star schemas, which can be found in many DW implementations [Claudia et al, 2003, Shin, 2006].

According to conventional wisdom, all relational database designs should be based on a normalized logical data model [Hanus, 1993], and this is the issue open to discussion at this point. The advantage of
normalization is that it organizes data in a well-balanced structure to optimize data accessibility, but it also leads to some deficiencies that decrease system performance [Date, 1997, Westland, 1992, Menninger, 1995]. Furthermore, even though the idea of improving the system by reducing table joins is not new, the decision to denormalize is often impractical because it brings a heavy administrative burden, including documenting the denormalization decisions, validating the data and scheduling data migration. On the other hand, some research studies have indicated that denormalization may result in better performance and a more flexible data structure for users [Claudia et al, 2003, Shin, 2006]. Hence, weighing the pros and cons of this argument is of vital importance and can resolve many ambiguities.

The above-mentioned theoretical issues led the present researcher to pursue the following objectives:
(i) To explore the significance of the enhancement in query processing performance obtained by optimizing the DW design with the hierarchical denormalization technique as the logical data model
(ii) To investigate the possible positive effect(s) of Bitmap index on columns involved in the denormalized implementation

2-2 Indexing Technique

There are various index techniques supported by database vendors, such as Bitmap index [Inmon, 2005], B-tree [Comer, 1979, Kimball et al, 2002, Strohm, 2007], Projection index [O'Neil et al, 1997], Join bitmap index [O'Neil, 1995], Range-based bitmap index [Wu et al, 1998] and so on. Bitmap index is advisable for a system whose data are not frequently updated by many concurrent processes [O'Neil, 2007, Imhoff, Galemmo et al, 2003, Stockinger et al, 2007]. This is mainly because a Bitmap index stores a large amount of row information in each block of the index structure.
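To make the idea of a Bitmap index concrete, the following is a minimal in-memory sketch, not the Oracle implementation used in the experiments. It assumes one bitmap per distinct column value, stored as a Python integer whose bit i is set when row i holds that value; the column contents and the helper names build_bitmap_index and rows_of are hypothetical.

```python
from collections import defaultdict


def build_bitmap_index(values):
    """Map each distinct value to an int whose bit i is set when row i holds it."""
    index = defaultdict(int)
    for row_id, value in enumerate(values):
        index[value] |= 1 << row_id
    return dict(index)


def rows_of(bitmap):
    """Decode a bitmap back into a sorted list of row ids."""
    return [i for i in range(bitmap.bit_length()) if (bitmap >> i) & 1]


# Hypothetical columns of a six-row table.
region = ["N", "S", "N", "E", "S", "N"]
active = ["Y", "Y", "N", "Y", "N", "Y"]

region_idx = build_bitmap_index(region)
active_idx = build_bitmap_index(active)

# A two-predicate query (region = 'N' AND active = 'Y') is one bitwise AND.
matches = rows_of(region_idx["N"] & active_idx["Y"])
print(matches)
```

Because a single bitmap covers many rows at once, changing one row means rewriting a structure shared by many rows, which is consistent with the advice above to prefer Bitmap indexes where concurrent updates are rare.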
In addition, since Bitmap index locking is at the block level, any insert, update or delete activity may result in locking an entire range of values [Lewis, 2006]. On the other hand, B-tree index is recommended for frequently updated systems, because B-trees do not need re-balancing as frequently as other self-balancing search trees and all leaf blocks of the tree are at the same depth [Strohm, 2007]. Thus, choosing the proper type of index structure has a big impact on the DW environment. The main difficulty is that there is no definite guideline for DW analysts to choose appropriate indexing methods.

According to common practice, Bitmap index is best suited to columns with low cardinality [Chaudhuri, 1997, Kimball et al, 2002, Strohm, 2007]. Strohm [Strohm, 2007] concludes that the advantages of Bitmap indexes are greatest for low-cardinality columns, i.e., columns which have a small number of distinct values compared to the number of rows in the table. If the number of distinct values of a column is less than 1% of the number of rows in the table, the column is a candidate for a Bitmap index. This assumption may be correct to some extent based on previous algorithms
and on the older machine processing used by database software and hardware respectively, but as data usage is exploding, this assumption may no longer be applicable.

The problems discussed above led to the following objectives:
(i) To compare the efficiency of Bitmap index and B-tree index on a column with high cardinality
(ii) To compare the query response time of multi-dimensional queries with that of one-dimensional queries on both Bitmap index and B-tree index
(iii) To explore whether a query that uses a Bitmap index over a range of predicates is affected by the cardinality conditions

3- Summary of Chapters

The present research commences in the first chapter, Introduction, with the statement of the problem that triggered it and a brief review of its underlying theories. The second chapter reviews the related literature and works. It covers the pertinent research on both hierarchical denormalization and index techniques and discusses the advantages of incorporating hierarchical denormalization and Bitmap index in DW design. Further, it presents generally applicable denormalization and hierarchical denormalization process models as logical design, and indexing techniques as physical design. The chapter ends with a discussion of the drawbacks of some related models and presents commonly accepted models and techniques. Chapter 3, Experimental Design, presents an experiment plan with a logical model of hierarchical denormalization, introducing an optimized model. Likewise, it puts forth an experiment plan with a physical model on standard datasets, introducing an appropriate indexing technique. The fourth chapter, Results and Discussion, includes a detailed justification of the query transactions used in the experiments (in both data modeling and indexing), followed by the results of all experiments along with related discussions.
The final chapter, Contribution and Conclusion, discusses the contributions of the project together with its limitations and suggestions for future research.

4- Outcomes

4-1- Indexing techniques as physical designing

The performance measurement experiments are presented in three main parts:
(i) Index file size
(ii) Index construction time
(iii) Query retrieval time

Index file size and index construction time

The time taken to construct B-tree and Bitmap indexes is shown in Table 1. As can be observed, Bitmap requires slightly more time to build indexes on high-cardinality columns (Product table) than on low-cardinality columns (Sales table). B-tree, on the other hand, requires considerably more time to build all indexes, regardless of the columns' cardinalities.
Table 1: Index file size and index construction time
Columns: Size (MB) and Time (s) for each of the Sales, Order and Product tables
Rows: ID_Bit, ID_Bt, Name_Bit, Name_Bt, Active_Bit, Active_Bt

Table 1 summarizes the index sizes over various kinds of data cardinality. In Figure 1, we consider only the size of the two columns on Bitmap and B-tree indexes. For high-cardinality cases, Bitmap generates a large number of small bitmap objects and takes considerable time to allocate memory for these bitmaps. Although the index file size of a Bitmap index depends on the cardinality of the column, the index on a column is ultimately smaller than a B-tree index on the same column, even for full cardinality (100% distinct values). Table 1 and Figure 1 show that building a B-tree index on a large column is prohibitively expensive in terms of space and creation time.

Figure 1: Index file size of bitmap with various cardinality (index file size in GB for Bitmap and B-tree at low, normal and high cardinality)

Query processing time

In this section, we evaluate the time required to answer the queries. These timing measurements directly reflect the performance of the indexing methods. A summary of all the timing measurements on the several kinds of queries, which will be shown in the presentation slides, is given in Table 2.

Table 2: Query response time in seconds
Columns: Bitmap and B-tree for each of the Sales (low cardinality), Order (normal cardinality) and Product (high cardinality) tables
Rows: Query1A, Query1B, Query2A, Query2B, Query3A, Query3B, Query4A, Query4B, Query5C, Query5A,B

Figure 2 shows the query elapsed time for the Product table (the table with high cardinality). As the figure indicates, Bitmap index is much faster than B-tree index. Thus, it can be claimed that Bitmap index is suitable for all levels of column cardinality, as shown in Figure 3, where the query elapsed time is roughly constant for each query type.

Figure 2: Query elapsed time for Bitmap and B-tree index on high cardinality (elapsed time in ms for Query1A, Query1B, Query2A, Query2B, Query4A, Query4B and Query5C)

Figure 3: Query elapsed time for Bitmap index on various levels of column cardinality (elapsed time in ms for Query1A, Query1B, Query2A, Query2B, Query4A and Query5C at low, normal and high cardinality)

4-2- Hierarchical Denormalization (Data Modeling)

In order to compare the efficiency of the denormalization and normalization processes and analyze the performance of these data models, we built a series of queries on some columns for evaluation. Our dataset contains 4 tables: Fact, D1, D2 and D1D2. The Fact, D1 and D2 tables have approximately 1.7 billion records, and the D1D2 table (a combination of the D1 and D2 tables) has approximately 3.36 billion records. These records were randomly generated using a PL/SQL block with Oracle 11g tools. The tables are organized into two database schemas, Schema 1 and Schema 2, which are portrayed in Figure 4 and Figure 5 respectively. In Schema 1, the tables follow normalized modeling, where the D1 table is
connected to the D2 table by a one-to-many relationship and, similarly, the D2 table is connected to the Fact table by a one-to-many relationship. In Schema 2, the D1D2 table is directly connected to the Fact table by a one-to-many relationship. The D1D2 table is implemented with the hierarchical technique. All attributes of the dimensions, except the keys (pk), are indexed with a Bitmap index; Schema 1 contains 5 indexed columns, while Schema 2 contains 3 indexed attributes.

Fact table: D2-Id (fk), F1 numeric(8), F2 numeric(8)
Dimension D2: D2-Id numeric(8) (pk), D1-Id (fk), D2-Name varchar(8), D2-agg integer
Dimension D1: D1-Id numeric(8) (pk), D1-Name varchar(8)
Sample rows in the figure: regions in D2, divisions in D1
Figure 4: Schema1 with normalized design

Schema1 includes a DW system with fact data chained to a huge amount of data stored in the D2 and D1 dimensions (shown in Figure 4), while Schema2 contains the fact table and one dimension table implemented with the hierarchy technique (shown in Figure 5).

Fact table: D1D2-Id (fk), F1 numeric(8), F2 numeric(8)
Dimension D1D2: D1D2-Id numeric(8) (pk), D1D2-Name varchar(8), D1-Parent-Id numeric(8), D1D2-Agg integer
Sample rows in the figure: divisions and regions stored together in D1D2
Figure 5: Second schema with denormalized design

In order to evaluate the time required to respond to different query types, including range, aggregation and join queries, we will briefly describe our six selected SQL queries, with 70 stored procedures, during the presentation meeting. For each query, we use the suffix A to represent the query on Schema 1 and the suffix B to represent the query on Schema 2.
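The two schemas and the A/B query convention can be sketched as follows. This is an illustrative toy, not the queries or stored procedures of the study: it uses Python's sqlite3 instead of Oracle 11g, lowercase table names (d1, d2, d1d2, fact_a, fact_b) and a handful of invented rows, and it assumes that top-level rows in D1D2 carry a parent id of 0. It only shows the shape of the two designs and that an A query and its B counterpart return the same answer; it says nothing about performance.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.executescript("""
-- Schema 1 (normalized): Fact -> D2 -> D1
CREATE TABLE d1 (d1_id INTEGER PRIMARY KEY, d1_name TEXT);
CREATE TABLE d2 (d2_id INTEGER PRIMARY KEY, d1_id INTEGER, d2_name TEXT);
CREATE TABLE fact_a (d2_id INTEGER, f1 INTEGER);

-- Schema 2 (hierarchical): one dimension table holding both levels
CREATE TABLE d1d2 (d1d2_id INTEGER PRIMARY KEY, d1d2_name TEXT, d1_parent_id INTEGER);
CREATE TABLE fact_b (d1d2_id INTEGER, f1 INTEGER);

INSERT INTO d1 VALUES (1, 'Division 1'), (2, 'Division 2');
INSERT INTO d2 VALUES (1, 1, 'Region 1'), (2, 1, 'Region 2'), (3, 2, 'Region 3');
INSERT INTO fact_a VALUES (1, 10), (2, 20), (3, 30), (3, 40);

-- Same data in Schema 2: divisions get parent 0 (an assumption),
-- regions point at their division.
INSERT INTO d1d2 VALUES (1, 'Division 1', 0), (2, 'Division 2', 0),
                        (11, 'Region 1', 1), (12, 'Region 2', 1), (13, 'Region 3', 2);
INSERT INTO fact_b VALUES (11, 10), (12, 20), (13, 30), (13, 40);
""")

# Query A: total F1 per division on Schema 1, joining through both dimensions.
query_a = cur.execute("""
    SELECT d1.d1_name, SUM(f.f1) FROM fact_a f
    JOIN d2 ON d2.d2_id = f.d2_id
    JOIN d1 ON d1.d1_id = d2.d1_id
    GROUP BY d1.d1_name ORDER BY d1.d1_name
""").fetchall()

# Query B: the same question on Schema 2, following the parent key inside D1D2.
query_b = cur.execute("""
    SELECT p.d1d2_name, SUM(f.f1) FROM fact_b f
    JOIN d1d2 c ON c.d1d2_id = f.d1d2_id
    JOIN d1d2 p ON p.d1d2_id = c.d1_parent_id
    GROUP BY p.d1d2_name ORDER BY p.d1d2_name
""").fetchall()

print(query_a)
print(query_b)
```

In this sketch Schema 2 has one dimension table fewer than Schema 1, and the division level is reached through the D1-Parent-Id column of the same table rather than through a separate D1 table.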
We present the performance measurement experiments in three main parts:
(i) Hierarchical denormalization effects on one-dimensional modeling
(ii) Hierarchical denormalization effects on multi-dimensional modeling
(iii) Bitmap indexing effects on the hierarchical denormalized modeling

One Dimensional

Figure 6 shows the query elapsed time for one-dimensional queries applied to the first and second schemas. The figure shows that although query retrieval on the first schema, which was designed by the normalization method, is faster than on the denormalized schema, query performance can be enormously enhanced by using index techniques, especially the Bitmap index technique.

Figure 6: Query elapsed times for one-dimensional queries

Multi Dimensional

Figure 7 shows the query elapsed time for multi-dimensional hierarchical queries applied to the first and second schemas. The diagram shows that the hierarchical denormalization method can improve system response time when the queries are unanticipated ad hoc queries.

Figure 7: Query elapsed times for multi-dimensional queries that involve join operations

5- Achieved contributions
This research presented a practical view of indexing techniques, normalization and denormalization, and proposed hierarchical denormalization with fundamental guidelines to be used in DW design. It showed that the conventional academic idea of applying Bitmap index only to low-cardinality datasets cannot be considered the best. All identified guidelines need to be given appropriate attention at the time of initial design. The outcomes of our experiments provide convincing perspectives to practitioners for several reasons. Firstly, the database and datasets used reflected the functionality and multiple aspects of the hierarchical denormalization prototype of a data warehouse. Secondly, the kinds of query instances and the data populating the database were chosen to have wide academic and industry relevance. Finally, for indexing techniques, the performance metrics also provided valuable information regarding performance enhancement in DWs, which should be of most interest to practitioners.

Two significant categories of contributions can be concluded from this research:

1. Physical designing:
1-1- Bitmap index is the conclusive choice for DW design, for columns with either high or low cardinality.
1-2- The widespread opinion on using Bitmap index and B-tree index in DW should be changed by giving preference to Bitmap index in most DW designs.

2. Data Modeling:
2-1- Hierarchical denormalization has positive effects on DW performance.
2-2- Using hierarchical denormalization reduces the number of entities present in a Snowflake Schema. Such a reduction results in fewer relations and joins among the entities, which can be a main way to enhance DW performance.

References:

Chaudhuri. S, Dayal. U, An Overview of Data Warehousing and OLAP Technology, ACM SIGMOD Record, 1997
Inmon. W, Building the Data Warehouse, John Wiley & Sons, fourth edition, 2005
Kimball. R, Reeves. L, Ross. M, The Data Warehouse Toolkit, John Wiley & Sons, New York, 2nd edition, 2002
Rainardi. V, Building a Data Warehouse, Apress, 2007
March. S.T, Hevner. A.R, Integrated decision support systems: A data warehousing perspective. Decis. Support Syst. 43, 3 (Apr. 2007)
Dadashzadeh. M, An improved division operator for relational algebra. Inf. Syst. 14(5), 1989
Dadashzadeh. M, Set Comparison in Relational Query Languages. Encyclopedia of Database Technologies and Applications, 2005
Mullins. C.S, Database Administration: The Complete Guide to Practices and Procedures, Addison-Wesley, June 2002, 736 pages
O'Neil. P, Quass. D, Improved query performance with variant indexes, In SIGMOD: Proceedings of the 1997 ACM SIGMOD International Conference on Management of Data, 1997
O'Neil. P, Graefe. G, Multi-table joins through bitmapped join indices, ACM SIGMOD Record 24, 3, Sep. 1995
O'Neil. E, O'Neil. P, Bitmap index design choices and their performance implications, Database Engineering and Applications Symposium, IDEAS
Wu. K, Yu. P, Range-based bitmap indexing for high cardinality attributes with skew, In COMPSAC 98: Proceedings of the 22nd International Computer Software and Applications Conference. IEEE Computer Society, Washington, DC, USA, 1998
Imhoff. C, Galemmo. N, Geiger. J, Mastering Data Warehouse Design: Relational and Dimensional Techniques. John Wiley and Sons, New York, 2003
Lewis. J, Oracle index management secrets, BMC Software, 2006
Comer. D, The ubiquitous B-tree, ACM Comput. Surv. 11, 2, 1979
Stockinger. K, Wu. K, Bitmap indices for data warehouses, In Data Warehouses and OLAP, IRM Press, 2007, Chapter 7
Strohm. R, Oracle Database Concepts 11g, Oracle, Redwood City, CA 94065, 2007
Shin. S.K, Sanders. G.L, Denormalization strategies for data retrieval from data warehouses. Decis. Support Syst., Oct. 2006
Date. C.J, An Introduction to Database Systems, Addison-Wesley Longman Publishing Co., Inc, 2003
Mundy. J, Thornthwaite. W, The Microsoft Data Warehouse Toolkit: With SQL Server 2005 and the Microsoft Business Intelligence Toolset. John Wiley and Sons, New York, 2006
Claudia. I, Galemmo. N, Mastering Data Warehouse Design: Relational and Dimensional. John Wiley and Sons, 2003
Hanus. M, To normalize or denormalize, that is the question. In Proceedings of Computer Measurement Group's 1993 International Conference
Date. C.J, The normal is so... interesting. Database Programming and Design, 1997
Westland. J.C, Economic incentives for database normalization. Inf. Process. Manage., Jan. 1992
Menninger. D, Breaking all the rules: an insider's guide to practical normalization, Data Based Advis. (Jan. 1995)
Data Warehouse: Introduction
Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,
Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777
Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777 Course Outline Module 1: Introduction to Data Warehousing This module provides an introduction to the key components of a data warehousing
Lection 3-4 WAREHOUSING
Lection 3-4 DATA WAREHOUSING Learning Objectives Understand d the basic definitions iti and concepts of data warehouses Understand data warehousing architectures Describe the processes used in developing
Database Design Patterns. Winter 2006-2007 Lecture 24
Database Design Patterns Winter 2006-2007 Lecture 24 Trees and Hierarchies Many schemas need to represent trees or hierarchies of some sort Common way of representing trees: An adjacency list model Each
Business Intelligence in E-Learning
Business Intelligence in E-Learning (Case Study of Iran University of Science and Technology) Mohammad Hassan Falakmasir 1, Jafar Habibi 2, Shahrouz Moaven 1, Hassan Abolhassani 2 Department of Computer
CHAPTER 5: BUSINESS ANALYTICS
Chapter 5: Business Analytics CHAPTER 5: BUSINESS ANALYTICS Objectives The objectives are: Describe Business Analytics. Explain the terminology associated with Business Analytics. Describe the data warehouse
A Design and implementation of a data warehouse for research administration universities
A Design and implementation of a data warehouse for research administration universities André Flory 1, Pierre Soupirot 2, and Anne Tchounikine 3 1 CRI : Centre de Ressources Informatiques INSA de Lyon
Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
Deriving Business Intelligence from Unstructured Data
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 9 (2013), pp. 971-976 International Research Publications House http://www. irphouse.com /ijict.htm Deriving
Whitepaper. Innovations in Business Intelligence Database Technology. www.sisense.com
Whitepaper Innovations in Business Intelligence Database Technology The State of Database Technology in 2015 Database technology has seen rapid developments in the past two decades. Online Analytical Processing
Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole
Paper BB-01 Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole ABSTRACT Stephen Overton, Overton Technologies, LLC, Raleigh, NC Business information can be consumed many
Implementing a Data Warehouse with Microsoft SQL Server
This course describes how to implement a data warehouse platform to support a BI solution. Students will learn how to create a data warehouse 2014, implement ETL with SQL Server Integration Services, and
Oracle Business Intelligence Foundation Suite 11g Essentials Exam Study Guide
Oracle Business Intelligence Foundation Suite 11g Essentials Exam Study Guide Joshua Jeyasingh Senior Technical Account Manager WW A&C Partner Enablement Objective & Audience Objective Help you prepare
Sales and Operations Planning in Company Supply Chain Based on Heuristics and Data Warehousing Technology
Sales and Operations Planning in Company Supply Chain Based on Heuristics and Data Warehousing Technology Jun-Zhong Wang 1 and Ping-Yu Hsu 2 1 Department of Business Administration, National Central University,
Data Integration and ETL Process
Data Integration and ETL Process Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master studies, second
Implementing a Data Warehouse with Microsoft SQL Server
Course Code: M20463 Vendor: Microsoft Course Overview Duration: 5 RRP: 2,025 Implementing a Data Warehouse with Microsoft SQL Server Overview This course describes how to implement a data warehouse platform
CHAPTER 4: BUSINESS ANALYTICS
Chapter 4: Business Analytics CHAPTER 4: BUSINESS ANALYTICS Objectives Introduction The objectives are: Describe Business Analytics Explain the terminology associated with Business Analytics Describe the
An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of
An Introduction to Data Warehousing An organization manages information in two dominant forms: operational systems of record and data warehouses. Operational systems are designed to support online transaction
Implement a Data Warehouse with Microsoft SQL Server 20463C; 5 days
Lincoln Land Community College Capital City Training Center 130 West Mason Springfield, IL 62702 217-782-7436 www.llcc.edu/cctc Implement a Data Warehouse with Microsoft SQL Server 20463C; 5 days Course
MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012
MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Description: This five-day instructor-led course teaches students how to design and implement a BI infrastructure. The
University Data Warehouse Design Issues: A Case Study
Session 2358 University Data Warehouse Design Issues: A Case Study Melissa C. Lin Chief Information Office, University of Florida Abstract A discussion of the design and modeling issues associated with
Establish and maintain Center of Excellence (CoE) around Data Architecture
Senior BI Data Architect - Bensenville, IL The Company s Information Management Team is comprised of highly technical resources with diverse backgrounds in data warehouse development & support, business
Data Warehousing: A Technology Review and Update Vernon Hoffner, Ph.D., CCP EntreSoft Resouces, Inc.
Warehousing: A Technology Review and Update Vernon Hoffner, Ph.D., CCP EntreSoft Resouces, Inc. Introduction Abstract warehousing has been around for over a decade. Therefore, when you read the articles
BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT
BUILDING BLOCKS OF DATAWAREHOUSE G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT 1 Data Warehouse Subject Oriented Organized around major subjects, such as customer, product, sales. Focusing on
www.dotnetsparkles.wordpress.com
Database Design Considerations Designing a database requires an understanding of both the business functions you want to model and the database concepts and features used to represent those business functions.
A Knowledge Management Framework Using Business Intelligence Solutions
www.ijcsi.org 102 A Knowledge Management Framework Using Business Intelligence Solutions Marwa Gadu 1 and Prof. Dr. Nashaat El-Khameesy 2 1 Computer and Information Systems Department, Sadat Academy For
Turkish Journal of Engineering, Science and Technology
Turkish Journal of Engineering, Science and Technology 03 (2014) 106-110 Turkish Journal of Engineering, Science and Technology journal homepage: www.tujest.com Integrating Data Warehouse with OLAP Server
Mastering Data Warehouse Aggregates. Solutions for Star Schema Performance
Brochure More information from http://www.researchandmarkets.com/reports/2248199/ Mastering Data Warehouse Aggregates. Solutions for Star Schema Performance Description: - This is the first book to provide
BW-EML SAP Standard Application Benchmark
BW-EML SAP Standard Application Benchmark Heiko Gerwens and Tobias Kutning (&) SAP SE, Walldorf, Germany [email protected] Abstract. The focus of this presentation is on the latest addition to the
CHAPTER - 5 CONCLUSIONS / IMP. FINDINGS
CHAPTER - 5 CONCLUSIONS / IMP. FINDINGS In today's scenario data warehouse plays a crucial role in order to perform important operations. Different indexing techniques has been used and analyzed using
Implementing a Data Warehouse with Microsoft SQL Server MOC 20463
Implementing a Data Warehouse with Microsoft SQL Server MOC 20463 Course Outline Module 1: Introduction to Data Warehousing This module provides an introduction to the key components of a data warehousing
COURSE OUTLINE MOC 20463: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER
COURSE OUTLINE MOC 20463: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER MODULE 1: INTRODUCTION TO DATA WAREHOUSING This module provides an introduction to the key components of a data warehousing
SQL Server 2012 Business Intelligence Boot Camp
SQL Server 2012 Business Intelligence Boot Camp Length: 5 Days Technology: Microsoft SQL Server 2012 Delivery Method: Instructor-led (classroom) About this Course Data warehousing is a solution organizations
MIS636 AWS Data Warehousing and Business Intelligence Course Syllabus
MIS636 AWS Data Warehousing and Business Intelligence Course Syllabus I. Contact Information Professor: Joseph Morabito, Ph.D. Office: Babbio 419 Office Hours: By Appt. Phone: 201-216-5304 Email: [email protected]
A Case Study in Integrated Quality Assurance for Performance Management Systems
A Case Study in Integrated Quality Assurance for Performance Management Systems Liam Peyton, Bo Zhan, Bernard Stepien School of Information Technology and Engineering, University of Ottawa, 800 King Edward
The Benefits of Data Modeling in Data Warehousing
WHITE PAPER: THE BENEFITS OF DATA MODELING IN DATA WAREHOUSING The Benefits of Data Modeling in Data Warehousing NOVEMBER 2008 Table of Contents Executive Summary 1 SECTION 1 2 Introduction 2 SECTION 2
DATABASE MANAGEMENT SYSTEM
REVIEW ARTICLE DATABASE MANAGEMENT SYSTEM Sweta Singh Assistant Professor, Faculty of Management Studies, BHU, Varanasi, India E-mail: [email protected] ABSTRACT Today, more than at any previous
Methodology Framework for Analysis and Design of Business Intelligence Systems
Applied Mathematical Sciences, Vol. 7, 2013, no. 31, 1523-1528 HIKARI Ltd, www.m-hikari.com Methodology Framework for Analysis and Design of Business Intelligence Systems Martin Závodný Department of Information
CHAPTER 6 DATABASE MANAGEMENT SYSTEMS. Learning Objectives
CHAPTER 6 DATABASE MANAGEMENT SYSTEMS Management Information Systems, 10 th edition, By Raymond McLeod, Jr. and George P. Schell 2007, Prentice Hall, Inc. 1 Learning Objectives Understand the hierarchy
High-Volume Data Warehousing in Centerprise. Product Datasheet
High-Volume Data Warehousing in Centerprise Product Datasheet Table of Contents Overview 3 Data Complexity 3 Data Quality 3 Speed and Scalability 3 Centerprise Data Warehouse Features 4 ETL in a Unified
Extended RBAC Based Design and Implementation for a Secure Data Warehouse
Extended RBAC Based Design and Implementation for a Data Warehouse Dr. Bhavani Thuraisingham The University of Texas at Dallas [email protected] Srinivasan Iyer The University of Texas
Implementing a Data Warehouse with Microsoft SQL Server 2012 (70-463)
Implementing a Data Warehouse with Microsoft SQL Server 2012 (70-463) Course Description Data warehousing is a solution organizations use to centralize business data for reporting and analysis. This five-day
Presented by: Jose Chinchilla, MCITP
Presented by: Jose Chinchilla, MCITP Jose Chinchilla MCITP: Database Administrator, SQL Server 2008 MCITP: Business Intelligence SQL Server 2008 Customers & Partners Current Positions: President, Agile
Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1
Slide 29-1 Chapter 29 Overview of Data Warehousing and OLAP Chapter 29 Outline Purpose of Data Warehousing Introduction, Definitions, and Terminology Comparison with Traditional Databases Characteristics
Data Warehouses & OLAP
Riadh Ben Messaoud 1. The Big Picture 2. Data Warehouse Philosophy 3. Data Warehouse Concepts 4. Warehousing Applications 5. Warehouse Schema Design 6. Business Intelligence Reporting 7. On-Line Analytical
MS SQL Performance (Tuning) Best Practices:
MS SQL Performance (Tuning) Best Practices: 1. Don t share the SQL server hardware with other services If other workloads are running on the same server where SQL Server is running, memory and other hardware
SAS BI Course Content; Introduction to DWH / BI Concepts
SAS BI Course Content; Introduction to DWH / BI Concepts SAS Web Report Studio 4.2 SAS EG 4.2 SAS Information Delivery Portal 4.2 SAS Data Integration Studio 4.2 SAS BI Dashboard 4.2 SAS Management Console
Bitmap Index an Efficient Approach to Improve Performance of Data Warehouse Queries
Bitmap Index an Efficient Approach to Improve Performance of Data Warehouse Queries Kale Sarika Prakash 1, P. M. Joe Prathap 2 1 Research Scholar, Department of Computer Science and Engineering, St. Peters
Performance Enhancement Techniques of Data Warehouse
Performance Enhancement Techniques of Data Warehouse Mahesh Kokate VJTI-Mumbai, India [email protected] Shrinivas Karwa VJTI, Mumbai- India [email protected] Saurabh Suman VJTI-Mumbai, India
