CHAPTER-6 DATA WAREHOUSE
- Cameron Rogers
6.1 INTRODUCTION

Data warehousing is gaining in popularity as organizations realize the benefits of being able to perform sophisticated analyses of their data. Recent years have seen the introduction of a number of data warehousing engines, from both established database vendors and new players. The engines themselves are relatively easy to use and come with a good set of end-user tools. However, there is one key stumbling block to the rapid development of data warehouses, namely warehouse population. Specifically, problems arise in populating a warehouse with existing data because that data exhibits various types of heterogeneity. Given the lack of good tools, this task has generally been performed by system integrators, e.g., software consulting organizations that have developed in-house tools and processes for it. The general conclusion is that the task has proven to be labor-intensive, error-prone, and generally frustrating, leading a number of warehousing projects to be abandoned midway through development.

However, the picture is not as grim as it appears. The problems being encountered in warehouse creation are very similar to those encountered in data integration, which have been studied for about two decades. Nevertheless, not all problems relevant to warehouse creation have been solved, and a number of research issues remain. The principal goal of this paper is to identify the common issues in data integration and data warehouse creation. This should encourage developers of warehouse creation tools to examine and, where appropriate, incorporate the techniques developed
for data integration, and researchers in both the data integration and the data warehousing communities to address the open research issues in this important area.

6.2 ENTERPRISE DATA MANAGEMENT ARCHITECTURE

The standard modern enterprise data management architecture for a large organization supports both its operational, i.e., day-to-day, and its strategic, i.e., long-term, data management and analysis needs. It consists of operational data management systems, which support existing applications by managing current data, together with a corporate data warehouse and a number of data marts on which various kinds of strategic analyses are performed. In the following, we briefly describe the various elements of the architecture.

Operational Data System

These are traditional relational and other data systems, which are used for the day-to-day operations of the organization. They contain up-to-date data and provide interactive access for both transaction and decision-support systems. The database size can range from tens of megabytes to a few gigabytes, which, though large in itself, is quite small compared to the size of the corresponding warehouse. Since these databases are regularly updated, usually by concurrently executing applications, a critical goal for them is to maintain transaction consistency. In addition, since many of the applications are mission critical, e.g., in airlines and banking, there is usually a need to mask system failures from the application. Finally, because access is interactive, response time and throughput requirements are stringent.
Warehouse Creation

For a warehouse to be used effectively, it is important to pay sufficient attention to its creation. As shown in Fig. 1, this includes architecture selection, warehouse schema creation, and warehouse population. These issues are discussed in detail in subsequent sections.

Corporate Data Warehouse

This is the principal repository of historical information in the organization, and stores data instances for the enterprise data model. Data is entered into this repository periodically, usually in an append-only manner. Sometimes the data here may be used directly for analysis. However, in most cases focused sets of data are extracted into smaller data marts, where they are analyzed. Since this is often the definitive data in the organization, in many instances such a warehouse has also been called the foundation data. Since data is usually entered or extracted in batch mode, interactive response is not a significant issue.

Data Marts

Data marts are smaller data warehouses, usually focusing on a small subset of the enterprise data. Typically each mart is used by a particular unit of the organization for various strategic analyses relevant to its goals. Data is extracted from the corporate warehouse into the data mart periodically and used for analysis. Interactive response is an issue in data marts, as interactive analysis tools work directly on the data.
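The relationship between the append-only corporate warehouse and a data mart can be illustrated with a minimal sketch. It uses Python's standard `sqlite3` module with an in-memory database; the table name `sales_history`, its columns, and the function names are hypothetical and chosen only for illustration, not taken from any particular warehousing product.

```python
import sqlite3


def create_warehouse(conn):
    # Hypothetical history table: every row carries the date of the batch that loaded it.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales_history ("
        " load_date TEXT, region TEXT, product TEXT, amount REAL)"
    )


def append_batch(conn, load_date, rows):
    # Periodic batch load: rows are only ever inserted, never updated or deleted.
    conn.executemany(
        "INSERT INTO sales_history (load_date, region, product, amount)"
        " VALUES (?, ?, ?, ?)",
        [(load_date, region, product, amount) for region, product, amount in rows],
    )
    conn.commit()


def extract_mart(conn, region):
    # A focused extract for one organizational unit: per-product totals for one region.
    cursor = conn.execute(
        "SELECT product, SUM(amount) FROM sales_history"
        " WHERE region = ? GROUP BY product ORDER BY product",
        (region,),
    )
    return cursor.fetchall()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    create_warehouse(conn)
    append_batch(conn, "week-1", [("east", "widget", 10.0), ("west", "widget", 5.0)])
    append_batch(conn, "week-2", [("east", "widget", 2.5), ("east", "gadget", 4.0)])
    print(extract_mart(conn, "east"))
```

In this sketch the warehouse keeps every batch, while the mart holds only the aggregated subset one unit needs, which is what makes interactive analysis on the mart practical.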
Warehouse Analysis Tools

A number of tools have been built for performing strategic analysis of warehouse data. As shown in Fig. 2, these include trend analysis, data mining, simulation, forecasting, and on-line analytical processing.

6.3 ISSUES IN DATA WAREHOUSE CREATION

A number of issues must be addressed in building a data warehouse. Some are associated with the creation of the warehouse and others with its operation. In the following, we discuss the issues affecting warehouse creation.

Warehouse Architecture Selection

A number of architectural approaches to data warehousing are possible, and selection between them is affected by factors such as size, nature of use, etc.

Database Conversion

Fig. 2. Database conversion.

Shown in Fig. 2, this architectural choice involves taking all the data in the source systems and performing a conversion to the (integrated) target
system. This approach is applied when either (i) the source systems are to be retired and all the data is being moved to the target system, or (ii) both the source and target systems will continue to be in operation and this is the initial population of the target database. The data in the source systems is mapped into a global model and then written out/exported to the target database. In this approach, bulk data conversion is being performed, and hence optimization of the processing is important. Integrating data from multiple sources can require comparison between all pairs of data items, i.e., the cost can in general be quadratic with respect to the input size. Since the input can typically be on the order of tens of millions of items or more, this optimization is quite critical.

Database Synchronization

Fig. 3. Database synchronization.

Fig. 3 shows the synchronization architecture. In this case, there is an existing warehouse, and data is extracted periodically from the operational source systems to update the target system, thus synchronizing the target system to the source systems. The key differences between this architecture and the previous one are:

1. The volume of data handled is much smaller, and
2. Information being produced for the target database already has its counterpart from before, with which it may require integration.

The main issue is one of using incremental algorithms to optimize the processing. Much recent work on materialized view maintenance is relevant to this problem.

Federated Database

In some cases the users of the enterprise-wide integrated data are few, and having a separate full-fledged data warehouse may be overkill. In addition, there may sometimes be a requirement that the target database be completely up to date. In such cases, a federated database (FDB) may be the best architectural choice, as shown in Fig. 4. In this architecture, only the data required by a query is integrated. Hence, the optimization focuses on handling small amounts of data as well as fast interactive response time. Since query processing and data integration must be handled together, the optimizer must consider both simultaneously.

Enterprise Schema Creation

With any of the data warehouse architectural choices discussed above, the first step is to develop an enterprise schema. The enterprise schema is the output of enterprise data modeling, and captures the main entities about which the enterprise maintains information and the relationships between them. It is invariably the case that various units of an organization have existing schemas for their respective portions of the enterprise-wide information. These are used as the starting point for the enterprise schema. Thus, an important task for enterprise schema creation is integrating these existing schemas into it and filling out any missing elements. In addition, data warehouses provide explicit
support for the historical dimension as well as aggregation, which must be modeled as special entities. Finally, the union of the enterprise data may give rise to new entities, which must be modeled in the schema.

Enterprise Data Model

A data model for the enterprise must first be developed, to identify the key entities in the organization and their relationships to each other. In an ideal situation, such modeling would begin by analyzing the various processes in the organization and extracting the relevant entities and relationships from them. However, in most situations it is much more realistic to use the existing entities and relationships, modeled by the schemas of existing component data sources, as a starting point. Analysis of various organizational processes helps in refining the enterprise data model. Experience has shown that there is no perfect enterprise data model, and hence this task can never be 100 percent complete. Thus, once a reasonable model has been obtained it must be put into operation, with provision for subsequent modifications.

Schema Integration

In creating the enterprise data model, an all too often encountered situation is one where multiple, independently developed databases model different aspects of the same set of real-world entities. Having been developed independently, however, the schemas do not match well. Structural heterogeneity is one class of discrepancies that arise, e.g., different field sizes, units, or scales. Semantic heterogeneity is another class of discrepancies, which includes the synonym and the homonym problems. An example of the former is the names EMP and EMPLOYEE being used to refer to the same class of real-world employees in two different databases. An example of the latter is the name EMP being used
to refer to real-world employees in one database, and to real-world employers in another. The reason for these discrepancies is that the designers of the respective databases were targeting different sets of applications, even though for the same set of real-world entities, and hence captured different (but overlapping) models of reality in their databases. Hence, overall the schema integration problem is one of integrating multiple overlapping models of a set of real-world entities. Specific technical approaches include automatic and semiautomatic ways to determine synonyms, antonyms, IS-A relationships, etc. A large body of related work exists in the database and knowledge representation communities.

Constraints

In addition to the structural and semantic mismatches of schema entities, there is an additional problem of constraint mismatches, which are often not evident from the definitions of entity types. For example, one database may have a constraint such as EMP.Age <= 60 while another may have the constraint on age as EMP.Age <= 65. In integrating such databases there seems to be in general no right approach to resolving such constraint incompatibilities. For example, we can take the approach that the new constraint should be EMP.Age <= 65, since it is the weaker of the two, and hence all data is guaranteed to satisfy it. However, tightness of constraints in a schema is usually an indication of the quality of the data in the database, and such an approach eventually degrades the overall quality of the data to its lowest common denominator. An alternative approach is to select some tighter constraint at integration time, based on existing constraints in participating databases, and
perform the requisite data quality improvement through the use of tools and human intervention, to ensure that the integrated data has at least this data quality. Using an object model for the integrated database, another possible approach is to define two employee sets, namely one with EMP.Age <= 60 and the other with EMP.Age <= 65, with an IS-A relationship between the two sets. There has been some initial work in addressing these issues.

Warehouse Population

Once the component schemas have been integrated to develop the global schema of the warehouse, with acceptable resolution of schema mismatches, the next step is to populate the warehouse with data. Experience has shown that even though the schemas may have been integrated, there may still be problems in integrating the specific data instances.

Fig. 4. Federated database.
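The three ways of handling the EMP.Age constraint mismatch described under Constraints can be sketched in a few lines of Python. This is an illustrative sketch only: the `Emp` record type and the function names are hypothetical, and a real integration tool would work against the database catalogs rather than in-memory lists.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Emp:
    # Hypothetical employee record; `source` names the originating database.
    name: str
    age: int
    source: str


def weakest_bound(bounds):
    # Option 1: adopt the weakest constraint, which all existing data satisfies.
    return max(bounds)


def split_by_bound(records, bound):
    # Option 2: adopt a tighter bound and set aside violating records
    # for data quality improvement (tools or human review).
    satisfied = [r for r in records if r.age <= bound]
    needs_review = [r for r in records if r.age > bound]
    return satisfied, needs_review


def nested_sets(records, tight, loose):
    # Option 3: keep both constraints as two sets related by IS-A:
    # every member of the tight set is also a member of the loose set.
    loose_set = [r for r in records if r.age <= loose]
    tight_set = [r for r in loose_set if r.age <= tight]
    return tight_set, loose_set


if __name__ == "__main__":
    staff = [Emp("a", 58, "db1"), Emp("b", 63, "db2"), Emp("c", 60, "db1")]
    print(weakest_bound([60, 65]))
    print(split_by_bound(staff, 60))
    print(nested_sets(staff, 60, 65))
```

The second option makes the cost of the tighter constraint explicit: the `needs_review` list is exactly the data that must be repaired or justified before the integrated database can claim the higher quality level.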
Semantic Issues

A number of semantic discrepancies arise in integrating data instances from multiple sources. A specific problem is entity identification, namely determining whether a pair of records coming from two different databases represents the same or different real-world entities. This becomes a difficult problem because usually there is no common key that can be used as an identifier for the union of the two sets of records. Another is the attribute value conflict resolution problem. This occurs when records from different databases have been matched, i.e., entity identification has occurred, but it is found that values of the same attribute for an entity instance, coming from different data sources, are different. There can be a number of possible sources of such errors, e.g., data-entry errors, different policies of database maintenance, and modeling of slightly different concepts. As data warehouses are being built, experience with such problems is being generated, and various ad hoc solutions have been proposed. A systematic study of semantic heterogeneity issues in warehouse population has only just begun.

Scalability

Since warehouses store information about the database as it progresses over time, they tend to grow much more rapidly than on-line databases. It is quite common to start with an initial warehouse whose size is measured in gigabytes, and subsequently have a periodic (weekly or monthly) update rate of 1-10 gigabytes. The processing described above for warehouse population must be carried out for the initial population as well as for every periodic update. The data integration tasks typically range in complexity from O(n lg n) to O(n^2) for n data items. Given tens to hundreds of gigabytes of data, this can be very time consuming, and
hence there is a need for improved algorithms for data integration tasks. Furthermore, since these tasks are heavily set-oriented, data-parallel computing techniques appear promising and should be investigated.

Incremental Updates

As data is added to an existing warehouse, it must be integrated with pre-existing data. The success of incremental update techniques in on-line databases must be extended to this type of processing. Examples of such techniques include incremental algorithms and indices. As of now, hardly any work has been done in this area.
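To make the interplay of entity identification, attribute value conflicts, and incremental updates concrete, the following Python sketch integrates records one at a time into an existing set using an index. It is not a method from the text: the blocking key (first letter of the normalized name plus postal code), the matching rule, and all names are assumptions chosen for illustration. The index means each incoming record is compared only against records sharing its key, rather than against every stored record.

```python
from collections import defaultdict


def normalize(name):
    # Lowercase and collapse whitespace so trivially different spellings compare equal.
    return " ".join(name.lower().split())


def block_key(record):
    # Hypothetical blocking key: first letter of the normalized name plus postal code.
    return (normalize(record["name"])[:1], record["zip"])


def same_entity(a, b):
    # Toy matching rule standing in for a real entity identification technique.
    return normalize(a["name"]) == normalize(b["name"]) and a["zip"] == b["zip"]


class Warehouse:
    def __init__(self):
        self.entities = []              # integrated records
        self.index = defaultdict(list)  # block key -> positions in self.entities
        self.conflicts = []             # (attribute, stored value, incoming value)

    def add(self, record):
        # Incremental integration: compare only against records in the same block.
        key = block_key(record)
        for pos in self.index[key]:
            if same_entity(self.entities[pos], record):
                self._merge(self.entities[pos], record)
                return pos
        self.entities.append(dict(record))
        self.index[key].append(len(self.entities) - 1)
        return len(self.entities) - 1

    def _merge(self, existing, incoming):
        # Fill in missing attributes; log differing values instead of guessing a winner.
        for attr, value in incoming.items():
            if attr not in existing:
                existing[attr] = value
            elif existing[attr] != value:
                self.conflicts.append((attr, existing[attr], value))


if __name__ == "__main__":
    wh = Warehouse()
    wh.add({"name": "Acme Corp", "zip": "10001", "phone": "555-0100"})
    wh.add({"name": "acme  corp", "zip": "10001", "phone": "555-0199"})
    wh.add({"name": "Apex Ltd", "zip": "10001"})
    print(len(wh.entities), wh.conflicts)
```

Conflicting attribute values are recorded rather than resolved, since the text notes that only ad hoc solutions to conflict resolution have been proposed; a real system would apply a resolution policy at that point.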
Framework for Data warehouse architectural components
Framework for Data warehouse architectural components Author: Jim Wendt Organization: Evaltech, Inc. Evaltech Research Group, Data Warehousing Practice. Date: 04/08/11 Email: [email protected] Abstract:
LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES
LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES MUHAMMAD KHALEEL (0912125) SZABIST KARACHI CAMPUS Abstract. Data warehouse and online analytical processing (OLAP) both are core component for decision
Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Successful Outsourcing of Data Warehouse Support
Experience the commitment viewpoint Successful Outsourcing of Data Warehouse Support Focus IT management on the big picture, improve business value and reduce the cost of data Data warehouses can help
Chapter 5. Warehousing, Data Acquisition, Data. Visualization
Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives
TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS
9 8 TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS Assist. Prof. Latinka Todoranova Econ Lit C 810 Information technology is a highly dynamic field of research. As part of it, business intelligence
An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of
An Introduction to Data Warehousing An organization manages information in two dominant forms: operational systems of record and data warehouses. Operational systems are designed to support online transaction
TopBraid Insight for Life Sciences
TopBraid Insight for Life Sciences In the Life Sciences industries, making critical business decisions depends on having relevant information. However, queries often have to span multiple sources of information.
META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING
META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING Ramesh Babu Palepu 1, Dr K V Sambasiva Rao 2 Dept of IT, Amrita Sai Institute of Science & Technology 1 MVR College of Engineering 2 [email protected]
MDM and Data Warehousing Complement Each Other
Master Management MDM and Warehousing Complement Each Other Greater business value from both 2011 IBM Corporation Executive Summary Master Management (MDM) and Warehousing (DW) complement each other There
Information management software solutions White paper. Powerful data warehousing performance with IBM Red Brick Warehouse
Information management software solutions White paper Powerful data warehousing performance with IBM Red Brick Warehouse April 2004 Page 1 Contents 1 Data warehousing for the masses 2 Single step load
JOURNAL OF OBJECT TECHNOLOGY
JOURNAL OF OBJECT TECHNOLOGY Online at www.jot.fm. Published by ETH Zurich, Chair of Software Engineering JOT, 2008 Vol. 7, No. 8, November-December 2008 What s Your Information Agenda? Mahesh H. Dodani,
Lection 3-4 WAREHOUSING
Lection 3-4 DATA WAREHOUSING Learning Objectives Understand d the basic definitions iti and concepts of data warehouses Understand data warehousing architectures Describe the processes used in developing
Business Intelligence
Transforming Information into Business Intelligence Solutions Business Intelligence Client Challenges The ability to make fast, reliable decisions based on accurate and usable information is essential
TIBCO Live Datamart: Push-Based Real-Time Analytics
TIBCO Live Datamart: Push-Based Real-Time Analytics ABSTRACT TIBCO Live Datamart is a new approach to real-time analytics and data warehousing for environments where large volumes of data require a management
Proper study of Data Warehousing and Data Mining Intelligence Application in Education Domain
Journal of The International Association of Advanced Technology and Science Proper study of Data Warehousing and Data Mining Intelligence Application in Education Domain AMAN KADYAAN JITIN Abstract Data-driven
IT0457 Data Warehousing. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT
IT0457 Data Warehousing G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT Outline What is data warehousing The benefit of data warehousing Differences between OLTP and data warehousing The architecture
Jagir Singh, Greeshma, P Singh University of Northern Virginia. Abstract
224 Business Intelligence Journal July DATA WAREHOUSING Ofori Boateng, PhD Professor, University of Northern Virginia BMGT531 1900- SU 2011 Business Intelligence Project Jagir Singh, Greeshma, P Singh
BUSINESS INTELLIGENCE. Keywords: business intelligence, architecture, concepts, dashboards, ETL, data mining
BUSINESS INTELLIGENCE Bogdan Mohor Dumitrita 1 Abstract A Business Intelligence (BI)-driven approach can be very effective in implementing business transformation programs within an enterprise framework.
PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions. A Technical Whitepaper from Sybase, Inc.
PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions A Technical Whitepaper from Sybase, Inc. Table of Contents Section I: The Need for Data Warehouse Modeling.....................................4
Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
Part 22. Data Warehousing
Part 22 Data Warehousing The Decision Support System (DSS) Tools to assist decision-making Used at all levels in the organization Sometimes focused on a single area Sometimes focused on a single problem
Data Warehousing Systems: Foundations and Architectures
Data Warehousing Systems: Foundations and Architectures Il-Yeol Song Drexel University, http://www.ischool.drexel.edu/faculty/song/ SYNONYMS None DEFINITION A data warehouse (DW) is an integrated repository
OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA
OLAP and OLTP AMIT KUMAR BINDAL Associate Professor Databases Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age Information, which is created by data,
Extended RBAC Based Design and Implementation for a Secure Data Warehouse
Extended RBAC Based Design and Implementation for a Data Warehouse Dr. Bhavani Thuraisingham The University of Texas at Dallas [email protected] Srinivasan Iyer The University of Texas
SPATIAL DATA CLASSIFICATION AND DATA MINING
, pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal
Business Intelligence and Service Oriented Architectures. An Oracle White Paper May 2007
Business Intelligence and Service Oriented Architectures An Oracle White Paper May 2007 Note: The following is intended to outline our general product direction. It is intended for information purposes
Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya
Chapter 6 Basics of Data Integration Fundamentals of Business Analytics Learning Objectives and Learning Outcomes Learning Objectives 1. Concepts of data integration 2. Needs and advantages of using data
When to consider OLAP?
When to consider OLAP? Author: Prakash Kewalramani Organization: Evaltech, Inc. Evaltech Research Group, Data Warehousing Practice. Date: 03/10/08 Email: [email protected] Abstract: Do you need an OLAP
[callout: no organization can afford to deny itself the power of business intelligence ]
Publication: Telephony Author: Douglas Hackney Headline: Applied Business Intelligence [callout: no organization can afford to deny itself the power of business intelligence ] [begin copy] 1 Business Intelligence
Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage
Moving Large Data at a Blinding Speed for Critical Business Intelligence A competitive advantage Intelligent Data In Real Time How do you detect and stop a Money Laundering transaction just about to take
Integrating SAP and non-sap data for comprehensive Business Intelligence
WHITE PAPER Integrating SAP and non-sap data for comprehensive Business Intelligence www.barc.de/en Business Application Research Center 2 Integrating SAP and non-sap data Authors Timm Grosser Senior Analyst
Implementing Oracle BI Applications during an ERP Upgrade
Implementing Oracle BI Applications during an ERP Upgrade Summary Jamal Syed BI Practice Lead Emerging solutions 20 N. Wacker Drive Suite 1870 Chicago, IL 60606 Emerging Solutions, a professional services
Data Warehouses & OLAP
Riadh Ben Messaoud 1. The Big Picture 2. Data Warehouse Philosophy 3. Data Warehouse Concepts 4. Warehousing Applications 5. Warehouse Schema Design 6. Business Intelligence Reporting 7. On-Line Analytical
Database Marketing, Business Intelligence and Knowledge Discovery
Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski
Investor Presentation. Second Quarter 2015
Investor Presentation Second Quarter 2015 Note to Investors Certain non-gaap financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences
Knowledge Base Data Warehouse Methodology
Knowledge Base Data Warehouse Methodology Knowledge Base's data warehousing services can help the client with all phases of understanding, designing, implementing, and maintaining a data warehouse. This
Offload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper
Offload Enterprise Data Warehouse (EDW) to Big Data Lake Oracle Exadata, Teradata, Netezza and SQL Server Ample White Paper EDW (Enterprise Data Warehouse) Offloads The EDW (Enterprise Data Warehouse)
Data Warehouse: Introduction
Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,
Implementing Oracle BI Applications during an ERP Upgrade
1 Implementing Oracle BI Applications during an ERP Upgrade Jamal Syed Table of Contents TABLE OF CONTENTS... 2 Executive Summary... 3 Planning an ERP Upgrade?... 4 A Need for Speed... 6 Impact of data
Proven Testing Techniques in Large Data Warehousing Projects
A P P L I C A T I O N S A WHITE PAPER SERIES A PAPER ON INDUSTRY-BEST TESTING PRACTICES TO DELIVER ZERO DEFECTS AND ENSURE REQUIREMENT- OUTPUT ALIGNMENT Proven Testing Techniques in Large Data Warehousing
DATA INTEGRATION CS561-SPRING 2012 WPI, MOHAMED ELTABAKH
DATA INTEGRATION CS561-SPRING 2012 WPI, MOHAMED ELTABAKH 1 DATA INTEGRATION Motivation Many databases and sources of data that need to be integrated to work together Almost all applications have many sources
A Knowledge Management Framework Using Business Intelligence Solutions
www.ijcsi.org 102 A Knowledge Management Framework Using Business Intelligence Solutions Marwa Gadu 1 and Prof. Dr. Nashaat El-Khameesy 2 1 Computer and Information Systems Department, Sadat Academy For
Healthcare Measurement Analysis Using Data mining Techniques
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 03 Issue 07 July, 2014 Page No. 7058-7064 Healthcare Measurement Analysis Using Data mining Techniques 1 Dr.A.Shaik
Dimensional Modeling for Data Warehouse
Modeling for Data Warehouse Umashanker Sharma, Anjana Gosain GGS, Indraprastha University, Delhi Abstract Many surveys indicate that a significant percentage of DWs fail to meet business objectives or
DATA MINING AND WAREHOUSING CONCEPTS
CHAPTER 1 DATA MINING AND WAREHOUSING CONCEPTS 1.1 INTRODUCTION The past couple of decades have seen a dramatic increase in the amount of information or data being stored in electronic format. This accumulation
Data Integration and Data Provenance. Profa. Dra. Cristina Dutra de Aguiar Ciferri [email protected]
Data Integration and Data Provenance Profa. Dra. Cristina Dutra de Aguiar Ciferri [email protected] Outline n Data Integration q Schema Integration q Instance Integration q Our Work n Data Provenance q
3/17/2009. Knowledge Management BIKM eclassifier Integrated BIKM Tools
Paper by W. F. Cody J. T. Kreulen V. Krishna W. S. Spangler Presentation by Dylan Chi Discussion by Debojit Dhar THE INTEGRATION OF BUSINESS INTELLIGENCE AND KNOWLEDGE MANAGEMENT BUSINESS INTELLIGENCE
Analytical CRM to Operational CRM Operational CRM to Analytical CRM Applications
Closing the Loop - Using SAS to drive CRM Anton Hirschowitz, Detica Ltd Introduction Customer Insight underpins Customer Relationship Management (CRM). Without a detailed understanding of customer profiles
Applied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA
Applied Business Intelligence Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA Agenda Business Drivers and Perspectives Technology & Analytical Applications Trends Challenges
BI and ETL Process Management Pain Points
BI and ETL Process Management Pain Points Understanding frequently encountered data workflow processing pain points and new strategies for addressing them What You Will Learn Business Intelligence (BI)
Cray: Enabling Real-Time Discovery in Big Data
Cray: Enabling Real-Time Discovery in Big Data Discovery is the process of gaining valuable insights into the world around us by recognizing previously unknown relationships between occurrences, objects
CHAPTER 1 INTRODUCTION
CHAPTER 1 INTRODUCTION 1. Introduction 1.1 Data Warehouse In the 1990's as organizations of scale began to need more timely data for their business, they found that traditional information systems technology
Master Data Management. Zahra Mansoori
Master Data Management Zahra Mansoori 1 1. Preference 2 A critical question arises How do you get from a thousand points of data entry to a single view of the business? We are going to answer this question
ETL-EXTRACT, TRANSFORM & LOAD TESTING
ETL-EXTRACT, TRANSFORM & LOAD TESTING Rajesh Popli Manager (Quality), Nagarro Software Pvt. Ltd., Gurgaon, INDIA [email protected] ABSTRACT Data is most important part in any organization. Data
Data Integration and ETL Process
Data Integration and ETL Process Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master studies, second
OLAP and Data Warehousing! Introduction!
The image cannot be displayed. Your computer may not have enough memory to open the image, or the image may have been corrupted. Restart your computer, and then open the file again. If the red x still
How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time
SCALEOUT SOFTWARE How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time by Dr. William Bain and Dr. Mikhail Sobolev, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T wenty-first
Oracle BI Applications (BI Apps) is a prebuilt business intelligence solution.
1 2 Oracle BI Applications (BI Apps) is a prebuilt business intelligence solution. BI Apps supports Oracle sources, such as Oracle E-Business Suite Applications, Oracle's Siebel Applications, Oracle's
Business Process Management In An Application Development Environment
Business Process Management In An Application Development Environment Overview Today, many core business processes are embedded within applications, such that it s no longer possible to make changes to
Security Issues for the Semantic Web
Security Issues for the Semantic Web Dr. Bhavani Thuraisingham Program Director Data and Applications Security The National Science Foundation Arlington, VA On leave from The MITRE Corporation Bedford,
A Review of Data Mining Techniques
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,
Introduction to Datawarehousing
DIPARTIMENTO DI INGEGNERIA INFORMATICA AUTOMATICA E GESTIONALE ANTONIO RUBERTI Master of Science in Engineering in Computer Science (MSE-CS) Seminars in Software and Services for the Information Society
The Role of the BI Competency Center in Maximizing Organizational Performance
The Role of the BI Competency Center in Maximizing Organizational Performance Gloria J. Miller Dr. Andreas Eckert MaxMetrics GmbH October 16, 2008 Topics The Role of the BI Competency Center Responsibilities
Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success
Developing an MDM Strategy Key Components for Success WHITE PAPER Table of Contents: Introduction, Process Considerations, Architecture Considerations, Conclusion, About Knowledgent
14. Data Warehousing & Data Mining
14. Data Warehousing & Data Mining Data Warehousing Concepts Decision support is key for companies wanting to turn their organizational data into an information asset Data Warehouse "A subject-oriented,
CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University
CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University Given today's business environment, at times a corporate executive
A Model-based Software Architecture for XML Data and Metadata Integration in Data Warehouse Systems
Proceedings of the Postgraduate Annual Research Seminar 2005 A Model-based Software Architecture for XML Data and Metadata Integration in Data Warehouse Systems Abstract Wan Mohd Haffiz Mohd Nasir, Shamsul Sahibuddin
Oracle Warehouse Builder 10g
Oracle Warehouse Builder 10g Architectural White paper February 2004 Table of contents: INTRODUCTION, OVERVIEW, THE DESIGN COMPONENT, THE RUNTIME COMPONENT, THE DESIGN ARCHITECTURE
How to Enhance Traditional BI Architecture to Leverage Big Data
BIG DATA How to Enhance Traditional BI Architecture to Leverage Big Data Contents: Executive Summary, Traditional BI - DataStack 2.0 Architecture, Benefits of Traditional BI - DataStack 2.0...
perspective Progressive Organization
perspective Progressive Organization Progressive organization Owing to rapid changes in today's digital world, the data landscape is constantly shifting and creating new complexities. Today, organizations
CHAPTER - 5 CONCLUSIONS / IMP. FINDINGS
CHAPTER - 5 CONCLUSIONS / IMP. FINDINGS In today's scenario, the data warehouse plays a crucial role in performing important operations. Different indexing techniques have been used and analyzed using
Data Warehousing and OLAP Technology for Knowledge Discovery
Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria
Fluency With Information Technology CSE100/IMT100
Fluency With Information Technology CSE100/IMT100 Larry Snyder & Mel Oyler, Instructors Ariel Kemp, Isaac Kunen, Gerome Miklau & Sean Squires, Teaching Assistants University of Washington, Autumn 1999
DATA WAREHOUSING AND OLAP TECHNOLOGY
DATA WAREHOUSING AND OLAP TECHNOLOGY Manya Sethi MCA Final Year Amity University, Uttar Pradesh Under Guidance of Ms. Shruti Nagpal Abstract DATA WAREHOUSING and Online Analytical Processing (OLAP) are
Data Warehousing Concepts
Data Warehousing Concepts JB Software and Consulting Inc 1333 McDermott Drive, Suite 200 Allen, TX 75013. DATA WAREHOUSING What is a Data Warehouse? Decision Support Systems (DSS), provides an analysis
Database Marketing simplified through Data Mining
Database Marketing simplified through Data Mining Author*: Dr. Ing. Arnfried Ossen, Head of the Data Mining/Marketing Analysis Competence Center, Private Banking Division, Deutsche Bank, Frankfurt, Germany
Vertical Data Warehouse Solutions for Financial Services
Decision Framework, M. Knox Research Note 24 July 2003 Vertical Data Warehouse Solutions for Financial Services Packaged DW financial services solutions differ in degree of and approach to verticalization,
How To Use Noetix
Using Oracle BI with Oracle E-Business Suite How to Meet Enterprise-wide Reporting Needs with OBI EE Using Oracle BI with Oracle E-Business Suite 2008-2010 Noetix Corporation Copying of this document is
HYPERION MASTER DATA MANAGEMENT SOLUTIONS FOR IT
HYPERION MASTER DATA MANAGEMENT SOLUTIONS FOR IT POINT-AND-SYNC MASTER DATA MANAGEMENT 04.2005 Hyperion s new master data management solution provides a centralized, transparent process for managing critical
Datawarehousing and Business Intelligence
Datawarehousing and Business Intelligence Vannaratana (Bee) Praruksa March 2001 Report for the course component Datawarehousing and OLAP MSc in Information Systems Development Academy of Communication
BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT
BUILDING BLOCKS OF DATAWAREHOUSE G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT 1 Data Warehouse Subject Oriented Organized around major subjects, such as customer, product, sales. Focusing on
How To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
Eleven Steps to Success in Data Warehousing
DATA WAREHOUSING BUILDING A DATA WAREHOUSE IS NO EASY TASK... THE RIGHT PEOPLE, METHODOLOGY, AND EXPERIENCE ARE EXTREMELY CRITICAL Eleven Steps to Success in Data Warehousing by Sanjay Raizada General
CSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing
CSE 544 Principles of Database Management Systems Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing Class Projects Class projects are going very well! Project presentations: 15 minutes On Wednesday
OLAP Theory-English version
OLAP Theory-English version On-Line Analytical processing (Business Intelligence) [Ing.J.Skorkovský,CSc.] Department of corporate economy Agenda The Market Why OLAP (On-Line-Analytic-Processing Introduction
Advanced Big Data Analytics with R and Hadoop
REVOLUTION ANALYTICS WHITE PAPER Advanced Big Data Analytics with R and Hadoop 'Big Data' Analytics as a Competitive Advantage Big Analytics delivers competitive advantage in two ways compared to the traditional
Data Management Practices for Intelligent Asset Management in a Public Water Utility
Data Management Practices for Intelligent Asset Management in a Public Water Utility Author: Rod van Buskirk, Ph.D. Introduction Concerned about potential failure of aging infrastructure, water and wastewater
Is ETL Becoming Obsolete?
Is ETL Becoming Obsolete? Why a Business-Rules-Driven E-LT Architecture is Better Sunopsis. All rights reserved. The information contained in this document does not constitute a contractual agreement with
Effecting Data Quality Improvement through Data Virtualization
Effecting Data Quality Improvement through Data Virtualization Prepared for Composite Software by: David Loshin Knowledge Integrity, Inc. June, 2010 Introduction The
