OLAP, Knowledge Discovery from Database, Social Security Fund, Oracle Warehouse Builder, Oracle Discoverer.
|
|
|
- Shon Blankenship
- 10 years ago
- Views:
Transcription
1 ABSTRACT Mohamed Salah GOUIDER 1, Amine FARHAT 2 BESTMOD Laboratory Institut Supérieur de Gestion 41, rue de la liberté, cite Bouchoucha Bardo, 2000, Tunis, TUNISIA [email protected] 1, [email protected] 2 The amounts of data available to decision makers are increasingly important, given the network availability, low cost storage and diversity of applications. To maximize the potential of these data within the National Social Security Fund (NSSF) in Tunisia, we have built a data warehouse as a multidimensional database, cleaned, homogenized, historicized and consolidated. We used Oracle Warehouse Builder to extract, transform and load the source data into the Data Warehouse, by applying the KDD process. We have implemented the Data Warehouse as an Oracle OLAP. The knowledge extraction has been performed using the Oracle Discoverer tool. This allowed users to take maximum advantage of knowledge as a regular report or as ad hoc queries. We started by implementing the main topic for this public institution, accounting for the movements of insured persons. The great success that has followed the completion of this work has encouraged the NSSF to complete the achievement of other topics of interest within the NSSF. We suggest in the near future to use Multidimensional Data Mining to extract hidden knowledge and that are not predictable by the OLAP. KEYWORDS OLAP, Knowledge Discovery from Database, Social Security Fund, Oracle Warehouse Builder, Oracle Discoverer. 1. INTRODUCTION Decision support systems have a great support to users for making decisions and solving complex problems. They have done this last time, a great success in the professional world. Knowledge Discovery in Database (KDD) is a nontrivial process of analyzing data to extract patterns or knowledge valid, novel and useful to decision-making [3]. KDD is motivated by the massive use of computer systems. The huge volumes of data stored in these systems are improving exponentially, which implies the use of Data Warehousing. A Data Warehouse is defined as a subject-oriented, historical, integrated and non-volatile collection of data, intended for decision support [5]. The Data Warehouse imports data from several operational systems, which are usually heterogeneous. A selection step (the data concerning the subject of the Data Warehouse) and pre-processing of data through ETL tools [7] (Extract, Transform and Load) is needed, this is the integration step. The exploitation and exploration of the data warehouse uses multiple highly sophisticated tools, primarily: OLAP Systems (On-Line Analytical Processing) [7], and Data Mining [3]. We used all the techniques and technologies mentioned above to build a Data Warehouse for the National Social Security Fund NSSF- in Tunisia. Social security in Tunisia is entrusted to two agencies: the National Pension and Providence Fund ( and National Social Security Fund NSSF- ( The /ijdms
2 first deals with the insured public sector and the second deals with the private sector. NSSF manages hundreds of thousands of insured persons and their dependents, according to several schemes and application of laws and regulations. The management of insured uses several computer systems: at the central and regional sites. The complexity and fragmentation of operations on heterogeneous computing systems, physically and logically, make it very difficult and sometimes impossible to extract information and knowledge, and achievement tests for purposes of decision support. To resolve this problem, we designed and implemented a data warehouse that includes all movements of insured accounts. This data warehouse is the core of a system for decision support; it is designed as a multidimensional model based on a factual basis and several dimensions, using the star model. The development of tools for manipulating the warehouse has been done mainly on: the tools of manipulation of multidimensional OLAP structures, tools for extracting reports, charts and analytical formulation of ad-hoc queries (unplanned) adapted to the needs of decision makers. This article is organized as follows. Section 2 is devoted to the specification of the project and its objectives. Section 3 concerns the design and implementation of data warehouse. Section 4 presents the results of our project. Conclusion and perspectives of this work are discussed in section PROJECT SPECIFICATION An insured is defined as an individual affiliated with a social security scheme. It pays its dues and subsidies and has several benefits. The movements of accounting insured designate its related financial transactions. The NSSF manages a large number of insured persons, exceeding three million. The management of insured persons and their movements accounting, is distributed over several heterogeneous systems: mainframe, n-tier architecture and relational databases. This complicates the analysis for purposes of decision support and especially when confronting the problem of data integrity and consolidation. Taking into account this problem and before starting the construction of the warehouse, the following targets were set, in collaboration with users and business experts: Improved monitoring of monetary transactions covered by social insurance. Better knowledge of movement provided by several dimensions: Plan, Nature of Benefits, Time, Location. Optimal management of expenditure and revenue. Homogenization of heterogeneous data sources: Oracle Database, Excel files, flat files, etc. The dynamic analysis of the causes of good functioning and dysfunctions based on different dimensions (the Plan, the Regional Office, Nature of service, period) and crosses between dimensions. The dynamic search of correlations between different independent criteria using analytical tools. Planned and instantaneous diffusion of information necessary to all decision makers: Scoreboards and Scoring. The generation of more detailed summary statements of these movements. 3. BUILDING THE DATA WAREHOUSE At this section, the various stages of design and construction of data warehouse are presented. The study of information system of the NSSF has allowed us to locate the sources of data. 103
3 3.1 Data Sources. The following data sources were identified: The CICS system: the mainframe, IBM, which manages the financial operations of the NSSF. This system is distributed over the central site and all regional and local subsidiaries. Almost forty COBOL files were selected for data extraction. Each file has a size of approximately 100MB. Earlier versions of files have been used to recover data from previous years. It should be noted that the administration and maintenance of CICS system and its files is a very heavy and complex task, involving several old technologies. The Insured Persons Management System: based on a client / server architecture, using Oracle 10g DBMS [8]. The database has almost 6 million rows or records with a size of 120 GB, implemented on a powerful server, SUN. The relational structure of the database has greatly aided and facilitated the task of extracting and manipulating data. Several Oracle databases and access files that contain other data necessary for data warehouse. Excel files from local and regional structures. These files have typically large sizes of up to 50 MB. These files contain data that are not processed through computer applications (CICS System and Oracle DBMS). The content of these files is extremely important within the NSSF. A set of about 100 Excel files has been processed The Design Model of the Data Warehouse After locating the sources of data, we proceeded to design the Data Warehouse; the star model [11] was adopted, based on a fact table and multiple dimensions. This model is presented in Figure 1 below. Figure 1. Data Warehouse Design: The star schema 104
4 The model shown in Figure 2 is the result of the study of the decision objectives, and the existing operational systems that are used to load the warehouse. The different movements of insured accounts are the basic facts; the above dimensions represent different aspects of an element of the basic fact table: Social Insured Dimension: This dimension defines the data necessary and available for an insured. It allows seeing the movements of insured persons in relation to their identities and marital status. Regional Office Dimension: This dimension can analyze the movements performed by the regional office or by governorate. It uses two hierarchical levels: Regional and Governorate. Time Dimension: This dimension helps to visualize the movement provided by the time axis is divided into four significant levels in the business logic of the NSSF: Day, Month, Quarter and Year. Payment Dimension: This dimension can classify movements by mode of payment. Social Scheme (Regime) Dimension: This dimension brings the types of social schemes offered by the NSSF. Benefit (Prestation) Dimension: This dimension presents the different types of benefits of the NSSF. This multidimensional approach is used to achieve effective decision analysis. To implement the data warehouse, we used Oracle Warehouse Builder 10g [10]. 3.3 The ETL Process An ETL (Extract, Transform and Load) has been implemented for loading the data warehouse from various heterogeneous data sources. Oracle Warehouse Builder [10] was used to process data from Oracle databases and Access databases. Regarding the accounting transactions of the central system, we developed an application in Visual Basic, for extracting, transforming and loading data to a relational database initially and then to the multidimensional structures, a sample of this code source, designed to load the data of the regional office dimension is presented below: Private Sub Charger_Bureau_Regional() Dim MaComm As ADODB.Command Dim cnx As New ADODB.Connection Set cnx = New ADODB.Connection 'Définition de la chaîne de connexion cnx.connectionstring = "Provider=MSDASQL.1;Password=*******;Persist Security Info=True;User ID=assurance;Data Source=assurancedns;Mode=ReadWrite" 'MSDASQL 'Ouverture de la base de données cnx.open If cnx.state = adstateopen Then MsgBox "connection etablie" End If Set MaComm = New ADODB.Command Set MaComm.ActiveConnection = cnx Set DataGrid1.DataSource = Adodc3 DataGrid1.Refresh NBENRG.Caption = Adodc3.Recordset.RecordCount 105
5 dater1 = 0 matr1 = 0 CCR1 = 0 For j = 1 To Adodc3.Recordset.RecordCount + 1 wcode_br = Adodc3.Recordset.Fields("code_br") wnom = Adodc3.Recordset.Fields("nom_br") wcode_p = Adodc3.Recordset.Fields("code_postal") wgov = Adodc3.Recordset.Fields("Gouvernorat") MaComm.CommandText = " insert into bureau_regional VALUES ('" & wcode_br & "','" & wnom & "','" & wcode_p & "','" & wgov & "');" MaComm.Execute g = g + 1 Adodc3.Recordset.MoveNext Next j NBENRG1.Caption = Adodc3.Recordset.RecordCount End Sub The transformation represents an important step in preprocessing and adaptation of data sources to the model of the warehouse. We performed the correction of incorrect values and replacement of missing values by using statistical techniques: Bining [12], Medium [9], and Regression [6]. A particular interest has been allocated to the resolution of conflicts and consolidation of heterogeneous data. Decision support and analytical orientations have been given to a number of attributes and their values (Adding attributes, calculation of aggregates, Standardization...). We returned to previous backups of production systems to retrieve data of previous years. Thus, Data Warehouse built spread over a long period which provides more powerful analytical tools. Indeed, the data warehouse built is very large, with a size of 200 GB The mechanisms of access and retrieval of knowledge After building the data warehouse, we created a set of mechanisms and tools for access and exploration of knowledge. Creating an OLAP cube [1, 2, 4] (On Line Analytical Processing) is the first step. Recall that the OLAP cube is a multidimensional logical structure for exploration and navigation in data with OLAP tools and enforcement of various analytical operations with very high information value [7]: ROLL UP, ROTATE, SLICE. The creation of the cube induces the creation of several intermediate calculations supposed to improve the response time thereafter. Figure 2 shows the topology of the cube that we created as generated by Oracle Warehouse Builder. 106
6 Figure 2. Cube Topology The huge volume of data and the analytical queries affects negatively system performance in terms of response time. To overcome this problem, several materialized views [1] have been created based on the needs and on the tests performed with users and business experts. The principle is the pre-calculation of query results may be asked frequently, these results are stored in dedicated structures to allow quick access. We programmed the reconstruction of those views to each update of the data warehouse. For example, we expose the materialized view MvtRegPresBr that stores the calculated amounts of balances of monetary transactions by service, by regional office (We have 41 regional and local offices) and the Scheme dimension. The creation of materialized views implies the achievement of all intermediate computings i.e., including combinations of all variables instantiations, may be performed into users queries thereafter. This optimizes the response time of the system significantly. This view has the following structure: Name Datatype Size Scale Nulls? Default Value LIBELLE_REGIME VARCHAR2 250 No LIBELLE_PRESTATION VARCHAR2 50 No NOM_BR VARCHAR2 40 Yes SUM(MVTASS.MONTANT) NUMBER The following source code is the SQL script executed to create the MvtRegPresBr view: CREATE MATERIALIZED VIEW "ASSURANCE"."MTPRESTREGBR" STORAGE ( INITIAL 2M NEXT 2K MAXEXTENTS UNLIMITED) TABLESPACE "DWCNSS" BUILD IMMEDIATE USING INDEX TABLESPACE "DWCNSS" STORAGE ( INITIAL 2M NEXT 2K MAXEXTENTS UNLIMITED) REFRESH FORCE ON DEMAND ENABLE QUERY REWRITE AS Yes 107
7 SELECT REGIME.LIBELLE_REGIME,PRESTATION.LIBELLE_PRESTATION,BUREAU_REGIONAL.NOM_BR,SU M(MVTASS.MONTANT) FROM BUREAU_REGIONAL,PRESTATION,MVTASS,REGIME WHERE (MVTASS.CODE_BR=BUREAU_REGIONAL.CODE_BR) AND (MVTASS.CODE_PRESTATION=PRESTATION.CODE_PRESTATION) AND (MVTASS.CODE_REGIME=REGIME.CODE_REGIME) GROUP BY LIBELLE_REGIME,BUREAU_REGIONAL.NOM_BR,LIBELLE_PRESTATION; On another level, we have used Oracle Discoverer [10], a tool of Oracle Business Intelligence suite, which exists in two versions: Administrator and Desktop, to create reports of exploration data warehouse. These reports can be manipulated interactively by the user and allow him to express his choice dynamically, which is the main objective of developing OLAP solutions. In addition to that, users can create their own reports or charts and navigate into the hierarchies of dimensions. The interface between Discoverer and the Oracle Data warehouse is done through the layers EUL (End User Layers) created with the Administrator tool. In Section 4, we present examples of results generated. 4. RESULT We expose the results and cases of use of the Data Warehouse, which combined different date sources for the first time in their life cycles. Figure 3 below shows a sample of the report: Distribution of social benefits balance amounts by Governorate (we have 24 governorates) and type of Benefits (we have 8 Social Security Benefits) for a user defined time interval. Figure 4 shows a sample of the report of the amounts of benefits according to several dimensions and axes of analysis: Time, Regime (We have 6 regimes), Governorate, type of Benefits. We note the high level of usability of the report which allows the user to modify all parameters depending on his information needs. That was our goal from the beginning. Before the construction of OLAP, it was very difficult and sometimes impossible to extract the information contained in the report of Figure 4, necessary for control and monitoring. According to a study that we conducted, it required 3 or 4 days of treatment with the intervention of at least one computer scientist. With OLAP, the repot takes only 60 minutes of execution, with the possibility of intervention by the user to change input parameters and see results in real time. The following source code is the SQL script executed to extract this report on relational data bases without creating OLAP Cubes: SELECT REGIME.LIBELLE_REGIME, PRESTATION.LIBELLE_PRESTATION, BUREAU_REGIONAL.NOM_BR, SUM(MVTASS.MONTANT) FROM BUREAU_REGIONAL, PRESTATION, MVTASS,REGIME WHERE (MVTASS.CODE_BR = BUREAU_REGIONAL.CODE_BR) AND (MVTASS.CODE_PRESTATION = PRESTATION.CODE_PRESTATION) AND (MVTASS.CODE_REGIME = REGIME.CODE_REGIME) GROUP BY LIBELLE_REGIME, BUREAU_REGIONAL.NOM_BR, LIBELLE_PRESTATION; A fundamental characteristic of OLAP systems is that they can see the data at all levels of abstraction: From highest to lowest. Figure 5 provides an extract of the allocation of accounting transactions of a covered person (the most basic level of abstraction). With the possibility of changing parameters (Regime, insured person, level of abstraction...) interactively as shown in Figure 5. We've hidden the identity of the insured person for reasons of protection of private data. 108
8 Figure 3. Distribution of social benefits balance amounts by governorate, by regional office and benefit type Figure 4. Distribution of social benefits balance amounts across multiple dimensions with real time update capabilities 109
9 Figure5. Social benefits amounts of a well defined social insured person (Lowest level of abstraction) The display of information is a key factor for the user in interpreting the results. Our system allows the retrieval and the analysis of results in many formats. Figure 6 shows a diagram presenting the balance amounts by governorate. Figure 6. Diagram of social benefits balance amounts by governorate 110
10 At figure 7, a drill down operation has been applied on the Benefits Dimension which gave the allocation of accounting balances by type of benefit, and we show that the system can move from one level to other easily. All these operations can be performed dynamically and interactively by the user. The computational time consumed to extract these diagrams is of the order of 100 minutes or more, it depends deeply on the analysis period designated by the user. It should be noted that these diagrams could not be generated before the construction of the OLAP system: lack of integration of data and advanced analytics. Moreover, the analytical decision support query execution is a high computational load which may affect the performance of production systems, given the absence of computing units dedicated to decision support. Figure 7. Drill Down on Benefit Dimension (Figure 6) The last example shown in figures 8 and 9, shows the techniques of handling the time dimension and the possibility of extracting information by day, month, quarter and year (DRILL UP and DRILL DOWN) under dynamic choices of the user. We note that the system allows you to view multiple levels of abstraction of the time dimension on the same interface. 111
11 Figure 8. Handling the Time Dimension for real time update of information Figure 9. Changing other parameters of Figure 8. At this section, we exposed a sample of results that can be generated by the data warehouse tools that we built. This includes all accounting transactions of insured persons. 5. CONCLUSION In this paper, we addressed the problem of creating a solution for decision support within the National Social Security Fund of the Republic of Tunisia. To achieve this goal, we designed and built a Data Warehouse of accounting transactions of insured persons, whose data from production systems. The process of Knowledge Discovery from Data has been applied in all its steps, including pre-processing, transformation and consolidation. This step was very difficult given the heterogeneity of data sources (Cobol files, relational databases, Access...) and the 112
12 multitude of problems we encountered during the consolidation of data (missing data, wrong data...), it represented approximately 50% of the total period of the project. We used the Oracle Warehouse Builder ETL tool (Extract, Transform and Load). We also implemented our own programs to deal with some complex data sources. A multidimensional approach has been adopted for the creation of OLAP cubes and materialized views according to the preliminary studies and the initial objectives. Several tools have also been made available to users for the exploration of the Data Warehouse and retrieval of relevant knowledge useful in making strategic decisions in different formats in a dynamic and interactive way. Users and experts of the NSSF, always hampered by limited decision support resources, expressed a high level of satisfaction and commitment to use the Data Warehouse. We have mentioned some samples of the results extracted in Section 4 of this Article. The prospects of this work remain the creation of solutions for other fields the NSSF and the implementation of Data Mining techniques for data analysis and knowledge extraction. REFERENCES [1] Chaudhuri, S., Dayal, U., An overview of data warehousing and OLAP technology, in ACM SIGMOD, Num. 26, 1997, pages [2] Codd, E. F., Codd, S. B., Salley, C. T., Beyond decision support, in Computer world, [3] Fayyad, U., Piatetsky-Shapiro, G., et Smyth, P. From Data Mining to Knowledge Discovery: An Overview, in Fayyad, U., Piatetsky-Shapiro, G.,Amith, Smyth, P., and Uthurnsamy, R. (eds.), Advances in Knowledge Discovery and Data Mining, MIT Press, Cambridge, 1996, pages [4] Grey, J., Chauduri, S., Bosworth, A., Layman, A., Reichart, D., Venkatrao, M., Pellow, F., Pirahesh, H., Data cube: A relational aggregation operator generalizing Goup By, Cross tab and sub totals, in Data mining and Knowledge Discovery, Num. 1, 1997, pages [5] Inmon, W. H., Building the Data warehouse, Fourth Edition, in New York: John Wiley & Sons, [6] Han, J., Kamber, M., Data Mining: Concepts and Techniques, Second Edition, Morgan Kaufmann, San Fransisco, [7] Kimball, R., Ross, M., The Data warehouse Toolkit, Second Edition, New York: John Wiley & Sons, [8] Oracle Corp., Oracle 10g Administration 1 & 2, [9] Pyle, D., Data preparation for Data mining, in San Francisco: Morgan Kauffman, [10] Stachowiak, R., Rayman, J., Greenwald, R., Oracle Datawarehousing and Business Intelligence Solutions, in Indiapolis, Indiana: Wiley Publishing, [11] Nguyen. T. M., Complex Data Warehousing and Knowledge Discovery for Advanced Retrieval Development: Innovative Methods and Applications, Information Science Reference, [12] Weiss, S. M., Indurkhya, N., Predictive Data mining, in San Francisco: Morgan Kauffman,
13 Authors Mohamed Salah GOUIDER is associate professor at the University of Tunis Tunisia. He now has thirty years experience in the field of databases and in recent years in the field of Data Warehouse and Data Mining. He earned his doctorate at the Faculty of Science - University of Nice - France in He is currently a consultant top management in several public and private enterprises. He has extensive experience in several countries: France, Tunisia, Kuwait, Qatar and Benin. His recent research is focused mainly in the extraction of knowledge from data in the medical and financial, as well as the optimization algorithms of the Data Warehouse and Data Mining. Amine FARHAT, Master of Science in Computer Science, is a researcher and Ph.D. Student at the BESTMOD Laboratory, University of Tunis. He is assistant professor in Computer Science in the ISG of Tunis. He is also a senior computer science analyst Engineer, Head of Software Development and Decision Support Systems Projects, in a public Office. His research Interests are mainly Knowledge Discovery in Databases, Data mining algorithms, Artificial Intelligence and Data warehousing. 114
A Design and implementation of a data warehouse for research administration universities
A Design and implementation of a data warehouse for research administration universities André Flory 1, Pierre Soupirot 2, and Anne Tchounikine 3 1 CRI : Centre de Ressources Informatiques INSA de Lyon
BUILDING OLAP TOOLS OVER LARGE DATABASES
BUILDING OLAP TOOLS OVER LARGE DATABASES Rui Oliveira, Jorge Bernardino ISEC Instituto Superior de Engenharia de Coimbra, Polytechnic Institute of Coimbra Quinta da Nora, Rua Pedro Nunes, P-3030-199 Coimbra,
www.ijreat.org Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 28
Data Warehousing - Essential Element To Support Decision- Making Process In Industries Ashima Bhasin 1, Mr Manoj Kumar 2 1 Computer Science Engineering Department, 2 Associate Professor, CSE Abstract SGT
A Brief Tutorial on Database Queries, Data Mining, and OLAP
A Brief Tutorial on Database Queries, Data Mining, and OLAP Lutz Hamel Department of Computer Science and Statistics University of Rhode Island Tyler Hall Kingston, RI 02881 Tel: (401) 480-9499 Fax: (401)
Turkish Journal of Engineering, Science and Technology
Turkish Journal of Engineering, Science and Technology 03 (2014) 106-110 Turkish Journal of Engineering, Science and Technology journal homepage: www.tujest.com Integrating Data Warehouse with OLAP Server
Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1
Slide 29-1 Chapter 29 Overview of Data Warehousing and OLAP Chapter 29 Outline Purpose of Data Warehousing Introduction, Definitions, and Terminology Comparison with Traditional Databases Characteristics
Introduction to Data Warehousing. Ms Swapnil Shrivastava [email protected]
Introduction to Data Warehousing Ms Swapnil Shrivastava [email protected] Necessity is the mother of invention Why Data Warehouse? Scenario 1 ABC Pvt Ltd is a company with branches at Mumbai,
OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP
Data Warehousing and End-User Access Tools OLAP and Data Mining Accompanying growth in data warehouses is increasing demands for more powerful access tools providing advanced analytical capabilities. Key
CHAPTER 5: BUSINESS ANALYTICS
Chapter 5: Business Analytics CHAPTER 5: BUSINESS ANALYTICS Objectives The objectives are: Describe Business Analytics. Explain the terminology associated with Business Analytics. Describe the data warehouse
Course Design Document. IS417: Data Warehousing and Business Analytics
Course Design Document IS417: Data Warehousing and Business Analytics Version 2.1 20 June 2009 IS417 Data Warehousing and Business Analytics Page 1 Table of Contents 1. Versions History... 3 2. Overview
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria
Dimensional Modeling for Data Warehouse
Modeling for Data Warehouse Umashanker Sharma, Anjana Gosain GGS, Indraprastha University, Delhi Abstract Many surveys indicate that a significant percentage of DWs fail to meet business objectives or
Data Mining Solutions for the Business Environment
Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania [email protected] Over
DATA WAREHOUSING AND OLAP TECHNOLOGY
DATA WAREHOUSING AND OLAP TECHNOLOGY Manya Sethi MCA Final Year Amity University, Uttar Pradesh Under Guidance of Ms. Shruti Nagpal Abstract DATA WAREHOUSING and Online Analytical Processing (OLAP) are
Data Warehousing: A Technology Review and Update Vernon Hoffner, Ph.D., CCP EntreSoft Resouces, Inc.
Warehousing: A Technology Review and Update Vernon Hoffner, Ph.D., CCP EntreSoft Resouces, Inc. Introduction Abstract warehousing has been around for over a decade. Therefore, when you read the articles
1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing
1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing 2. What is a Data warehouse a. A database application
Distance Learning and Examining Systems
Lodz University of Technology Distance Learning and Examining Systems - Theory and Applications edited by Sławomir Wiak Konrad Szumigaj HUMAN CAPITAL - THE BEST INVESTMENT The project is part-financed
Data Warehousing and OLAP Technology for Knowledge Discovery
542 Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories
A Critical Review of Data Warehouse
Global Journal of Business Management and Information Technology. Volume 1, Number 2 (2011), pp. 95-103 Research India Publications http://www.ripublication.com A Critical Review of Data Warehouse Sachin
BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT
BUILDING BLOCKS OF DATAWAREHOUSE G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT 1 Data Warehouse Subject Oriented Organized around major subjects, such as customer, product, sales. Focusing on
CHAPTER 4: BUSINESS ANALYTICS
Chapter 4: Business Analytics CHAPTER 4: BUSINESS ANALYTICS Objectives Introduction The objectives are: Describe Business Analytics Explain the terminology associated with Business Analytics Describe the
Data Warehouse Snowflake Design and Performance Considerations in Business Analytics
Journal of Advances in Information Technology Vol. 6, No. 4, November 2015 Data Warehouse Snowflake Design and Performance Considerations in Business Analytics Jiangping Wang and Janet L. Kourik Walker
14. Data Warehousing & Data Mining
14. Data Warehousing & Data Mining Data Warehousing Concepts Decision support is key for companies wanting to turn their organizational data into an information asset Data Warehouse "A subject-oriented,
When to consider OLAP?
When to consider OLAP? Author: Prakash Kewalramani Organization: Evaltech, Inc. Evaltech Research Group, Data Warehousing Practice. Date: 03/10/08 Email: [email protected] Abstract: Do you need an OLAP
DATA WAREHOUSING - OLAP
http://www.tutorialspoint.com/dwh/dwh_olap.htm DATA WAREHOUSING - OLAP Copyright tutorialspoint.com Online Analytical Processing Server OLAP is based on the multidimensional data model. It allows managers,
An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of
An Introduction to Data Warehousing An organization manages information in two dominant forms: operational systems of record and data warehouses. Operational systems are designed to support online transaction
Chapter 5. Warehousing, Data Acquisition, Data. Visualization
Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives
BUILDING A HEALTH CARE DATA WAREHOUSE FOR CANCER DISEASES
BUILDING A HEALTH CARE DATA WAREHOUSE FOR CANCER DISEASES Dr.Osama E.Sheta 1 and Ahmed Nour Eldeen 2 1,2 Department of Mathematics Faculty of Science, Zagazig University, Zagazig, Elsharkia, Egypt. 1 [email protected],
IST722 Data Warehousing
IST722 Data Warehousing Components of the Data Warehouse Michael A. Fudge, Jr. Recall: Inmon s CIF The CIF is a reference architecture Understanding the Diagram The CIF is a reference architecture CIF
OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA
OLAP and OLTP AMIT KUMAR BINDAL Associate Professor Databases Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age Information, which is created by data,
DATA WAREHOUSE AND DATA MINING NECCESSITY OR USELESS INVESTMENT
Scientific Bulletin Economic Sciences, Vol. 9 (15) - Information technology - DATA WAREHOUSE AND DATA MINING NECCESSITY OR USELESS INVESTMENT Associate Professor, Ph.D. Emil BURTESCU University of Pitesti,
DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.
DATA MINING TECHNOLOGY Georgiana Marin 1 Abstract In terms of data processing, classical statistical models are restrictive; it requires hypotheses, the knowledge and experience of specialists, equations,
Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers
60 Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative
Data W a Ware r house house and and OLAP II Week 6 1
Data Warehouse and OLAP II Week 6 1 Team Homework Assignment #8 Using a data warehousing tool and a data set, play four OLAP operations (Roll up (drill up), Drill down (roll down), Slice and dice, Pivot
Speeding ETL Processing in Data Warehouses White Paper
Speeding ETL Processing in Data Warehouses White Paper 020607dmxwpADM High-Performance Aggregations and Joins for Faster Data Warehouse Processing Data Processing Challenges... 1 Joins and Aggregates are
The Role of Data Warehousing Concept for Improved Organizations Performance and Decision Making
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 10, October 2014,
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
SQL Server 2012 Business Intelligence Boot Camp
SQL Server 2012 Business Intelligence Boot Camp Length: 5 Days Technology: Microsoft SQL Server 2012 Delivery Method: Instructor-led (classroom) About this Course Data warehousing is a solution organizations
E-Governance in Higher Education: Concept and Role of Data Warehousing Techniques
E-Governance in Higher Education: Concept and Role of Data Warehousing Techniques Prateek Bhanti Asst. Professor, FASC, MITS Deemed University, Lakshmangarh-332311, Sikar, Rajasthan, INDIA Urmani Kaushal
How To Use Data Mining For Knowledge Management In Technology Enhanced Learning
Proceedings of the 6th WSEAS International Conference on Applications of Electrical Engineering, Istanbul, Turkey, May 27-29, 2007 115 Data Mining for Knowledge Management in Technology Enhanced Learning
DATA MINING AND WAREHOUSING CONCEPTS
CHAPTER 1 DATA MINING AND WAREHOUSING CONCEPTS 1.1 INTRODUCTION The past couple of decades have seen a dramatic increase in the amount of information or data being stored in electronic format. This accumulation
Data Warehousing and Data Mining
Data Warehousing and Data Mining Part I: Data Warehousing Gao Cong [email protected] Slides adapted from Man Lung Yiu and Torben Bach Pedersen Course Structure Business intelligence: Extract knowledge
Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing
Fluency With Information Technology CSE100/IMT100
Fluency With Information Technology CSE100/IMT100 ),7 Larry Snyder & Mel Oyler, Instructors Ariel Kemp, Isaac Kunen, Gerome Miklau & Sean Squires, Teaching Assistants University of Washington, Autumn 1999
A Survey on Data Warehouse Architecture
A Survey on Data Warehouse Architecture Rajiv Senapati 1, D.Anil Kumar 2 1 Assistant Professor, Department of IT, G.I.E.T, Gunupur, India 2 Associate Professor, Department of CSE, G.I.E.T, Gunupur, India
Business Intelligence, Analytics & Reporting: Glossary of Terms
Business Intelligence, Analytics & Reporting: Glossary of Terms A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Ad-hoc analytics Ad-hoc analytics is the process by which a user can create a new report
BUILDING A WEB-ENABLED DATA WAREHOUSE FOR DECISION SUPPORT IN CONSTRUCTION EQUIPMENT MANAGEMENT
BUILDING A WEB-ENABLED DATA WAREHOUSE FOR DECISION SUPPORT IN CONSTRUCTION EQUIPMENT MANAGEMENT Hongqin Fan ([email protected]) Graduate Research Assistant, University of Alberta, AB, T6G 2E1, Canada Hyoungkwan
Deductive Data Warehouses and Aggregate (Derived) Tables
Deductive Data Warehouses and Aggregate (Derived) Tables Kornelije Rabuzin, Mirko Malekovic, Mirko Cubrilo Faculty of Organization and Informatics University of Zagreb Varazdin, Croatia {kornelije.rabuzin,
Hybrid Support Systems: a Business Intelligence Approach
Journal of Applied Business Information Systems, 2(2), 2011 57 Journal of Applied Business Information Systems http://www.jabis.ro Hybrid Support Systems: a Business Intelligence Approach Claudiu Brandas
Tracking System for GPS Devices and Mining of Spatial Data
Tracking System for GPS Devices and Mining of Spatial Data AIDA ALISPAHIC, DZENANA DONKO Department for Computer Science and Informatics Faculty of Electrical Engineering, University of Sarajevo Zmaja
Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring
www.ijcsi.org 78 Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring Mohammed Mohammed 1 Mohammed Anad 2 Anwar Mzher 3 Ahmed Hasson 4 2 faculty
DATA WAREHOUSING APPLICATIONS: AN ANALYTICAL TOOL FOR DECISION SUPPORT SYSTEM
DATA WAREHOUSING APPLICATIONS: AN ANALYTICAL TOOL FOR DECISION SUPPORT SYSTEM MOHAMMED SHAFEEQ AHMED Guest Lecturer, Department of Computer Science, Gulbarga University, Gulbarga, Karnataka, India (e-mail:
Data Warehousing. Outline. From OLTP to the Data Warehouse. Overview of data warehousing Dimensional Modeling Online Analytical Processing
Data Warehousing Outline Overview of data warehousing Dimensional Modeling Online Analytical Processing From OLTP to the Data Warehouse Traditionally, database systems stored data relevant to current business
Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole
Paper BB-01 Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole ABSTRACT Stephen Overton, Overton Technologies, LLC, Raleigh, NC Business information can be consumed many
An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies
An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies Ashish Gahlot, Manoj Yadav Dronacharya college of engineering Farrukhnagar, Gurgaon,Haryana Abstract- Data warehousing, Data Mining,
Data Warehousing and Data Mining in Business Applications
133 Data Warehousing and Data Mining in Business Applications Eesha Goel CSE Deptt. GZS-PTU Campus, Bathinda. Abstract Information technology is now required in all aspect of our lives that helps in business
II. OLAP(ONLINE ANALYTICAL PROCESSING)
Association Rule Mining Method On OLAP Cube Jigna J. Jadav*, Mahesh Panchal** *( PG-CSE Student, Department of Computer Engineering, Kalol Institute of Technology & Research Centre, Gujarat, India) **
Business Intelligence: Effective Decision Making
Business Intelligence: Effective Decision Making Bellevue College Linda Rumans IT Instructor, Business Division Bellevue College [email protected] Current Status What do I do??? How do I increase
Data warehouses. Data Mining. Abraham Otero. Data Mining. Agenda
Data warehouses 1/36 Agenda Why do I need a data warehouse? ETL systems Real-Time Data Warehousing Open problems 2/36 1 Why do I need a data warehouse? Why do I need a data warehouse? Maybe you do not
A Comparative Study on Operational Database, Data Warehouse and Hadoop File System T.Jalaja 1, M.Shailaja 2
RESEARCH ARTICLE A Comparative Study on Operational base, Warehouse Hadoop File System T.Jalaja 1, M.Shailaja 2 1,2 (Department of Computer Science, Osmania University/Vasavi College of Engineering, Hyderabad,
Data Warehousing Systems: Foundations and Architectures
Data Warehousing Systems: Foundations and Architectures Il-Yeol Song Drexel University, http://www.ischool.drexel.edu/faculty/song/ SYNONYMS None DEFINITION A data warehouse (DW) is an integrated repository
Data Integration and ETL Process
Data Integration and ETL Process Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master studies, second
Part 22. Data Warehousing
Part 22 Data Warehousing The Decision Support System (DSS) Tools to assist decision-making Used at all levels in the organization Sometimes focused on a single area Sometimes focused on a single problem
Migrating a Discoverer System to Oracle Business Intelligence Enterprise Edition
Migrating a Discoverer System to Oracle Business Intelligence Enterprise Edition Milena Gerova President Bulgarian Oracle User Group [email protected] Who am I Project Manager in TechnoLogica Ltd
B.Sc (Computer Science) Database Management Systems UNIT-V
1 B.Sc (Computer Science) Database Management Systems UNIT-V Business Intelligence? Business intelligence is a term used to describe a comprehensive cohesive and integrated set of tools and process used
New Approach of Computing Data Cubes in Data Warehousing
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 14 (2014), pp. 1411-1417 International Research Publications House http://www. irphouse.com New Approach of
Data Warehousing: Data Models and OLAP operations. By Kishore Jaladi [email protected]
Data Warehousing: Data Models and OLAP operations By Kishore Jaladi [email protected] Topics Covered 1. Understanding the term Data Warehousing 2. Three-tier Decision Support Systems 3. Approaches
The Microsoft Business Intelligence 2010 Stack Course 50511A; 5 Days, Instructor-led
The Microsoft Business Intelligence 2010 Stack Course 50511A; 5 Days, Instructor-led Course Description This instructor-led course provides students with the knowledge and skills to develop Microsoft End-to-
Datawarehousing and Business Intelligence
Datawarehousing and Business Intelligence Vannaratana (Bee) Praruksa March 2001 Report for the course component Datawarehousing and OLAP MSc in Information Systems Development Academy of Communication
Turning your Warehouse Data into Business Intelligence: Reporting Trends and Visibility Michael Armanious; Vice President Sales and Marketing Datex,
Turning your Warehouse Data into Business Intelligence: Reporting Trends and Visibility Michael Armanious; Vice President Sales and Marketing Datex, Inc. Overview Introduction What is Business Intelligence?
Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University
Bussiness Intelligence and Data Warehouse Schedule Bussiness Intelligence (BI) BI tools Oracle vs. Microsoft Data warehouse History Tools Oracle vs. Others Discussion Business Intelligence (BI) Products
LEARNING SOLUTIONS website milner.com/learning email [email protected] phone 800 875 5042
Course 20467A: Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Length: 5 Days Published: December 21, 2012 Language(s): English Audience(s): IT Professionals Overview Level: 300
131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10
1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom
International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 Over viewing issues of data mining with highlights of data warehousing Rushabh H. Baldaniya, Prof H.J.Baldaniya,
What is Customer Relationship Management? Customer Relationship Management Analytics. Customer Life Cycle. Objectives of CRM. Three Types of CRM
Relationship Management Analytics What is Relationship Management? CRM is a strategy which utilises a combination of Week 13: Summary information technology policies processes, employees to develop profitable
Data Mart/Warehouse: Progress and Vision
Data Mart/Warehouse: Progress and Vision Institutional Research and Planning University Information Systems What is data warehousing? A data warehouse: is a single place that contains complete, accurate
MINING CLICKSTREAM-BASED DATA CUBES
MINING CLICKSTREAM-BASED DATA CUBES Ronnie Alves and Orlando Belo Departament of Informatics,School of Engineering, University of Minho Campus de Gualtar, 4710-057 Braga, Portugal Email: {alvesrco,obelo}@di.uminho.pt
Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage
Moving Large Data at a Blinding Speed for Critical Business Intelligence A competitive advantage Intelligent Data In Real Time How do you detect and stop a Money Laundering transaction just about to take
Oracle Warehouse Builder 10g
Oracle Warehouse Builder 10g Architectural White paper February 2004 Table of contents INTRODUCTION... 3 OVERVIEW... 4 THE DESIGN COMPONENT... 4 THE RUNTIME COMPONENT... 5 THE DESIGN ARCHITECTURE... 6
CS2032 Data warehousing and Data Mining Unit II Page 1
UNIT II BUSINESS ANALYSIS Reporting Query tools and Applications The data warehouse is accessed using an end-user query and reporting tool from Business Objects. Business Objects provides several tools
Foundations of Business Intelligence: Databases and Information Management
Foundations of Business Intelligence: Databases and Information Management Problem: HP s numerous systems unable to deliver the information needed for a complete picture of business operations, lack of
Data Mining and Database Systems: Where is the Intersection?
Data Mining and Database Systems: Where is the Intersection? Surajit Chaudhuri Microsoft Research Email: [email protected] 1 Introduction The promise of decision support systems is to exploit enterprise
Designing a Dimensional Model
Designing a Dimensional Model Erik Veerman Atlanta MDF member SQL Server MVP, Microsoft MCT Mentor, Solid Quality Learning Definitions Data Warehousing A subject-oriented, integrated, time-variant, and
Jagir Singh, Greeshma, P Singh University of Northern Virginia. Abstract
224 Business Intelligence Journal July DATA WAREHOUSING Ofori Boateng, PhD Professor, University of Northern Virginia BMGT531 1900- SU 2011 Business Intelligence Project Jagir Singh, Greeshma, P Singh
A Business Intelligence Training Document Using the Walton College Enterprise Systems Platform and Teradata University Network Tools Abstract
A Business Intelligence Training Document Using the Walton College Enterprise Systems Platform and Teradata University Network Tools Jeffrey M. Stewart College of Business University of Cincinnati [email protected]
Business Intelligence Systems
12 Business Intelligence Systems Business Intelligence Systems Bogdan NEDELCU University of Economic Studies, Bucharest, Romania [email protected] The aim of this article is to show the importance
Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data
INFO 1500 Introduction to IT Fundamentals 5. Database Systems and Managing Data Resources Learning Objectives 1. Describe how the problems of managing data resources in a traditional file environment are
THE TECHNOLOGY OF USING A DATA WAREHOUSE TO SUPPORT DECISION-MAKING IN HEALTH CARE
THE TECHNOLOGY OF USING A DATA WAREHOUSE TO SUPPORT DECISION-MAKING IN HEALTH CARE Dr. Osama E.Sheta 1 and Ahmed Nour Eldeen 2 1,2 Department of Mathematics (Computer Science) Faculty of Science, Zagazig
Lection 3-4 WAREHOUSING
Lection 3-4 DATA WAREHOUSING Learning Objectives Understand d the basic definitions iti and concepts of data warehouses Understand data warehousing architectures Describe the processes used in developing
Data Warehousing Concepts
Data Warehousing Concepts JB Software and Consulting Inc 1333 McDermott Drive, Suite 200 Allen, TX 75013. [[[[[ DATA WAREHOUSING What is a Data Warehouse? Decision Support Systems (DSS), provides an analysis
RESEARCH OF DECISION SUPPORT SYSTEM (DSS) FOR GREENHOUSE BASED ON DATA MINING
RESEARCH OF DECISION SUPPORT SYSTEM (DSS) FOR GREENHOUSE BASED ON DATA MINING Cheng Wang 1, Lili Wang 2,*, Ping Dong 2, Xiaojun Qiao 1 1 National Engineering Research Center for Information Technology
Healthcare Measurement Analysis Using Data mining Techniques
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 03 Issue 07 July, 2014 Page No. 7058-7064 Healthcare Measurement Analysis Using Data mining Techniques 1 Dr.A.Shaik
