INTEGRATION OF HETEROGENEOUS DATABASES IN ACADEMIC ENVIRONMENT USING OPEN SOURCE ETL TOOLS
|
|
- Stuart Russell
- 8 years ago
- Views:
Transcription
1 INTEGRATION OF HETEROGENEOUS DATABASES IN ACADEMIC ENVIRONMENT USING OPEN SOURCE ETL TOOLS Azwa A. Aziz, Abdul Hafiz Abdul Wahid, Nazirah Abd. Hamid, Azilawati Rozaimee Fakulti Informatik, Universiti Sultan ZainalAbidin (UniSZA), Kuala Terengganu, Terengganu, Malaysia ABSTRACT Data warehouse (DW) can be considered as the most significant tool for strategic decision making. However, in academic environment, the components of data warehouse still have not been completely utilized. The aim of this paper is to design and implement ETL processes through the data integration of heterogeneous databases using TOS in academic environment. The usage of TOS for designing and semi automatically implementing ETL tasks in Java enables a fast way of adapting to new data sources. We already manage to integrate various sources of heterogeneous academic data sample and populate them in one central repository using TOS. It is important to ensure the capabilities of open source ETL are equal to any commercial products. Consequently, it will help in implementing DW projects with lower cost. KEYWORDS Extraction, transformation, and loading(etl), Business Intelligence (BI), Data Warehouse (DW), Heterogeneous DBMS, Talend Open Studio (TOS) 1 INTRODUCTION Data warehouse (DW) can be considered as the most significant tool for strategic decision making in business. A welldeveloped DW can dramatically improve an organization s decision-making capabilities. In the early years, the costs for the development of a DW were very expensive. However, lately the costs for developing and maintaining a DW has significantly lowered, thus it has progressively becoming a functional tool that can be used as repository of information to support managerial decision making [1], [2], [3], [4]. In academic environment, however, only recently show some interest in integrating DW in decision making processes. Academic institutions still exploring the possibilities and benefits of data warehouse, therefore, in the development of decision support systems, the components of data warehouse still have not been utilized completely [5]. The factors affecting the optimal management of the institution especially in decision making are the same factors that involve in the business processes, so the management of an academic institution can be considered as critical as the management of a large business company [6]. In ensuring to achieve the optimal management of the institution, data warehouse can be integrated with Business Intelligence (BI). The goal of data warehouse is not only to utilize BI but also to do it effectively. BI is a term that refers to a variety of software applications that can be used to analyze an organization s raw data. BI is made of several related activities, including data mining, online analytical processing, querying and reporting [7]. The main goal of BI is to produce correct and accurate information in order to make effective decisions. BI gives the users the ability to transform data into usable 433
2 information, thus taking apparently useless data and turning it into valuable information. The aim of this paper is to explicate the design and implementation of extraction, transformation, and loading (ETL) processes during the initial design and deployment stage through the data integration of heterogeneous databases using open source tools in academic environment. In academic environment, ETL can be considered as a valuable process because information from academic institutions involve data that are coming from many dissimilar sources such as academic systems, co-curriculum systems, hostel system and many more. In this research, ETL tools are used to extract data from different sources and then clean the data and make it uniform for the transformation process. The output from transformation process is loaded into the data mart. The data is merged into the data mart to give the decision makers the power to look through the data from different locations. This will increase the ability of decision makers to filter the data [8]. The paper is organized as follows. Section 2 describes some related research on components of data warehouse. Section 3 explains the design of the proposed system and the experimental design is described in Section 4. We present the result analysis and discussion in Section 5 and lastly we conclude this work in Section 6. 2 LITERATURE REVIEW There are many research that have been done to discuss and explain the practices, tools and standards that involve in ETL, DW, BI and any related technologies. There are many strategies that can be applied in the deployment of DW. In [9], authors proposed a framework for design, development and deployment of a DW. The framework involved combining meta-model with an ontology. The main outcome from the framework showed that it could upgrade the interoperability function in ETL processes. Reed et al. [10] proposed a robust yet economical means for undertaking any stated goal by utilizing Pentaho tools for ETL. The goal of this research was to combine different databases into a single data repository. This data repository could be applied to view and examine domestic violence victim and offender data across organizations to provide reports. Other than that, by using Pentaho, the authors intended to remove data conflicts and to generate a demographic profile throughout the criminal justice system. Final output of data mart signified the integrated and reliable information from different data sources. Dell aquila et al. [6] explained the practices in designing and modeling of an academic DW. The objectives of this specific academic DW were to produce an exclusive structure of analysis and report for administrative structures, such as departments for the students and also to supply real time data for external agencies. The outcome of this research was the DW provided a centralized source of information accessible through diverse academic units. Piedade and Santos [11] discussed the concepts, practices and architectures of Student Relationship Management (SRM) system. The main objective of this research was to provide a technical tool that could support higher education institution to gain knowledge that was vital to the decisionmaking procedure. In order to validate the proposed concepts and activities, they adopted a research methodology which involved the comprehension of a set of interviews. The results of this research involved two stages. The first stage verified 434
3 that there was no suitable technology that existed to support SRM concepts and practices. The second stage proved that the proposed framework permitted the definition of the SRM system s architecture and its main functionalities. Sahay and Mehta, [12] developed a system to support higher education institutions in evaluating and predicting critical matters related to student success. The objective of this study was to use data mining techniques for classification, categorization, estimation, and visualization. They also intended to use predictive models to predict the critical student issues in order to determine a prioritized list of critical factors. Thomsen and Pedersen conducted two investigations on ETL tools in the years of 2005 and 2008 [13], [14]. From the investigations, the authors discovered that there were many existed open source ETL tools, such as OpenSrcETL, OpenETL, CloverETL, KETL, Kettle, Octopus and Talend. Most of the tools could meet the fundamental requirements of data processing such as could support the function of extracting data from heterogeneous data sources and load the data into ROLAP or MOLAP system. The authors stated that most of these open source ETL tools were not very powerful except for Talend Open Studio (TOS). Talend operates in an open source model, where services and ancillary features are offered on a subscription basis. From an affordability standpoint, Talend opens up the marketplace for transformation and integration to all customers, regardless of size and data integration needs [15]. 3 CONCEPTUAL FRAMEWORK Our proposed framework is based on previous works that discussed about concepts and practices of DW and ETL. In DW, it is a common practice to separate between back room and front room entities. The back room is holding and handling the data while the front room allows for data accession. The back room can be labeled as data management or data preparation section of the related material in an academic environment. In the existing application, several Databases Management System (DBMS) were used to support transactions systems such as MySQL, Informix, Oracle, and Microsoft Access. To integrate those heterogeneous DBMS was a complex task to do especially by using 3GL languages. To enforce system developer to use a single DBMS for all applications also not an option as they had to choose their DBMS for specific purpose. Thus, ETL plays a vital role in every integration projects in order to speed up the development and moreover to achieve good results. For ETL tasks, an open source application known as Talend Open Studio (TOS) was used to build tasks. The tasks included transferred data from multiple external heterogeneous data sources, then transformed, cleaned, and loaded the data into the application s repository [16]. In this environment, the front room section enabled a user or client application to access the data that had been detained in the warehouse. The key task of the front room was to map the heterogeneous low level data, usually stored in a DW to other forms [17].The front room managed the queries that performed at the outside and then scheduled and planned them in order to accomplish the results for performance issues or can be referred as Business Intelligence. Golfarelli et al. [18], described Business intelligence as the process of turning data into information and then into knowledge. The front room may offer techniques of data mining, text mining or classical statistical methods that can be 435
4 performed on Data Marts (DM) and multidimensional cubes. planed, to enable an early user interaction in establishing the warehouse. 4 SIMULATION DESIGN Figure 1: The main components of the proposed framework. The proposed ETL framework consists of several main features. One of the features is the conversion of given file formats or database formats. This feature needs to be fitted to the structure that is needed by the loading processes, which store the data in the repository. Figure 1 shows ETL process embedded in the environment, excluding any parts of the front room. In data source stage, data comes from multiple data sources such as student personal details and academic records. This information can be accessed in different file format, including simple flat files, more complex XML files or as a database including Microsoft Excel and Access, MySQL and IBM Informix. The loading section makes use of Java code and create ETL job by TOS tool which is used in order to create a simple appliance of importing data from particular available columns as identified in data preparation. These files can be later reused in the ETL module to read, customize alteration and store the data. The requirement for data conversion consumes about 70-80% of the time used to build a DW [19], meanwhile the conversion and transformation steps including Java classes created by TOS, are the first software components to be considered and The purpose of this simulation is to prove that TOS is as capable to perform jobs as any commercial ETL tools. To prove the proposed framework, we use dummy data for simulation from two different DBMS which are Oracle and MYSQL to test ETL process in academic environment. Other than that, a text file in Microsoft Excel format has been added as source data to integrate with both DBMS. Oracle has been chosen as target database because most of enterprise companies are using Oracle in their enterprise applications. In this simulation, we have designed a multidimensional table that consists of fact and dimension tables. aim of the design is to perform analysis on student result based on ongoing assessments and demographic analysis using BI tools. A fact table is created in target database contains of students results, specifically programming subjects, known as fctstu. Two dimension tables are connected to fctstu which are dimpro and dimasses. dimpro consist of student personal information such as name and gender and also geographic information while dimassescontain of detail assessment results of particulars subject. Figure 2 shows Entity Relationship Diagram (ERD) of target systems. The design in DBMS involves Oracle and MYSQL. In MYSQL, a table known as r2is created to store a record of students results with information such as student names and gender. Meanwhile, in Oracle, a table known as stuinfo is created to store students personal information such as address, state and parent incomes. Whereas, Microsoft 436
5 Excel file contains of student assessment marks for the hold semester. Table 1: Sources system attributes DBMS DB TABLE ATTRIBUTE NAME MYSQL RPS r1 name matricno gender semester subjectcode gred ORACLE STUPRO stuinfo matricno name gender address state spmres parinc ExcelFile finalmark Assesme nt Mark matricno Section A Section B Section C Figure 3: Creating a connection to ORACLE and Figure 2: Multidimensional design for target system (Student results fact table). These three heterogeneous data sources then populated to the target databases in star schema by using TOS. Table 1shows detail of source column in each DBMS and text files. The first step in developing ETL jobs is to ensure successful connection has been established between TOS to respective databases. A GUI interface is developed to help in performing task as in Figure 3. Then, by using Structured Query Language (SQL) Builder, a test connection can be conducted by viewing data through TOS. The SQL statement can be manipulated to choose entities with particular attribute to perform analysis. Figure 4 shows the results in stuinfo table that have been generated using SQL statement to Oracle DBMS. The result is shows all data in. MYSQL DBMS In Figure 5 shows that SQL Builder interfaces generate result from query made to MYSQL DBMS. It shows a detail of r1records from sources systems. 437
6 perform data transformation when extracting data from sources. Figure 7 shows tmap interface when populating ETL jobs from simulation in academic environment. Figure 4: Result from stuinfo (ORACLE DBMS) Once connection has been successfully established, ETL jobs can be designed by using ETL jobs menu. This menu provides friendly interface where we can simply drag and drop to map a data from data sources to the target database/dw as shown in Figure 6. Figure 7: Mapping using tmap The final process is to compile and execute the jobs. Logs files are provided to give feedbacks either successful or fail run on the running jobs. In this simulation, all data from heterogeneous sources have been extracted to target database for respective DW tables. 5 EXPECTED OUTCOME Figure 6: ETL job from source to target TOS provide several functionalities to perform data extraction and transformation. One of basic functionalities is tmap which is used to develop a simple mapping from source to target. tmap provides a feature to The implementation of DW is crucial when dealing with various application systems that have their own characteristics. Nowadays, most organizations need one central repository that able to summarize all transactions data that will be used as guidelines for make decision. A successful project of DW implementation has been proven in various sectors. However, that main challenge of DW projects is the cost that involve in implementing the technology. Open source ETL is an option to reduce the cost for DW projects. The usage of TOS for designing 438
7 and semi automatically implementing ETL tasks in Java enables a fast way of adapting to new data sources. We already manage to integrate various sources of heterogeneous academic data sample and populate them in one central repository using TOS. We hope the same results will be achieved if the real life data is used. It is important to ensure the capabilities of open source ETL are equal to any commercial products. Consequently, it will help in implementing DW projects with lower cost. 6 CONCLUSION & FUTURE WORK This paper explained the design and implementation of an open source ETL tools in heterogeneous databases integration. It is our contribution to provide DW architecture with open source technologies in academic environment. We expect the architecture to evolve as the project matures, which should help to fit open source technologies into data warehouse. The main challenge to complete this research is to implement the framework in real life cases and to accommodate extra practical problems including BI. 7 REFERENCES 1. Immon, W. H.: Building the Data Warehouse. John Wiley & Sons, Chaundhuri, S., Dayal, Ganti, U. and V.: Database technology for decision support systems, IEEE Computer, Vol. 34, No 12, Jarke, M., Lenzerini, Vassiliou, M. Y. and Vassiliadis, P.: Fundamentals of Data Warehouses. Springer-Verlag, Kimball, R. and Ross, M.: The Data Warehouse Toolkit, 2nd edition. John Wiley & Sons, Wierschem, D., McMillen, J. and McBroom, R. :What Academia Can Gain from Building a Data Warehouse. EDUCAUSE Quarterly, Vol. 26, No. 1, Dell aquila, C., Tria, F. D., Lefons, E. and Tangorra, F.: An Academic Data Warehouse. In :Proceedings of the 7th WSEAS International Conference on Applied Informatics and Communications, Athens, Greece, August 24-26, Mulcahy, Ryan.: Business Intelligence Definition and Solutions. CIO.com. N.p., n.d. Web. 10 Oct Kimball, R., Reeves, L., Ross, M., and Thronthwaite, W. :The Data Warehouse Lifecycle Toolkit. Wiley, New York, Hoang, A. T. and Nguyen, B.: An Integrated Use of CWM and Ontological modeling approaches towards ETL Processes. In: IEEE International Conference on e-business Engineering Reed, S. E., Na, D. Y., Mayo, T. C., Shapiro, L. W. Joseph, Duty, B., Conklin, J. H. Donald Brown, E. : Implementing and Analyzing a Data Mart for the Arlington County Initiative to Manage Domestic Violence Offenders. In :Proceedings of the 2010 IEEE Systems and Information Engineering Design Symposium University of Virginia, Charlottesville, VA, USA, April 23, Piedade, M. B. and Santos, M. Y.: Student Relationship Management: Concept. Practice and Technological Support /08/2008 IEEE Sahay, A. and Mehta, K..: Assisting Higher Education in Assessing, Predicting, and Managing Issues Related to Student Success: A Web-based Software using Data Mining and Quality Function Deployment. Academic and Business Research Institute Conference, Las Vegas, Thomsen, C. and Pedersen, T. B. : A Survey of Open Source Tools for Business Intelligence. In: International Journal of Data Warehousing and Mining, Thomsen, C., Pedersen, T.B. and Lehner, W..: RiTE: Providing On-demand Data for Right-time Data Warehousing. In: Proc. of ICDE, Inmon, W.H. The Evolution of Integration. A White Paper by W. H. Inmon Talend Open Studio, [Online]: Fayyad, U. M., Piatetsky-Shapiro, G. and Smyth, P..: From data mining to knowledge discovery: An overview, in Advances in Knowledge Discovery and Data Mining:AAAI Press, Golfarelli, M., Rizzi, S. and Cella, I..: Beyond Data warehousing: what s next in business intelligence. In: Proceedings of the 7th ACM international workshop on data warehousing and OLAP, November Schönbach, C., Kowalski-Saunders, P. and Brusic, V..: Data warehousing in molecular biology. Brief. Bioinform. vol. 1, no. 1, pp , May
CONCEPTUAL FRAMEWORK OF BUSINESS INTELLIGENCE ANALYSIS IN ACADEMIC ENVIRONMENT USING BIRT
CONCEPTUAL FRAMEWORK OF BUSINESS INTELLIGENCE ANALYSIS IN ACADEMIC ENVIRONMENT USING BIRT Julaily Aida Jusoh, Norhakimah Endot, Nazirah Abd. Hamid, Raja Hasyifah Raja Bongsu, Roslinda Muda Faculty of Informatics,
More informationA DATA WAREHOUSE SOLUTION FOR E-GOVERNMENT
A DATA WAREHOUSE SOLUTION FOR E-GOVERNMENT Xiufeng Liu 1 & Xiaofeng Luo 2 1 Department of Computer Science Aalborg University, Selma Lagerlofs Vej 300, DK-9220 Aalborg, Denmark 2 Telecommunication Engineering
More informationDimensional Modeling for Data Warehouse
Modeling for Data Warehouse Umashanker Sharma, Anjana Gosain GGS, Indraprastha University, Delhi Abstract Many surveys indicate that a significant percentage of DWs fail to meet business objectives or
More informationIndexing Techniques for Data Warehouses Queries. Abstract
Indexing Techniques for Data Warehouses Queries Sirirut Vanichayobon Le Gruenwald The University of Oklahoma School of Computer Science Norman, OK, 739 sirirut@cs.ou.edu gruenwal@cs.ou.edu Abstract Recently,
More informationBussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University
Bussiness Intelligence and Data Warehouse Schedule Bussiness Intelligence (BI) BI tools Oracle vs. Microsoft Data warehouse History Tools Oracle vs. Others Discussion Business Intelligence (BI) Products
More informationBusiness Intelligence in E-Learning
Business Intelligence in E-Learning (Case Study of Iran University of Science and Technology) Mohammad Hassan Falakmasir 1, Jafar Habibi 2, Shahrouz Moaven 1, Hassan Abolhassani 2 Department of Computer
More informationData Warehousing. Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de. Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1
Jens Teubner Data Warehousing Winter 2015/16 1 Data Warehousing Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de Winter 2015/16 Jens Teubner Data Warehousing Winter 2015/16 13 Part II Overview
More informationBUILDING OLAP TOOLS OVER LARGE DATABASES
BUILDING OLAP TOOLS OVER LARGE DATABASES Rui Oliveira, Jorge Bernardino ISEC Instituto Superior de Engenharia de Coimbra, Polytechnic Institute of Coimbra Quinta da Nora, Rua Pedro Nunes, P-3030-199 Coimbra,
More informationA Model-based Software Architecture for XML Data and Metadata Integration in Data Warehouse Systems
Proceedings of the Postgraduate Annual Research Seminar 2005 68 A Model-based Software Architecture for XML and Metadata Integration in Warehouse Systems Abstract Wan Mohd Haffiz Mohd Nasir, Shamsul Sahibuddin
More informationFlexible Data Warehouse Parameters: Toward Building an Integrated Architecture
Flexible Data Warehouse Parameters: Toward Building an Integrated Architecture Mustafa Musa Jaber, Mohd Khanapi Abd Ghani, Nanna Suryana, Mohammed Aal Mohammed, and Thamir Abbas Abstract Clinical databases
More informationAn Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of
An Introduction to Data Warehousing An organization manages information in two dominant forms: operational systems of record and data warehouses. Operational systems are designed to support online transaction
More informationData Integration and ETL Process
Data Integration and ETL Process Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master studies, second
More informationA Design and implementation of a data warehouse for research administration universities
A Design and implementation of a data warehouse for research administration universities André Flory 1, Pierre Soupirot 2, and Anne Tchounikine 3 1 CRI : Centre de Ressources Informatiques INSA de Lyon
More informationMeta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring
www.ijcsi.org 78 Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring Mohammed Mohammed 1 Mohammed Anad 2 Anwar Mzher 3 Ahmed Hasson 4 2 faculty
More informationWhat is Management Reporting from a Data Warehouse and What Does It Have to Do with Institutional Research?
What is Management Reporting from a Data Warehouse and What Does It Have to Do with Institutional Research? Emily Thomas Stony Brook University AIRPO Winter Workshop January 2006 Data to Information Historically
More informationA FRAMEWORK FOR EDUCATIONAL DATA WAREHOUSE (EDW) ARCHITECTURE USING BUSINESS INTELLIGENCE (BI) TECHNOLOGIES
A FRAMEWORK FOR EDUCATIONAL DATA WAREHOUSE (EDW) ARCHITECTURE USING BUSINESS INTELLIGENCE (BI) TECHNOLOGIES 1 AZWA ABDUL AZIZ, 1 JULAILY AIDA JUSOH, 1 HASNI HASSAN, 1 WAN MOHD RIZHAN WAN IDRIS, 2 ADDY
More informationChapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya
Chapter 6 Basics of Data Integration Fundamentals of Business Analytics Learning Objectives and Learning Outcomes Learning Objectives 1. Concepts of data integration 2. Needs and advantages of using data
More informationSQL Server 2012 Business Intelligence Boot Camp
SQL Server 2012 Business Intelligence Boot Camp Length: 5 Days Technology: Microsoft SQL Server 2012 Delivery Method: Instructor-led (classroom) About this Course Data warehousing is a solution organizations
More informationA Framework for Developing the Web-based Data Integration Tool for Web-Oriented Data Warehousing
A Framework for Developing the Web-based Integration Tool for Web-Oriented Warehousing PATRAVADEE VONGSUMEDH School of Science and Technology Bangkok University Rama IV road, Klong-Toey, BKK, 10110, THAILAND
More informationImplementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777
Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777 Course Outline Module 1: Introduction to Data Warehousing This module provides an introduction to the key components of a data warehousing
More informationA Data Warehouse Design for A Typical University Information System
(JCSCR) - ISSN 2227-328X A Data Warehouse Design for A Typical University Information System Youssef Bassil LACSC Lebanese Association for Computational Sciences Registered under No. 957, 2011, Beirut,
More informationA Knowledge Management Framework Using Business Intelligence Solutions
www.ijcsi.org 102 A Knowledge Management Framework Using Business Intelligence Solutions Marwa Gadu 1 and Prof. Dr. Nashaat El-Khameesy 2 1 Computer and Information Systems Department, Sadat Academy For
More informationData Warehousing Systems: Foundations and Architectures
Data Warehousing Systems: Foundations and Architectures Il-Yeol Song Drexel University, http://www.ischool.drexel.edu/faculty/song/ SYNONYMS None DEFINITION A data warehouse (DW) is an integrated repository
More informationCourse Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning
Course Outline: Course: Implementing a Data with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning Duration: 5.00 Day(s)/ 40 hrs Overview: This 5-day instructor-led course describes
More informationIST722 Data Warehousing
IST722 Data Warehousing Components of the Data Warehouse Michael A. Fudge, Jr. Recall: Inmon s CIF The CIF is a reference architecture Understanding the Diagram The CIF is a reference architecture CIF
More informationDatawarehousing and Analytics. Data-Warehouse-, Data-Mining- und OLAP-Technologien. Advanced Information Management
Anwendersoftware a Datawarehousing and Analytics Data-Warehouse-, Data-Mining- und OLAP-Technologien Advanced Information Management Bernhard Mitschang, Holger Schwarz Universität Stuttgart Winter Term
More informationTurkish Journal of Engineering, Science and Technology
Turkish Journal of Engineering, Science and Technology 03 (2014) 106-110 Turkish Journal of Engineering, Science and Technology journal homepage: www.tujest.com Integrating Data Warehouse with OLAP Server
More informationINTEROPERABILITY IN DATA WAREHOUSES
INTEROPERABILITY IN DATA WAREHOUSES Riccardo Torlone Roma Tre University http://torlone.dia.uniroma3.it/ SYNONYMS Data warehouse integration DEFINITION The term refers to the ability of combining the content
More informationIntegrating Ingres in the Information System: An Open Source Approach
Integrating Ingres in the Information System: WHITE PAPER Table of Contents Ingres, a Business Open Source Database that needs Integration... 3 Scenario 1: Data Migration... 4 Scenario 2: e-business Application
More informationChapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
More informationData Warehousing and OLAP Technology for Knowledge Discovery
542 Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories
More informationDeriving Business Intelligence from Unstructured Data
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 9 (2013), pp. 971-976 International Research Publications House http://www. irphouse.com /ijict.htm Deriving
More informationIntegrating data in the Information System An Open Source approach
WHITE PAPER Integrating data in the Information System An Open Source approach Table of Contents Most IT Deployments Require Integration... 3 Scenario 1: Data Migration... 4 Scenario 2: e-business Application
More informationData Warehousing and Data Mining
Data Warehousing and Data Mining Part I: Data Warehousing Gao Cong gaocong@cs.aau.dk Slides adapted from Man Lung Yiu and Torben Bach Pedersen Course Structure Business intelligence: Extract knowledge
More informationEnabling Better Business Intelligence and Information Architecture With SAP Sybase PowerDesigner Software
SAP Technology Enabling Better Business Intelligence and Information Architecture With SAP Sybase PowerDesigner Software Table of Contents 4 Seeing the Big Picture with a 360-Degree View Gaining Efficiencies
More informationCONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University
CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University Given today s business environment, at times a corporate executive
More informationProper study of Data Warehousing and Data Mining Intelligence Application in Education Domain
Journal of The International Association of Advanced Technology and Science Proper study of Data Warehousing and Data Mining Intelligence Application in Education Domain AMAN KADYAAN JITIN Abstract Data-driven
More informationETL-EXTRACT, TRANSFORM & LOAD TESTING
ETL-EXTRACT, TRANSFORM & LOAD TESTING Rajesh Popli Manager (Quality), Nagarro Software Pvt. Ltd., Gurgaon, INDIA rajesh.popli@nagarro.com ABSTRACT Data is most important part in any organization. Data
More informationBusiness Intelligence Systems: a Comparative Analysis
Business Intelligence Systems: a Comparative Analysis Carlo DELL AQUILA, Francesco DI TRIA, Ezio LEFONS, and Filippo TANGORRA Dipartimento di Informatica Università di Bari via Orabona 4, I-70125 Bari
More informationData Warehouse: Introduction
Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,
More informationData Warehouses and Business Intelligence ITP 487 (3 Units) Fall 2013. Objective
Data Warehouses and Business Intelligence ITP 487 (3 Units) Objective Fall 2013 While the increased capacity and availability of data gathering and storage systems have allowed enterprises to store more
More informationSQL Server 2012 End-to-End Business Intelligence Workshop
USA Operations 11921 Freedom Drive Two Fountain Square Suite 550 Reston, VA 20190 solidq.com 800.757.6543 Office 206.203.6112 Fax info@solidq.com SQL Server 2012 End-to-End Business Intelligence Workshop
More informationEnabling Better Business Intelligence and Information Architecture With SAP PowerDesigner Software
SAP Technology Enabling Better Business Intelligence and Information Architecture With SAP PowerDesigner Software Table of Contents 4 Seeing the Big Picture with a 360-Degree View Gaining Efficiencies
More informationCourse 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing
More informationOpen Source Business Intelligence Intro
Open Source Business Intelligence Intro Stefano Scamuzzo Senior Technical Manager Architecture & Consulting Research & Innovation Division Engineering Ingegneria Informatica The Open Source Question In
More informationOracle9i Data Warehouse Review. Robert F. Edwards Dulcian, Inc.
Oracle9i Data Warehouse Review Robert F. Edwards Dulcian, Inc. Agenda Oracle9i Server OLAP Server Analytical SQL Data Mining ETL Warehouse Builder 3i Oracle 9i Server Overview 9i Server = Data Warehouse
More informationOpen Source Business Intelligence Tools: A Review
Open Source Business Intelligence Tools: A Review Amid Khatibi Bardsiri 1 Seyyed Mohsen Hashemi 2 1 Bardsir Branch, Islamic Azad University, Kerman, IRAN 2 Science and Research Branch, Islamic Azad University,
More informationSAS BI Course Content; Introduction to DWH / BI Concepts
SAS BI Course Content; Introduction to DWH / BI Concepts SAS Web Report Studio 4.2 SAS EG 4.2 SAS Information Delivery Portal 4.2 SAS Data Integration Studio 4.2 SAS BI Dashboard 4.2 SAS Management Console
More informationA Critical Review of Data Warehouse
Global Journal of Business Management and Information Technology. Volume 1, Number 2 (2011), pp. 95-103 Research India Publications http://www.ripublication.com A Critical Review of Data Warehouse Sachin
More informationMethodology Framework for Analysis and Design of Business Intelligence Systems
Applied Mathematical Sciences, Vol. 7, 2013, no. 31, 1523-1528 HIKARI Ltd, www.m-hikari.com Methodology Framework for Analysis and Design of Business Intelligence Systems Martin Závodný Department of Information
More informationDATA MINING AND WAREHOUSING CONCEPTS
CHAPTER 1 DATA MINING AND WAREHOUSING CONCEPTS 1.1 INTRODUCTION The past couple of decades have seen a dramatic increase in the amount of information or data being stored in electronic format. This accumulation
More informationHybrid Support Systems: a Business Intelligence Approach
Journal of Applied Business Information Systems, 2(2), 2011 57 Journal of Applied Business Information Systems http://www.jabis.ro Hybrid Support Systems: a Business Intelligence Approach Claudiu Brandas
More informationA Survey on Data Warehouse Architecture
A Survey on Data Warehouse Architecture Rajiv Senapati 1, D.Anil Kumar 2 1 Assistant Professor, Department of IT, G.I.E.T, Gunupur, India 2 Associate Professor, Department of CSE, G.I.E.T, Gunupur, India
More informationA Survey: Data Warehouse Architecture
, pp. 349-356 http://dx.doi.org/10.14257/ijhit.2015.8.5.37 A Survey: Data Warehouse Architecture 1,2 Muhammad Arif, 1 Ghulam Mujtaba 1 Faculty of Computer Science and Information Technology, University
More informationINTELLIGENT PROFILE ANALYSIS GRADUATE ENTREPRENEUR (ipage) SYSTEM USING BUSINESS INTELLIGENCE TECHNOLOGY
INTELLIGENT PROFILE ANALYSIS GRADUATE ENTREPRENEUR (ipage) SYSTEM USING BUSINESS INTELLIGENCE TECHNOLOGY Muhamad Shahbani, Azman Ta a, Mohd Azlan, and Norshuhada Shiratuddin INTRODUCTION Universiti Utara
More informationChapter 5. Warehousing, Data Acquisition, Data. Visualization
Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives
More informationDELIVERING DATABASE KNOWLEDGE WITH WEB-BASED LABS
DELIVERING DATABASE KNOWLEDGE WITH WEB-BASED LABS Wang, Jiangping Webster University Kourik, Janet L. Webster University ABSTRACT This paper describes the design of web-based labs that are used in database-related
More informationCourse Outline. Module 1: Introduction to Data Warehousing
Course Outline Module 1: Introduction to Data Warehousing This module provides an introduction to the key components of a data warehousing solution and the highlevel considerations you must take into account
More informationwww.ijreat.org Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 28
Data Warehousing - Essential Element To Support Decision- Making Process In Industries Ashima Bhasin 1, Mr Manoj Kumar 2 1 Computer Science Engineering Department, 2 Associate Professor, CSE Abstract SGT
More informationEvaluation of Business Intelligence Systems
Evaluation of Business Intelligence Systems Francesco Di Tria, Ezio Lefons, and Filippo Tangorra Dipartimento di Informatica, Università degli Studi di Bari Aldo Moro via Orabona 4, 70125 Bari Italy francescoditria,
More information5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2
Class Announcements TIM 50 - Business Information Systems Lecture 15 Database Assignment 2 posted Due Tuesday 5/26 UC Santa Cruz May 19, 2015 Database: Collection of related files containing records on
More informationPaper 3510-2015 SAS Visual Analytics: Emerging Trend in Institutional Research Sivakumar Jaganathan, Thulasi Kumar University of Connecticut
Paper 3510-2015 SAS Visual Analytics: Emerging Trend in Institutional Research Sivakumar Jaganathan, Thulasi Kumar University of Connecticut ABSTRACT Institutional research and effectiveness offices at
More informationAnalyzing Polls and News Headlines Using Business Intelligence Techniques
Analyzing Polls and News Headlines Using Business Intelligence Techniques Eleni Fanara, Gerasimos Marketos, Nikos Pelekis and Yannis Theodoridis Department of Informatics, University of Piraeus, 80 Karaoli-Dimitriou
More informationTalend Metadata Manager. Reduce Risk and Friction in your Information Supply Chain
Talend Metadata Manager Reduce Risk and Friction in your Information Supply Chain Talend Metadata Manager Talend Metadata Manager provides a comprehensive set of capabilities for all facets of metadata
More informationBusiness Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers
60 Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative
More informationMetadata Management for Data Warehouse Projects
Metadata Management for Data Warehouse Projects Stefano Cazzella Datamat S.p.A. stefano.cazzella@datamat.it Abstract Metadata management has been identified as one of the major critical success factor
More informationChallenges in developing a cost-effective data warehouse for a tertiary institution in a developing country
Data Mining VII: Data, Text and Web Mining and their Business Applications 389 Challenges in developing a cost-effective data warehouse for a tertiary institution in a developing country A. Nazir & T.
More informationMigrating a Discoverer System to Oracle Business Intelligence Enterprise Edition
Migrating a Discoverer System to Oracle Business Intelligence Enterprise Edition Milena Gerova President Bulgarian Oracle User Group mgerova@technologica.com Who am I Project Manager in TechnoLogica Ltd
More informationMonitoring Genebanks using Datamarts based in an Open Source Tool
Monitoring Genebanks using Datamarts based in an Open Source Tool April 10 th, 2008 Edwin Rojas Research Informatics Unit (RIU) International Potato Center (CIP) GPG2 Workshop 2008 Datamarts Motivation
More informationBusiness Processes Meet Operational Business Intelligence
Business Processes Meet Operational Business Intelligence Umeshwar Dayal, Kevin Wilkinson, Alkis Simitsis, Malu Castellanos HP Labs, Palo Alto, CA, USA Abstract As Business Intelligence architectures evolve
More informationImplementing a Data Warehouse with Microsoft SQL Server 2012 (70-463)
Implementing a Data Warehouse with Microsoft SQL Server 2012 (70-463) Course Description Data warehousing is a solution organizations use to centralize business data for reporting and analysis. This five-day
More informationData Warehousing and Data Mining in Business Applications
133 Data Warehousing and Data Mining in Business Applications Eesha Goel CSE Deptt. GZS-PTU Campus, Bathinda. Abstract Information technology is now required in all aspect of our lives that helps in business
More informationSubject Description Form
Subject Description Form Subject Code Subject Title COMP417 Data Warehousing and Data Mining Techniques in Business and Commerce Credit Value 3 Level 4 Pre-requisite / Co-requisite/ Exclusion Objectives
More informationA Survey of ETL Tools
RESEARCH ARTICLE International Journal of Computer Techniques - Volume 2 Issue 5, Sep Oct 2015 A Survey of ETL Tools Mr. Nilesh Mali 1, Mr.SachinBojewar 2 1 (Department of Computer Engineering, University
More informationDeductive Data Warehouses and Aggregate (Derived) Tables
Deductive Data Warehouses and Aggregate (Derived) Tables Kornelije Rabuzin, Mirko Malekovic, Mirko Cubrilo Faculty of Organization and Informatics University of Zagreb Varazdin, Croatia {kornelije.rabuzin,
More informationEast Asia Network Sdn Bhd
Course: Analyzing, Designing, and Implementing a Data Warehouse with Microsoft SQL Server 2014 Elements of this syllabus may be change to cater to the participants background & knowledge. This course describes
More informationBeta: Implementing a Data Warehouse with Microsoft SQL Server 2012
CÔNG TY CỔ PHẦN TRƯỜNG CNTT TÂN ĐỨC TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC LEARN MORE WITH LESS! Course 10777: Beta: Implementing a Data Warehouse with Microsoft SQL Server 2012 Length: 5 Days Audience:
More informationMicrosoft Services Exceed your business with Microsoft SharePoint Server 2010
Microsoft Services Exceed your business with Microsoft SharePoint Server 2010 Business Intelligence Suite Alexandre Mendeiros, SQL Server Premier Field Engineer January 2012 Agenda Microsoft Business Intelligence
More informationMIS636 AWS Data Warehousing and Business Intelligence Course Syllabus
MIS636 AWS Data Warehousing and Business Intelligence Course Syllabus I. Contact Information Professor: Joseph Morabito, Ph.D. Office: Babbio 419 Office Hours: By Appt. Phone: 201-216-5304 Email: jmorabit@stevens.edu
More informationThe Integration of Agent Technology and Data Warehouse into Executive Banking Information System (EBIS) Architecture
The Integration of Agent Technology and Data Warehouse into Executive Banking System (EBIS) Architecture Ismail Faculty of Technology and Communication (FTMK) Technical University of Malaysia Melaka (UTeM),75450
More informationData Integration with Talend Open Studio Robert A. Nisbet, Ph.D.
Data Integration with Talend Open Studio Robert A. Nisbet, Ph.D. Most college courses in statistical analysis and data mining are focus on the mathematical techniques for analyzing data structures, rather
More informationExtensibility of Oracle BI Applications
Extensibility of Oracle BI Applications The Value of Oracle s BI Analytic Applications with Non-ERP Sources A White Paper by Guident Written - April 2009 Revised - February 2010 Guident Technologies, Inc.
More informationDoctoral Program in Informatics Data Warehousing Systems Proposal for a Course (2011-2012)
Doctoral Program in Informatics Data Warehousing Systems Proposal for a Course (2011-2012) MAP-i Joint Doctoral Program in Informatics University of Minho, University of Porto, and University of Aveiro
More informationData Warehouse Architecture Overview
Data Warehousing 01 Data Warehouse Architecture Overview DW 2014/2015 Notice! Author " João Moura Pires (jmp@di.fct.unl.pt)! This material can be freely used for personal or academic purposes without any
More informationCASE PROJECTS IN DATA WAREHOUSING AND DATA MINING
CASE PROJECTS IN DATA WAREHOUSING AND DATA MINING Mohammad A. Rob, University of Houston-Clear Lake, rob@uhcl.edu Michael E. Ellis, University of Houston-Clear Lake, ellisme@uhcl.edu ABSTRACT This paper
More informationDBTech Pro Workshop. Knowledge Discovery from Databases (KDD) Including Data Warehousing and Data Mining. Georgios Evangelidis
DBTechNet DBTech Pro Workshop Knowledge Discovery from Databases (KDD) Including Data Warehousing and Data Mining Dimitris A. Dervos dad@it.teithe.gr http://aetos.it.teithe.gr/~dad Georgios Evangelidis
More informationCOURSE 20463C: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER
Page 1 of 8 ABOUT THIS COURSE This 5 day course describes how to implement a data warehouse platform to support a BI solution. Students will learn how to create a data warehouse with Microsoft SQL Server
More informationImplementing a Data Warehouse with Microsoft SQL Server
Page 1 of 7 Overview This course describes how to implement a data warehouse platform to support a BI solution. Students will learn how to create a data warehouse with Microsoft SQL 2014, implement ETL
More informationCourse Design Document. IS417: Data Warehousing and Business Analytics
Course Design Document IS417: Data Warehousing and Business Analytics Version 2.1 20 June 2009 IS417 Data Warehousing and Business Analytics Page 1 Table of Contents 1. Versions History... 3 2. Overview
More informationBusiness Intelligence System Using Goal-Ontology Approach: A Case Study in Universiti Utara Malaysia
Business Intelligence System Using Goal-Ontology Approach: A Case Study in Universiti Utara Malaysia Azman Ta a and Mohd Syazwan Abdullah Universiti Utara Malaysia, Malaysia azman@uum.edu.my, syazwan@uum.edu.my
More informationHETEROGENEOUS DATA TRANSFORMING INTO DATA WAREHOUSES AND THEIR USE IN THE MANAGEMENT OF PROCESSES
HETEROGENEOUS DATA TRANSFORMING INTO DATA WAREHOUSES AND THEIR USE IN THE MANAGEMENT OF PROCESSES Pavol TANUŠKA, Igor HAGARA Authors: Assoc. Prof. Pavol Tanuška, PhD., MSc. Igor Hagara Workplace: Institute
More informationBUILDING A WEB-ENABLED DATA WAREHOUSE FOR DECISION SUPPORT IN CONSTRUCTION EQUIPMENT MANAGEMENT
BUILDING A WEB-ENABLED DATA WAREHOUSE FOR DECISION SUPPORT IN CONSTRUCTION EQUIPMENT MANAGEMENT Hongqin Fan (hfan@ualberta.ca) Graduate Research Assistant, University of Alberta, AB, T6G 2E1, Canada Hyoungkwan
More informationCourse 10777A: Implementing a Data Warehouse with Microsoft SQL Server 2012
Course 10777A: Implementing a Data Warehouse with Microsoft SQL Server 2012 OVERVIEW About this Course Data warehousing is a solution organizations use to centralize business data for reporting and analysis.
More informationData warehouse life-cycle and design
SYNONYMS Data Warehouse design methodology Data warehouse life-cycle and design Matteo Golfarelli DEIS University of Bologna Via Sacchi, 3 Cesena Italy matteo.golfarelli@unibo.it DEFINITION The term data
More informationIMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria
More informationBUSINESS INTELLIGENCE TOOLS FOR IMPROVE SALES AND PROFITABILITY
Revista Tinerilor Economişti (The Young Economists Journal) BUSINESS INTELLIGENCE TOOLS FOR IMPROVE SALES AND PROFITABILITY Assoc. Prof Luminiţa Şerbănescu Ph. D University of Piteşti Faculty of Economics,
More informationSimCorp Solution Guide
SimCorp Solution Guide Data Warehouse Manager For all your reporting and analytics tasks, you need a central data repository regardless of source. SimCorp s Data Warehouse Manager gives you a comprehensive,
More informationImplementing Data Models and Reports with Microsoft SQL Server 20466C; 5 Days
Lincoln Land Community College Capital City Training Center 130 West Mason Springfield, IL 62702 217-782-7436 www.llcc.edu/cctc Implementing Data Models and Reports with Microsoft SQL Server 20466C; 5
More informationEzgi Dinçerden. Marmara University, Istanbul, Turkey
Economics World, Mar.-Apr. 2016, Vol. 4, No. 2, 60-65 doi: 10.17265/2328-7144/2016.02.002 D DAVID PUBLISHING The Effects of Business Intelligence on Strategic Management of Enterprises Ezgi Dinçerden Marmara
More information