Design of a Multi Dimensional Database for the Archimed DataWarehouse

Size: px
Start display at page:

Download "Design of a Multi Dimensional Database for the Archimed DataWarehouse"

Transcription

1 169 Design of a Multi Dimensional Database for the Archimed DataWarehouse Claudine Bréant, Gérald Thurler, François Borst, Antoine Geissbuhler Service of Medical Informatics University Hospital of Geneva, Geneva, Switzerland Abstract The Archimed data warehouse project started in 1993 at the Geneva University Hospital. It has progressively integrated seven data marts (or domains of activity) archiving medical data such as Admission/Discharge/Transfer (ADT) data, laboratory results, radiology exams, diagnoses, and procedure codes. The objective of the Archimed data warehouse is to facilitate the access to an integrated and coherent view of patient medical in order to support analytical activities such as medical statistics, clinical studies, retrieval of similar cases and data mining processes. This paper discusses three principal design aspects relative to the conception of the database of the data warehouse: 1) the granularity of the database, which refers to the level of detail or summarization of data, 2) the database model and architecture, describing how data will be presented to end users and how new data is integrated, 3) the life cycle of the database, in order to ensure long term scalability of the environment. Both, the organization of patient medical data using a standardized elementary fact representation and the use of the multi dimensional model have proved to be powerful design tools to integrate data coming from the multiple heterogeneous database systems part of the transactional Hospital Information System (HIS). Concurrently, the building of the data warehouse in an incremental way has helped to control the evolution of the data content. These three design aspects bring clarity and performance regarding data access. They also provide long term scalability to the system and resilience to further changes that may occur in source systems feeding the data warehouse. Keywords: Data warehouse; Medical Informatics; Database 1. Introduction A data warehouse can be very simply defined as a copy of enterprise transaction data specifically structured, integrated, and organized for query, data analysis, and decision support applications [1,2]. In that regard, the architecture, the life cycle, and the end users of a data warehouse are profoundly different from those of transactional systems [1,2]. Data models and server technology that speeds up transactional processing may not be appropriate for query and reporting processing. It is therefore recommended to develop a data warehouse separately from the transactional environment. Its main requirement is to provide an intuitive, easy and performant access to enterprise wide data so that queries and reports can be quickly produced on a regular basis.

2 170 A hospital data warehouse organized around patient medical data involves integrating a wide variety of health data, including patient records, medical images, and genetic information, for the purpose of researching and improving the diagnosis and treatment of ARCHIMED data warehouse (analytical and decision processes) patient medical service (activity) ADT Diagnoses Laboratory Procedures Radiology episode of care (analysis) data marts Hospital Information System (HIS) components (on line transactional systems) ADT Diagnoses Laboratory Radiology Procedures Figure 1 The Hospital Information System components and the Archimed data warehouse diseases. The objective is to use information technology to help achieve personalized health care by leveraging all the information and knowledge that exist about treated patients and, for instance, by studying how new patients compare with other patients with similar characteristics [3,4]. The development of the Archimed medical data warehouse started at the Geneva University Hospital in It has progressively integrated seven data marts (or domains of activity) archiving Admission/Discharge/Transfer (ADT) data, laboratory results, radiology exams, physiotherapy exams, clinical data relative to childbirth, diagnoses, and procedures codes. In the architecture of Archimed a data mart is simply a subset of the database for a specific domain of activity within the patient treatment workflow. As shown in figure 1, HIS operational systems are organized around applications managing for example ADT data or laboratory results. Archimed proposes a different organization of the data, which is dedicated to analytical activities such as medical statistics, clinical studies, retrieval of similar cases and data mining processes. As shown in figure 1, the Archimed data warehouse is organized around subject areas such as patient medical data, episodes of care, and medical specialties. End users can also access to the database through web-based applications [5]. This paper discusses three principal design aspects relative to the conception of the database of the Archimed data warehouse, towards the archiving and organization of patient medical data. The three following topics are described and discussed in the following sections: 1) the granularity of the database, which refers to the level of detail or summarization of data in the data warehouse, 2) the database model and architecture, describing how data will be presented to end users and how new data is integrated 3) the life cycle of the database, in order to ensure long term scalability of the environment. The many other aspects of the Archimed system such as data source acquisition, end user applications, and meta data, can be found in [6,7]. 2. The Archimed database records patient atomic data (or elementary facts) The granularity of the data warehouse database refers to the level of detail or summarization of data in the data warehouse. The storage of atomic data (low level of

3 171 granularity) versus storage of aggregated data (high level of granularity) is a major design issue since that it will affects the entire architecture of the data warehouse environment and more specifically the volume of the database and the type of query that can be answered. It is generally recognized that expressing the data at a low level of granularity will help to achieve a robust data warehouse, as it becomes resilient to changes and can easily accommodate new user needs. Moreover, when the s are granular, they serve as the natural destination for operational data that may be extracted frequently from the operational systems. The transactional hospital information system (HIS) creates and updates records describing most events occurring during patient in-patient or out-patient care. Events of interest for the Archimed data warehouse include encounters, observations, treatments, diagnosis and procedures. Relevant and validated facts such as encounter dates, laboratory results, radiology exams, diagnoses and procedure codes are transmitted to Archimed. The database has been designed to record patient medical data with that same level of detail under a format called standardized elementary fact. This offers many advantages over the storage of aggregated data. Indeed, whereas aggregated data already embeds the user queries, atomic data can be reshaped and presented in any format needed. Future unknown requirements or unusual queries can be accommodated, achieving flexibility of the system and reusability of the data. Each elementary fact in the Archimed database is represented by its value (generally a numeric or a code), along with several additional properties describing the context of this value such as : patient/episode identification - patient identification number, - episode identification number, medical structures - service (medical responsibility), - medical unit (patient location), basic patient description to facilitate statistical queries - patient age, - patient sex, link to dictionaries and nomenclatures which are specific to each domain of activity - property name (laboratory result name, radiology exam, diagnosis or procedure description) Corresponding dictionaries are necessary for a correct interpretation of the fact at the time the value was produced. In that regard, historic versions of these dictionaries are managed if necessary. link to the time axis - date of the event that produced the value - date of archiving of the fact in the database. Furthermore, elementary facts in Archimed are standardized the following way. Facts coming from different domains of activity are expressed using a common template, including a value and the set of corresponding properties. Basic properties are mandatory, additional ones are optional and their number may vary according to the domain of activity as shown in figure 2. The Archimed standardized elementary facts can therefore easily accommodate a new domain of activity. The storage of aggregated data can be useful to speed up queries and optimize the database usage. They can be calculated from a group of selected facts from which can be applied aggregated functions (sum, average, count, etc.). The Archimed database doesn t currently

4 172 store aggregated data; this area is however under consideration for future developments. Patient lengths of stay, indicators describing counts of diagnoses, or surgical procedures, and re admissions rates are some of the aggregates of interest. The next section describes the database model used to structure these elementary facts. Instanciated ADT fact Instanciated Laboratory fact Instanciated Procedure fact patient: episode: medservice: pediatric medunit: 1-AL patientage: 7 patientsex: M property name: out-patient value: regular entry dateoffact: dateofarchive: chief complaint: accident patient: episode: medservice: INTERNAL MED medunit: 7-AL patientage: 46 patientsex: F property name: glucose value: 6.8 dateoffact: dateofarchive: unit: mml/l material: plasma range: patient: episode: medservice: OBGYN medunit: 2-AL patientage: 34 patientsex: F property name: C-section value: elected dateoffact: dateofarchive: additional optional properties Figure 2 - Three instanciated Archimed facts according to the standardized elementary fact template 3. Patient medical data is organized using a multi dimensional approach The Archimed database relies on a multi dimensional modeling approach. The dimensional model is a logical database design method particularly well suited to data warehouse databases [2]. In particular, it has the great advantage to allow data coming from various heterogeneous and independent sources, such as Archimed elementary patient facts, to be presented in an intuitive and standardized fashion. Contrary to the usual Entity/Relational data model used by transactional systems, it limits the number of tables to fact and dimension tables. For instance, the multi dimensional model in Archimed for the Laboratory data mart includes: one, recording the elementary facts produced by the operational system, namely the patient laboratory result values. a set of usually smaller dimension tables, each linked to the through a primary key, and describing the context of interpretation of elementary facts. The dimension tables in the Laboratory data mart include tables describing patients, medical services, medical units, laboratory exams, laboratory ranges values, and units. medical service patient diagnostic code date/time Figure 3 Three dimensional data cube describing diagnostic codes along three dimensions : patient, medical service, and time Figure 4 shows the general star schema data model used for the representation of an Archimed data mart. The s all contain zero or more facts that represent values (measurements) taken at each combination of the dimension key components. The fact table is also often represented by a data cube where each axis corresponds to one dimension, as shown in figure 3 [1,2]. As the Archimed data warehouse is composed of several data marts, the whole data model results in a set of inter connected star schemas including a collection of s

5 173 describing patient medical data (encounters, laboratory results, diagnoses and procedures, dimensions conformed dimensions (shared by at least 2 fact tables) dimensions patient episode med. services med. unit Figure 4 - Four Archimed data marts inter connected through conformed dimension tables radiology exams and so on) and a set of dimension tables where some of them are shared by many facts tables. Indeed, the patient table or the medical services table record data meaning exactly the same thing in each data mart. Dimension tables shared by several s are called conformed dimensions [2]. Conformed dimensions constitute a very important and powerful aspect of the multi dimensional model. Indeed, the definition of conformed dimensions establishes the links between the data marts, providing an integrated view of the elementary facts. It builds the foundation for a simple and performant querying of the patient medical data as if it were initially part of the same database. For example, the queries retrieve all medical data of patient of age during its last hospitalization in the internal medicine department or retrieve all patients of the pediatric department having low hemoglobin and diagnosis of diabetes require many steps of laborious treatment when issued at the various hospital transactional database systems. It is however very simple to solve with the Archimed database model. Establishing, enforcing, and maintaining, the dimension tables of the data warehouse Archimed are therefore important tasks respectively during the initial planning of the database architecture, during the integration process of a new data mart, and for the maintenance of the database overall coherence. 4. Implementation and life cycle of the Archimed database The Archimed database was first developed using the Ingres database system before being recently migrated to the Oracle relational database system. In brief, the physical implementation of the database has required the definition of a physical data model, proper indexation of the tables, and definition of rules to consistently name tables and attributes. Unlike classical transactional systems, a data warehouse is in perpetual evolution and cannot remain static. Indeed, its evolution must follow the changes occurring in the organization which it serves. User needs appear or change. New data sources become accessible and must be integrated in the warehouse. The process of adding a new data mart to the architecture complies with the following steps. First, a detailed analysis of the source data provided by the transactional systems is carried out. Elementary facts are identified, setting the granularity of the. Then, conformed dimensions are carefully highlighted; they define the links with the other data marts. Other dimensions are also identified which complement the definition of the

6 174 elementary facts. A mapping between source data items and the data warehouse table attributes is established. Then, the physical table and index definitions can be derived and implemented. The Archimed data warehouse has been built incrementally, in order to break the implementation task to manageable proportions. Each data mart has been successively implemented and connected to the overall architecture. Moreover, the Archimed database was designed to anticipate changes and the evolution of existing data sources. As shown above, the data model accommodates the need for new descriptors and new dimensions without to have to change the database schema. 5. Conclusion The planning, realization, maintenance and evolution of a large data warehouse database system can be overwhelming. Source data is complex, volumes are large, and portions of data will be dirty, erroneous, or hard to understand. In the case of a data warehouse for medical data, we have discussed the three main design aspects of the Archimed database. The organization of elementary facts (atomic) using the dimensional model has proved to be a powerful tool in order to integrate data coming from multiple heterogeneous database systems developed independently throughout the institution. It brings clarity and performance regarding data access. It also provides scalability to the system and resilience to further changes that may occur in source systems feeding the data warehouse. Moreover, the building of the data warehouse in an incremental way has helped to control the evolution of the data content These design options constitute the main rules and act as a road map to follow when a new medical set of data (or data mart) is integrated to the system. Sticking to these decisions has enabled to ensure the coherence of the integrated data, long term scalability, and to greatly simplify the access to data produced by tens of independent hospital wide operational databases. These rules have been established progressively and result from more that 15 years of prior experience in providing information to hospital administrators and medical staff including departmental statistics, patient similar case retrievals, and clinical studies. 6. References [1] Inmon WH, Building the Data Warehouse, Wiley; 3rd edition, [2] Kimball R, Ross M. The Data Warehouse Lifecycle Toolkit, Wiley; 2nd edition, [3] Kerkri R, Quantin C, Yetongnon K, Dusserre L. Les entrepôts de données:application au suivi épidémiologique. Informatique et Santé, Springer-Verlag, France, Paris 1998(10): [4] Ledbetter CS, Morgan MW, Toward best practice: leveraging the electronic patient record as a clinical data warehouse J Healthc Inf Manag Summer;15(2): [5] Lehner B, Thurler G, Bréant C, Tahintzi P, Borst F. Retrieval of Similar Cases using the ARCHIMED Navigator. MIE, [6] Thurler G, Bréant C, Lehner B, Bunge M, Samii K, Hochstrasser D, Nendaz M, Gaspoz JM, Tahintzi P, Borst F. Toward a Systemic Approach to Disease. Complexus, 2003;1: [7] Thurler G, Borst F, Bréant C, Campi D, Jenc j, Lehner B, Maricot P, and Scherrer JR. ARCHIMED: A network of Integrated Information Systems. Method Inform Med 2000; 39: Address for correspondence Claudine Bréant HUG Service d Informatique Médicale 24, rue Micheli-du-Crêts 1211 Genève 14, Switzerland, claudine.breant@sim.hcuge.ch

Designing ETL Tools to Feed a Data Warehouse Based on Electronic Healthcare Record Infrastructure

Designing ETL Tools to Feed a Data Warehouse Based on Electronic Healthcare Record Infrastructure Digital Healthcare Empowering Europeans R. Cornet et al. (Eds.) 2015 European Federation for Medical Informatics (EFMI). This article is published online with Open Access by IOS Press and distributed under

More information

www.ijreat.org Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 28

www.ijreat.org Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 28 Data Warehousing - Essential Element To Support Decision- Making Process In Industries Ashima Bhasin 1, Mr Manoj Kumar 2 1 Computer Science Engineering Department, 2 Associate Professor, CSE Abstract SGT

More information

Data Warehousing Systems: Foundations and Architectures

Data Warehousing Systems: Foundations and Architectures Data Warehousing Systems: Foundations and Architectures Il-Yeol Song Drexel University, http://www.ischool.drexel.edu/faculty/song/ SYNONYMS None DEFINITION A data warehouse (DW) is an integrated repository

More information

A Design and implementation of a data warehouse for research administration universities

A Design and implementation of a data warehouse for research administration universities A Design and implementation of a data warehouse for research administration universities André Flory 1, Pierre Soupirot 2, and Anne Tchounikine 3 1 CRI : Centre de Ressources Informatiques INSA de Lyon

More information

Lection 3-4 WAREHOUSING

Lection 3-4 WAREHOUSING Lection 3-4 DATA WAREHOUSING Learning Objectives Understand d the basic definitions iti and concepts of data warehouses Understand data warehousing architectures Describe the processes used in developing

More information

The Design and the Implementation of an HEALTH CARE STATISTICS DATA WAREHOUSE Dr. Sreèko Natek, assistant professor, Nova Vizija, srecko@vizija.

The Design and the Implementation of an HEALTH CARE STATISTICS DATA WAREHOUSE Dr. Sreèko Natek, assistant professor, Nova Vizija, srecko@vizija. The Design and the Implementation of an HEALTH CARE STATISTICS DATA WAREHOUSE Dr. Sreèko Natek, assistant professor, Nova Vizija, srecko@vizija.si ABSTRACT Health Care Statistics on a state level is a

More information

Dimensional Modeling for Data Warehouse

Dimensional Modeling for Data Warehouse Modeling for Data Warehouse Umashanker Sharma, Anjana Gosain GGS, Indraprastha University, Delhi Abstract Many surveys indicate that a significant percentage of DWs fail to meet business objectives or

More information

Data Warehousing and Data Mining in Business Applications

Data Warehousing and Data Mining in Business Applications 133 Data Warehousing and Data Mining in Business Applications Eesha Goel CSE Deptt. GZS-PTU Campus, Bathinda. Abstract Information technology is now required in all aspect of our lives that helps in business

More information

THE QUALITY OF DATA AND METADATA IN A DATAWAREHOUSE

THE QUALITY OF DATA AND METADATA IN A DATAWAREHOUSE THE QUALITY OF DATA AND METADATA IN A DATAWAREHOUSE Carmen Răduţ 1 Summary: Data quality is an important concept for the economic applications used in the process of analysis. Databases were revolutionized

More information

IMPROVING THE QUALITY OF THE DECISION MAKING BY USING BUSINESS INTELLIGENCE SOLUTIONS

IMPROVING THE QUALITY OF THE DECISION MAKING BY USING BUSINESS INTELLIGENCE SOLUTIONS IMPROVING THE QUALITY OF THE DECISION MAKING BY USING BUSINESS INTELLIGENCE SOLUTIONS Maria Dan Ştefan Academy of Economic Studies, Faculty of Accounting and Management Information Systems, Uverturii Street,

More information

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1 Slide 29-1 Chapter 29 Overview of Data Warehousing and OLAP Chapter 29 Outline Purpose of Data Warehousing Introduction, Definitions, and Terminology Comparison with Traditional Databases Characteristics

More information

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of An Introduction to Data Warehousing An organization manages information in two dominant forms: operational systems of record and data warehouses. Operational systems are designed to support online transaction

More information

CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University

CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University Given today s business environment, at times a corporate executive

More information

Fluency With Information Technology CSE100/IMT100

Fluency With Information Technology CSE100/IMT100 Fluency With Information Technology CSE100/IMT100 ),7 Larry Snyder & Mel Oyler, Instructors Ariel Kemp, Isaac Kunen, Gerome Miklau & Sean Squires, Teaching Assistants University of Washington, Autumn 1999

More information

When to consider OLAP?

When to consider OLAP? When to consider OLAP? Author: Prakash Kewalramani Organization: Evaltech, Inc. Evaltech Research Group, Data Warehousing Practice. Date: 03/10/08 Email: erg@evaltech.com Abstract: Do you need an OLAP

More information

BUILDING OLAP TOOLS OVER LARGE DATABASES

BUILDING OLAP TOOLS OVER LARGE DATABASES BUILDING OLAP TOOLS OVER LARGE DATABASES Rui Oliveira, Jorge Bernardino ISEC Instituto Superior de Engenharia de Coimbra, Polytechnic Institute of Coimbra Quinta da Nora, Rua Pedro Nunes, P-3030-199 Coimbra,

More information

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing 1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing 2. What is a Data warehouse a. A database application

More information

Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram

Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram Cognizant Technology Solutions, Newbury Park, CA Clinical Data Repository (CDR) Drug development lifecycle consumes a lot of time, money

More information

Life Cycle of a Data Warehousing Project in Healthcare

Life Cycle of a Data Warehousing Project in Healthcare Life Cycle of a Data Warehousing Project in Healthcare Ravi Verma, Jeannette Harper ABSTRACT Hill Physicians Medical Group (and its medical management firm, PriMed Management) early on recognized the need

More information

SENG 520, Experience with a high-level programming language. (304) 579-7726, Jeff.Edgell@comcast.net

SENG 520, Experience with a high-level programming language. (304) 579-7726, Jeff.Edgell@comcast.net Course : Semester : Course Format And Credit hours : Prerequisites : Data Warehousing and Business Intelligence Summer (Odd Years) online 3 hr Credit SENG 520, Experience with a high-level programming

More information

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage Moving Large Data at a Blinding Speed for Critical Business Intelligence A competitive advantage Intelligent Data In Real Time How do you detect and stop a Money Laundering transaction just about to take

More information

Mario Guarracino. Data warehousing

Mario Guarracino. Data warehousing Data warehousing Introduction Since the mid-nineties, it became clear that the databases for analysis and business intelligence need to be separate from operational. In this lecture we will review the

More information

What is Management Reporting from a Data Warehouse and What Does It Have to Do with Institutional Research?

What is Management Reporting from a Data Warehouse and What Does It Have to Do with Institutional Research? What is Management Reporting from a Data Warehouse and What Does It Have to Do with Institutional Research? Emily Thomas Stony Brook University AIRPO Winter Workshop January 2006 Data to Information Historically

More information

Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach

Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach 2006 ISMA Conference 1 Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach Priya Lobo CFPS Satyam Computer Services Ltd. 69, Railway Parallel Road, Kumarapark West, Bangalore 560020,

More information

SQL Server 2012 Business Intelligence Boot Camp

SQL Server 2012 Business Intelligence Boot Camp SQL Server 2012 Business Intelligence Boot Camp Length: 5 Days Technology: Microsoft SQL Server 2012 Delivery Method: Instructor-led (classroom) About this Course Data warehousing is a solution organizations

More information

Data Warehousing and OLAP Technology for Knowledge Discovery

Data Warehousing and OLAP Technology for Knowledge Discovery 542 Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories

More information

Practical Implementation of a Bridge between Legacy EHR System and a Clinical Research Environment

Practical Implementation of a Bridge between Legacy EHR System and a Clinical Research Environment Cross-Border Challenges in Informatics with a Focus on Disease Surveillance and Utilising Big-Data L. Stoicu-Tivadar et al. (Eds.) 2014 The authors. This article is published online with Open Access by

More information

University Data Warehouse Design Issues: A Case Study

University Data Warehouse Design Issues: A Case Study Session 2358 University Data Warehouse Design Issues: A Case Study Melissa C. Lin Chief Information Office, University of Florida Abstract A discussion of the design and modeling issues associated with

More information

Methodology Framework for Analysis and Design of Business Intelligence Systems

Methodology Framework for Analysis and Design of Business Intelligence Systems Applied Mathematical Sciences, Vol. 7, 2013, no. 31, 1523-1528 HIKARI Ltd, www.m-hikari.com Methodology Framework for Analysis and Design of Business Intelligence Systems Martin Závodný Department of Information

More information

Key organizational factors in data warehouse architecture selection

Key organizational factors in data warehouse architecture selection Key organizational factors in data warehouse architecture selection Ravi Kumar Choudhary ABSTRACT Deciding the most suitable architecture is the most crucial activity in the Data warehouse life cycle.

More information

Designing a Dimensional Model

Designing a Dimensional Model Designing a Dimensional Model Erik Veerman Atlanta MDF member SQL Server MVP, Microsoft MCT Mentor, Solid Quality Learning Definitions Data Warehousing A subject-oriented, integrated, time-variant, and

More information

Deriving Business Intelligence from Unstructured Data

Deriving Business Intelligence from Unstructured Data International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 9 (2013), pp. 971-976 International Research Publications House http://www. irphouse.com /ijict.htm Deriving

More information

The Quality Data Warehouse: Solving Problems for the Enterprise

The Quality Data Warehouse: Solving Problems for the Enterprise The Quality Data Warehouse: Solving Problems for the Enterprise Bradley W. Klenz, SAS Institute Inc., Cary NC Donna O. Fulenwider, SAS Institute Inc., Cary NC ABSTRACT Enterprise quality improvement is

More information

SimCorp Solution Guide

SimCorp Solution Guide SimCorp Solution Guide Data Warehouse Manager For all your reporting and analytics tasks, you need a central data repository regardless of source. SimCorp s Data Warehouse Manager gives you a comprehensive,

More information

CHAPTER 4 Data Warehouse Architecture

CHAPTER 4 Data Warehouse Architecture CHAPTER 4 Data Warehouse Architecture 4.1 Data Warehouse Architecture 4.2 Three-tier data warehouse architecture 4.3 Types of OLAP servers: ROLAP versus MOLAP versus HOLAP 4.4 Further development of Data

More information

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole Paper BB-01 Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole ABSTRACT Stephen Overton, Overton Technologies, LLC, Raleigh, NC Business information can be consumed many

More information

Data Warehouse: Introduction

Data Warehouse: Introduction Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,

More information

Overview. DW Source Integration, Tools, and Architecture. End User Applications (EUA) EUA Concepts. DW Front End Tools. Source Integration

Overview. DW Source Integration, Tools, and Architecture. End User Applications (EUA) EUA Concepts. DW Front End Tools. Source Integration DW Source Integration, Tools, and Architecture Overview DW Front End Tools Source Integration DW architecture Original slides were written by Torben Bach Pedersen Aalborg University 2007 - DWML course

More information

ETL-EXTRACT, TRANSFORM & LOAD TESTING

ETL-EXTRACT, TRANSFORM & LOAD TESTING ETL-EXTRACT, TRANSFORM & LOAD TESTING Rajesh Popli Manager (Quality), Nagarro Software Pvt. Ltd., Gurgaon, INDIA rajesh.popli@nagarro.com ABSTRACT Data is most important part in any organization. Data

More information

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies Ashish Gahlot, Manoj Yadav Dronacharya college of engineering Farrukhnagar, Gurgaon,Haryana Abstract- Data warehousing, Data Mining,

More information

META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING

META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING Ramesh Babu Palepu 1, Dr K V Sambasiva Rao 2 Dept of IT, Amrita Sai Institute of Science & Technology 1 MVR College of Engineering 2 asistithod@gmail.com

More information

Business Intelligence Systems

Business Intelligence Systems 12 Business Intelligence Systems Business Intelligence Systems Bogdan NEDELCU University of Economic Studies, Bucharest, Romania bogdannedelcu@hotmail.com The aim of this article is to show the importance

More information

IST722 Data Warehousing

IST722 Data Warehousing IST722 Data Warehousing Components of the Data Warehouse Michael A. Fudge, Jr. Recall: Inmon s CIF The CIF is a reference architecture Understanding the Diagram The CIF is a reference architecture CIF

More information

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya Chapter 6 Basics of Data Integration Fundamentals of Business Analytics Learning Objectives and Learning Outcomes Learning Objectives 1. Concepts of data integration 2. Needs and advantages of using data

More information

Data Warehousing and Data Mining

Data Warehousing and Data Mining Data Warehousing and Data Mining Part I: Data Warehousing Gao Cong gaocong@cs.aau.dk Slides adapted from Man Lung Yiu and Torben Bach Pedersen Course Structure Business intelligence: Extract knowledge

More information

Trends in Data Warehouse Data Modeling: Data Vault and Anchor Modeling

Trends in Data Warehouse Data Modeling: Data Vault and Anchor Modeling Trends in Data Warehouse Data Modeling: Data Vault and Anchor Modeling Thanks for Attending! Roland Bouman, Leiden the Netherlands MySQL AB, Sun, Strukton, Pentaho (1 nov) Web- and Business Intelligence

More information

Data Integration and ETL Process

Data Integration and ETL Process Data Integration and ETL Process Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master studies, second

More information

DATA WAREHOUSING AND OLAP TECHNOLOGY

DATA WAREHOUSING AND OLAP TECHNOLOGY DATA WAREHOUSING AND OLAP TECHNOLOGY Manya Sethi MCA Final Year Amity University, Uttar Pradesh Under Guidance of Ms. Shruti Nagpal Abstract DATA WAREHOUSING and Online Analytical Processing (OLAP) are

More information

Data warehouse Architectures and processes

Data warehouse Architectures and processes Database and data mining group, Data warehouse Architectures and processes DATA WAREHOUSE: ARCHITECTURES AND PROCESSES - 1 Database and data mining group, Data warehouse architectures Separation between

More information

High-Volume Data Warehousing in Centerprise. Product Datasheet

High-Volume Data Warehousing in Centerprise. Product Datasheet High-Volume Data Warehousing in Centerprise Product Datasheet Table of Contents Overview 3 Data Complexity 3 Data Quality 3 Speed and Scalability 3 Centerprise Data Warehouse Features 4 ETL in a Unified

More information

A Service-oriented Architecture for Business Intelligence

A Service-oriented Architecture for Business Intelligence A Service-oriented Architecture for Business Intelligence Liya Wu 1, Gilad Barash 1, Claudio Bartolini 2 1 HP Software 2 HP Laboratories {name.surname@hp.com} Abstract Business intelligence is a business

More information

Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring

Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring www.ijcsi.org 78 Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring Mohammed Mohammed 1 Mohammed Anad 2 Anwar Mzher 3 Ahmed Hasson 4 2 faculty

More information

Multidimensional Modeling - Stocks

Multidimensional Modeling - Stocks Bases de Dados e Data Warehouse 06 BDDW 2006/2007 Notice! Author " João Moura Pires (jmp@di.fct.unl.pt)! This material can be freely used for personal or academic purposes without any previous authorization

More information

Upon successful completion of this course, a student will meet the following outcomes:

Upon successful completion of this course, a student will meet the following outcomes: College of San Mateo Official Course Outline 1. COURSE ID: CIS 364 TITLE: Enterprise Data Warehousing Semester Units/Hours: 4.0 units; a minimum of 48.0 lecture hours/semester; a minimum of 48.0 lab hours/semester

More information

A Critical Review of Data Warehouse

A Critical Review of Data Warehouse Global Journal of Business Management and Information Technology. Volume 1, Number 2 (2011), pp. 95-103 Research India Publications http://www.ripublication.com A Critical Review of Data Warehouse Sachin

More information

Patient Trajectory Modeling and Analysis

Patient Trajectory Modeling and Analysis Patient Trajectory Modeling and Analysis Jalel Akaichi and Marwa Manaa Higher Institute of Management of tunis, 41 Rue de la Liberté, Cité Bouchoucha, 2000 Bardo, Tunisia j.akaichi@gmail.com, manaamarwa@gmail.com

More information

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES MUHAMMAD KHALEEL (0912125) SZABIST KARACHI CAMPUS Abstract. Data warehouse and online analytical processing (OLAP) both are core component for decision

More information

INFORMATION TECHNOLOGIES FOR PATIENT CARE MANAGEMENT

INFORMATION TECHNOLOGIES FOR PATIENT CARE MANAGEMENT SUMMARY Features INTERIN Technology, a complex of software tools and techniques for building health care information systems, was developed in the Program Systems Institute, Russian Academy of Sciences.

More information

HETEROGENEOUS DATA TRANSFORMING INTO DATA WAREHOUSES AND THEIR USE IN THE MANAGEMENT OF PROCESSES

HETEROGENEOUS DATA TRANSFORMING INTO DATA WAREHOUSES AND THEIR USE IN THE MANAGEMENT OF PROCESSES HETEROGENEOUS DATA TRANSFORMING INTO DATA WAREHOUSES AND THEIR USE IN THE MANAGEMENT OF PROCESSES Pavol TANUŠKA, Igor HAGARA Authors: Assoc. Prof. Pavol Tanuška, PhD., MSc. Igor Hagara Workplace: Institute

More information

Problem-Centered Care Delivery

Problem-Centered Care Delivery HOW INTERFACE TERMINOLOGY MAKES STANDARDIZED HEALTH INFORMATION POSSIBLE Terminologies ensure that the languages of medicine can be understood by both humans and machines. by June Bronnert, RHIA, CCS,

More information

BUILDING A HEALTH CARE DATA WAREHOUSE FOR CANCER DISEASES

BUILDING A HEALTH CARE DATA WAREHOUSE FOR CANCER DISEASES BUILDING A HEALTH CARE DATA WAREHOUSE FOR CANCER DISEASES Dr.Osama E.Sheta 1 and Ahmed Nour Eldeen 2 1,2 Department of Mathematics Faculty of Science, Zagazig University, Zagazig, Elsharkia, Egypt. 1 oesheta75@gmail.com,

More information

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server Extending Hyperion BI with the Oracle BI Server Mark Ostroff Sr. BI Solutions Consultant Agenda Hyperion BI versus Hyperion BI with OBI Server Benefits of using Hyperion BI with the

More information

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing

More information

Dimensional Modeling and E-R Modeling In. Joseph M. Firestone, Ph.D. White Paper No. Eight. June 22, 1998

Dimensional Modeling and E-R Modeling In. Joseph M. Firestone, Ph.D. White Paper No. Eight. June 22, 1998 1 of 9 5/24/02 3:47 PM Dimensional Modeling and E-R Modeling In The Data Warehouse By Joseph M. Firestone, Ph.D. White Paper No. Eight June 22, 1998 Introduction Dimensional Modeling (DM) is a favorite

More information

Data warehouse design

Data warehouse design DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Database and data mining group, Data warehouse design DATA WAREHOUSE: DESIGN - 1 Risk factors Database

More information

AN INTEGRATION APPROACH FOR THE STATISTICAL INFORMATION SYSTEM OF ISTAT USING SDMX STANDARDS

AN INTEGRATION APPROACH FOR THE STATISTICAL INFORMATION SYSTEM OF ISTAT USING SDMX STANDARDS Distr. GENERAL Working Paper No.2 26 April 2007 ENGLISH ONLY UNITED NATIONS STATISTICAL COMMISSION and ECONOMIC COMMISSION FOR EUROPE CONFERENCE OF EUROPEAN STATISTICIANS EUROPEAN COMMISSION STATISTICAL

More information

MDM and Data Warehousing Complement Each Other

MDM and Data Warehousing Complement Each Other Master Management MDM and Warehousing Complement Each Other Greater business value from both 2011 IBM Corporation Executive Summary Master Management (MDM) and Warehousing (DW) complement each other There

More information

Trivadis White Paper. Comparison of Data Modeling Methods for a Core Data Warehouse. Dani Schnider Adriano Martino Maren Eschermann

Trivadis White Paper. Comparison of Data Modeling Methods for a Core Data Warehouse. Dani Schnider Adriano Martino Maren Eschermann Trivadis White Paper Comparison of Data Modeling Methods for a Core Data Warehouse Dani Schnider Adriano Martino Maren Eschermann June 2014 Table of Contents 1. Introduction... 3 2. Aspects of Data Warehouse

More information

A Case Study in Integrated Quality Assurance for Performance Management Systems

A Case Study in Integrated Quality Assurance for Performance Management Systems A Case Study in Integrated Quality Assurance for Performance Management Systems Liam Peyton, Bo Zhan, Bernard Stepien School of Information Technology and Engineering, University of Ottawa, 800 King Edward

More information

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

Chapter 5. Warehousing, Data Acquisition, Data. Visualization Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives

More information

CHAPTER 3. Data Warehouses and OLAP

CHAPTER 3. Data Warehouses and OLAP CHAPTER 3 Data Warehouses and OLAP 3.1 Data Warehouse 3.2 Differences between Operational Systems and Data Warehouses 3.3 A Multidimensional Data Model 3.4Stars, snowflakes and Fact Constellations: 3.5

More information

14. Data Warehousing & Data Mining

14. Data Warehousing & Data Mining 14. Data Warehousing & Data Mining Data Warehousing Concepts Decision support is key for companies wanting to turn their organizational data into an information asset Data Warehouse "A subject-oriented,

More information

Reflections on Agile DW by a Business Analytics Practitioner. Werner Engelen Principal Business Analytics Architect

Reflections on Agile DW by a Business Analytics Practitioner. Werner Engelen Principal Business Analytics Architect Reflections on Agile DW by a Business Analytics Practitioner Werner Engelen Principal Business Analytics Architect Introduction Werner Engelen Active in BI & DW since 1998 + 6 years at element61 Previously:

More information

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

More information

DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS

DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS DATA WAREHOUSE CONCEPTS A fundamental concept of a data warehouse is the distinction between data and information. Data is composed of observable and recordable facts that are often found in operational

More information

Metadata Management for Data Warehouse Projects

Metadata Management for Data Warehouse Projects Metadata Management for Data Warehouse Projects Stefano Cazzella Datamat S.p.A. stefano.cazzella@datamat.it Abstract Metadata management has been identified as one of the major critical success factor

More information

1. Dimensional Data Design - Data Mart Life Cycle

1. Dimensional Data Design - Data Mart Life Cycle 1. Dimensional Data Design - Data Mart Life Cycle 1.1. Introduction A data mart is a persistent physical store of operational and aggregated data statistically processed data that supports businesspeople

More information

ISO 18308 INTERNATIONAL STANDARD. Health informatics Requirements for an electronic health record architecture

ISO 18308 INTERNATIONAL STANDARD. Health informatics Requirements for an electronic health record architecture INTERNATIONAL STANDARD ISO 18308 First edition 2011-04-15 Health informatics Requirements for an electronic health record architecture Informatique de santé Exigences relatives à une architecture de l'enregistrement

More information

BUSINESS INTELLIGENCE. Keywords: business intelligence, architecture, concepts, dashboards, ETL, data mining

BUSINESS INTELLIGENCE. Keywords: business intelligence, architecture, concepts, dashboards, ETL, data mining BUSINESS INTELLIGENCE Bogdan Mohor Dumitrita 1 Abstract A Business Intelligence (BI)-driven approach can be very effective in implementing business transformation programs within an enterprise framework.

More information

Data Warehousing. Yeow Wei Choong Anne Laurent

Data Warehousing. Yeow Wei Choong Anne Laurent Data Warehousing Yeow Wei Choong Anne Laurent Databases Databases are developed on the IDEA that DATA is one of the cri>cal materials of the Informa>on Age Informa>on, which is created by data, becomes

More information

Nursing Diagnosis and Multidimensional Design

Nursing Diagnosis and Multidimensional Design Proceedings of the 3 rd INFORMS Workshop on Data Mining and Health Informatics (DM-HI 2008) J. Li, D. Aleman, R. Sikora, eds. NursingCareWare: Warehousing for Nursing Care Research and Knowledge Discovery

More information

Impact Intelligence. Flexibility. Security. Ease of use. White Paper

Impact Intelligence. Flexibility. Security. Ease of use. White Paper Impact Intelligence Health care organizations continue to seek ways to improve the value they deliver to their customers and pinpoint opportunities to enhance performance. Accurately identifying trends

More information

Index Selection Techniques in Data Warehouse Systems

Index Selection Techniques in Data Warehouse Systems Index Selection Techniques in Data Warehouse Systems Aliaksei Holubeu as a part of a Seminar Databases and Data Warehouses. Implementation and usage. Konstanz, June 3, 2005 2 Contents 1 DATA WAREHOUSES

More information

Establish and maintain Center of Excellence (CoE) around Data Architecture

Establish and maintain Center of Excellence (CoE) around Data Architecture Senior BI Data Architect - Bensenville, IL The Company s Information Management Team is comprised of highly technical resources with diverse backgrounds in data warehouse development & support, business

More information

College of Engineering, Technology, and Computer Science

College of Engineering, Technology, and Computer Science College of Engineering, Technology, and Computer Science Design and Implementation of Cloud-based Data Warehousing In partial fulfillment of the requirements for the Degree of Master of Science in Technology

More information

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM. DATA MINING TECHNOLOGY Georgiana Marin 1 Abstract In terms of data processing, classical statistical models are restrictive; it requires hypotheses, the knowledge and experience of specialists, equations,

More information

MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012 MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Description: This five-day instructor-led course teaches students how to design and implement a BI infrastructure. The

More information

Rocky Mountain Technology Ventures. Exploring the Intricacies and Processes Involving the Extraction, Transformation and Loading of Data

Rocky Mountain Technology Ventures. Exploring the Intricacies and Processes Involving the Extraction, Transformation and Loading of Data Rocky Mountain Technology Ventures Exploring the Intricacies and Processes Involving the Extraction, Transformation and Loading of Data 3/25/2006 Introduction As data warehousing, OLAP architectures, Decision

More information

IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance

IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance Data Sheet IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance Overview Multidimensional analysis is a powerful means of extracting maximum value from your corporate

More information

<Insert Picture Here> Enhancing the Performance and Analytic Content of the Data Warehouse Using Oracle OLAP Option

<Insert Picture Here> Enhancing the Performance and Analytic Content of the Data Warehouse Using Oracle OLAP Option Enhancing the Performance and Analytic Content of the Data Warehouse Using Oracle OLAP Option The following is intended to outline our general product direction. It is intended for

More information

Concepts of Database Management Seventh Edition. Chapter 9 Database Management Approaches

Concepts of Database Management Seventh Edition. Chapter 9 Database Management Approaches Concepts of Database Management Seventh Edition Chapter 9 Database Management Approaches Objectives Describe distributed database management systems (DDBMSs) Discuss client/server systems Examine the ways

More information

MEDHOST Integration. Improve continuity of care, resulting in more informed care decisions

MEDHOST Integration. Improve continuity of care, resulting in more informed care decisions Improve continuity of care, resulting in more informed care decisions Integration Data exchange, visibility, timeliness and mobility directly influence patient safety and satisfaction, care transitions,

More information

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT BUILDING BLOCKS OF DATAWAREHOUSE G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT 1 Data Warehouse Subject Oriented Organized around major subjects, such as customer, product, sales. Focusing on

More information

Week 3 lecture slides

Week 3 lecture slides Week 3 lecture slides Topics Data Warehouses Online Analytical Processing Introduction to Data Cubes Textbook reference: Chapter 3 Data Warehouses A data warehouse is a collection of data specifically

More information

Course Design Document. IS417: Data Warehousing and Business Analytics

Course Design Document. IS417: Data Warehousing and Business Analytics Course Design Document IS417: Data Warehousing and Business Analytics Version 2.1 20 June 2009 IS417 Data Warehousing and Business Analytics Page 1 Table of Contents 1. Versions History... 3 2. Overview

More information

INTEROPERABILITY IN DATA WAREHOUSES

INTEROPERABILITY IN DATA WAREHOUSES INTEROPERABILITY IN DATA WAREHOUSES Riccardo Torlone Roma Tre University http://torlone.dia.uniroma3.it/ SYNONYMS Data warehouse integration DEFINITION The term refers to the ability of combining the content

More information

CHAPTER-6 DATA WAREHOUSE

CHAPTER-6 DATA WAREHOUSE CHAPTER-6 DATA WAREHOUSE 1 CHAPTER-6 DATA WAREHOUSE 6.1 INTRODUCTION Data warehousing is gaining in popularity as organizations realize the benefits of being able to perform sophisticated analyses of their

More information

Oracle Warehouse Builder 10g

Oracle Warehouse Builder 10g Oracle Warehouse Builder 10g Architectural White paper February 2004 Table of contents INTRODUCTION... 3 OVERVIEW... 4 THE DESIGN COMPONENT... 4 THE RUNTIME COMPONENT... 5 THE DESIGN ARCHITECTURE... 6

More information

3.1 Data Warehouse Methodology. Literature Review on Data Warehouse

3.1 Data Warehouse Methodology. Literature Review on Data Warehouse Chapter 3 Literature Review on Data Warehouse This chapter aims to present the literature review, a study about data warehouse technology, specifying concepts, characteristics and different types of data

More information

SALES BASED DATA EXTRACTION FOR BUSINESS INTELLIGENCE

SALES BASED DATA EXTRACTION FOR BUSINESS INTELLIGENCE SALES BASED DATA EXTRACTION FOR BUSINESS INTELLIGENCE Sukanta Singh Department of Computer Science & Engineering, Global Institute of Management & Technology, Krishnagar, Nadia, West Bengal sukantasingh2008@gmail.com

More information