6NF CONCEPTUAL MODELS AND DATA WAREHOUSING 2.0
|
|
|
- Marybeth Smith
- 10 years ago
- Views:
Transcription
1 ABSTRACT 6NF CONCEPTUAL MODELS AND DATA WAREHOUSING 2.0 Curtis Knowles Georgia Southern University Sixth Normal Form (6NF) is a term used in relational database theory by Christopher Date to describe databases which decompose relational variables to irreducible elements. While this form may be unimportant for non-temporal data, it is certainly important when maintaining data containing temporal variables of a point-in-time or interval nature. With the advent of Data Warehousing 2.0 (DW 2.0), there is now an increased emphasis on using fully-temporalized databases in the context of data warehousing, in particular with next generation approaches such as Anchor Modeling. In this paper, we will explore the concepts of temporal data, 6NF conceptual database models, and their relationship with DW 2.0. Further, we will also evaluate Anchor Modeling as a conceptual design method in which to capture temporal data. Using these concepts, we will indicate a path forward for evaluating a possible translation of 6NF-compliant data into an extensible Markup Language (XML) Schema for the purpose of describing and presenting such data to disparate systems in a structured format suitable for data exchange. Keywords 6NF, XML, sixth normal form, anchor modeling, temporal database, semantically temporal data, semantically stable data, semantic data change, valid time, transaction time, DW 2.0, Dimension Fact Model, DFM INTRODUCTION A temporal database is a database which contains aspects of time analysis. Specifically, such databases often deal with two concepts of time valid time and transaction time. Valid time is defined as the time period during which a data fact may be true with respect to the real world (i.e., a time when something actually happened), while transaction time refers to the time period during which a data fact may be stored in the database (i.e., a time when something is measured and/or reported). The concepts of valid time and transaction time were introduced and described as a part of TSQL2, a language specification developed in 1993 in response to a proposal by Richard Snodgrass for the development of temporal extensions to the Structured Query Language (SQL) (Snodgrass and Ahn, 1986). In this research, we will include the modeling of both concepts of valid and transaction time, combining them to represent a bitemporal database which is able to store not only current data, but also historical and even future data expected to occur. TEMPORAL DATABASE DESIGN AND 6NF DATA MODELING In his book Temporal Data and the Relational Model, Chris Date points out several shortcomings when attempting to model fully temporalized data using standard Fifth Normal Form (5NF) principles. To illustrate these shortcomings, first consider a non-temporalized, 5NF-compliant table schema (see Figure 1) in which is defined a predicate for the Suppliers relation variable (relvar) S as such: Supplier S# is under contract, is named SNAME, has status STATUS, and is located in city CITY: Figure 1. Non-temporalized Table Schema To introduce a temporal element into table S, we simply add a temporal attribute SINCE to the table which accounts for valid time (see Figure 2). The resulting table S_SINCE now represents that supplier S# has been under contract since day SINCE: Proceedings of the Southern Association for Information Systems Conference, Atlanta, GA, USA March 23 rd -24 th,
2 Figure 2. Semi-temporalized Table Schema using SINCE Alternatively, we may want to track an interval period during which a relvar is true (see Figure 3). The resulting table S_DURING would represent that supplier S# was under contract during time period DURING: Figure 3. Semi-temporalized Table Schema using DURING In both of these cases, even though a temporal element has been introduced into the database, the relvars are badly designed for two reasons: 1. One temporal attribute alone is insufficient to model conditions where each of the other individual attributes may vary independently of one another over time, and 2. Data can be lost when single temporal attributes are updated. Historical data thus cannot be tracked in this scenario. Due to these shortcomings, Date refers to this type of modeling as semi-temporalizing (Date, Darwen and Lorentzos, 2002). To overcome the first issue, a SINCE attribute may be added for each descriptive attribute in the table (see Figure 4). The result is a horizontal explosion of table S_SINCE which now can be defined as follows - Supplier S# was under contract since day S#_SINCE, has been named SNAME since day SNAME_SINCE, has had status STATUS since day STATUS_SINCE, and has been located in CITY since day CITY_SINCE: Figure 4. Semi-temporalized Table Schema using SINCE It should be pointed out that S_SINCE is still considered to be semi-temporalized since it contains no concept of historical data. For example, if supplier S1 s status changes on D12 to 30, the STATUS value of 20 is replaced with 30 and the STATUS_SINCE value of D10 is replaced with D12. The data fact that supplier S1 ever had a status of 20 is now lost. To overcome the issues of tracking historical data, consider the S_DURING relvar. Given the tuple in Figure 3, consider a scenario where supplier S1 s status changes to 30 on day 7. The obvious way to account for this change in S_DURING is to delete the existing tuple and insert two new ones as follows: Figure 5. Semi-temporalized Table Schema using DURING Proceedings of the Southern Association for Information Systems Conference, Atlanta, GA, USA March 23 rd -24 th,
3 The problem that exists with this solution is that relvar S_DURING timestamps too much information (supplier is under contract, supplier has a name, supplier has a status, and supplier has a city). To resolve this problem, we must vertically decompose the relvar into a series of four separate relvars, each with its own timestamp attribute (see Figure 6). Figure 6. Fully-temporalized 6NF-compliant Table Schema Adding transaction time columns CREATED and UPDATED to indicate when a tuple was first created and last updated, respectively, results in a fully-temporalized set of 6NF-compliant relvars capable of tracking historical data (see Figure 7). Figure 7. Fully-temporalized 6NF-compliant Table Schema using Valid Time and Transaction Time This vertical decomposition of relvar S_DURING is similar to a classic normalization process and leads us to the introduction by Date of the Sixth Normal Form (6NF) with this formal definition: A relvar (table) R is in 6NF if and only if R satisfies no nontrivial join dependencies at all, in which case R is said to be irreducible. (Date et al., 2002) Before adding historical data, relvar S_DURING (as seen in Figure 3) is 5NF-compliant since the only nontrivial join dependencies it satisfies are ones that are implied by its sole candidate key [S#, DURING]. After adding historical data, however, S_DURING is no longer in 5NF form since the tuple [S#, SNAME, CITY] is redundant over candidate key [S#, DURING]. Therefore, S_DURING can benefit from further decomposition into the set of 6NF projections as shown in Figure 6. The result is a set of relvars with irreducible elements, each containing its own interval temporal attribute and each tied to the surrogate key S#. By using conceptual models, 6NF datasets such as those shown in Figure 7 can be easily visualized and integrity constraints can be clearly defined. Furthermore, XML documents and schemas can be created by database analysts and designers to produce a method for exchanging complex 6NF-compliant data among disparate systems in a structured format. Proceedings of the Southern Association for Information Systems Conference, Atlanta, GA, USA March 23 rd -24 th,
4 RELATIONSHIP TO DATA WAREHOUSING 2.0 Data Warehousing 2.0 (DW 2.0) is a second-generation attempt to define a standard Data Warehouse architecture. One of the advantages introduced in DW 2.0 is its ability to support changes of data over time. The authors of DW 2.0 address this issue by saying, The answer is that semantically static and semantically temporal data should be physically separate in all database designs. (Inmon, Strauss and Neuschloss, 2008) Semantic data change occurs when the definition of a data entity changes (i.e., new columns are added to an existing entity). This is different from content data change, where the value of an entity changes (i.e., an amount increases from $100 to $200). Semantically temporal data is likely to undergo semantic data change whereas semantically stable data is unlikely to undergo such change. An example of semantically temporal data is a customer profile, where you may want to track new types of information about a customer over time and therefore add new columns of data to the customer entity. For instance, if a company decides to send out a monthly newsletter, it may add an address to its customer profiles. An example of semantically stable data is the details of a customer sale. Once a date of sale, amount of sale, item sold, etc. has been determined, this data is unlikely to change over time. If the data is ever updated, the change is most likely the result of correcting an error in the original recording of data and not due to an actual change in the transaction itself. The designers of DW 2.0 point out that, in an environment where temporal and stable data are grouped together, a simple change in business requirements usually causes the technology infrastructure to go haywire (Inmon et al., 2008). However, separating temporal and stable data can mitigate such an effect. At the point where business change occurs, a new snapshot of the semantics can simply be created and delimited by a time factor using to and from dates. Date s temporalization approach comes into play here because historical snapshots must be created in such a way as to decompose data into a series of irreducible 6NF-compliant tables. By using this approach, semantically stable data is not affected at all and previously existing semantically temporal data also remains unchanged. Capturing these snapshots generates a historical record of data by which an analyst or end-user can easily locate and retrieve information while also providing a technological infrastructure that can withstand change over time (Inmon et al., 2008). ANCHOR MODELING AS A CONCEPTUAL METHOD Anchor Modeling is a database modeling technique built on the premise that the environment surrounding a data warehouse is in a constant state of change, and furthermore that a large change on the outside of the model should result in only a small change on the inside of the model (Rönnbäck, Regardt, Bergholtz, Johannesson and Wohed, 2010). The goal of using the anchor modeling technique is to achieve a highly decomposed implementation that can efficiently handle growth of the data warehouse without having to redo any previous work, mimicking the ideas of isolated semantic temporal data put forth in DW 2.0 architecture (Rönnbäck et al., 2010). When anchor models are translated to physical database designs, content changes are emulated using standard 6NF principles and the tables in the resulting relational database will be 6NF-compliant. Pieces of data are tied to points in time or intervals of time. Time points are modeled as attributes (i.e., the time a part is shipped), and intervals of time are modeled as a historization of attributes or ties (i.e., times during which a part was in stock). Transaction time elements, such as the time information entered or was updated in the database, are handled by metadata and are not included in the standard anchor modeling constructs. Anchor Modeling provides a graphical notation for conceptual database modeling similar to Entity-Relationship (ER) modeling, with extensions for temporal data. The model is based on 4 constructs: the Anchor, Attribute, Tie, and Knot (see Figure 8). Anchors model entities and events, Attributes model properties of Anchors, Ties model relationships between Anchors, and Knots model shared properties. By using a double line, an Attribute or a Tie can show a history of changes for the information they model. In Figure 8, the double line around attributes SUADR and COTRM and tie SUPA indicates that a history of changes is kept for supplier addresses, contract terms, and supplier-part relationships. Each of these entities is likely to change in content over time suppliers may change addresses, contract terms may be updated or renewed, and the company may change which suppliers they order certain parts from. Proceedings of the Southern Association for Information Systems Conference, Atlanta, GA, USA March 23 rd -24 th,
5 Figure 8. Anchor Modeling Constructs and a Basic Example The resulting anchor model can further be translated into a model useful for data warehousing. Using the Dimension Fact Model (DFM) technique put forth by Golfarelli and Rizzi (2009), we can translate the anchor model relationships and entities into dimensions and facts (see Figure 9). Fact DELIVERY can be generated which indicates the delivery of a certain part to a customer. Pertinent information can then be built around this fact for dimensions SUPPLIER, PART, and CONTRACT. To account for temporal aspects of the data, the DATE dimension is added. Note that although the DFM can be translated from an anchor model, it is not strictly 6NF compliant. It is simply used here as a traditional conceptual DW modeling technique. Figure 9. Anchor Data Modeled as Conceptual Dimension Fact Model Proceedings of the Southern Association for Information Systems Conference, Atlanta, GA, USA March 23 rd -24 th,
6 CONCLUSION With the emergence of data warehousing and the emphasis placed on the advantages of using temporal modeling in DW 2.0, the issue of how to store and model changes to data over time has become one of practical importance to business and industry. In this paper, we have explored a generalization of temporal aspects in database and data warehouse design using Sixth Normal Form (6NF) and Anchor Modeling techniques. By using such methods, snapshots of data may be captured which generate a historical record of data by which an analyst or end-user can easily locate and retrieve information while also providing a business with a technological infrastructure that can withstand change over time. There is now a need to describe and present temporal data to disparate data warehouse systems in a structured format suitable for data exchange across the internet and other networked resources. Since XML has become the preferred means of data exchange, an adequate definition of a standard XML Schema document definition template that describes 6NF data is desired. It is our intention to further this research with the result of producing an original standardized XML Schema mechanism for exchanging such information. We also hope to indicate advantages in application areas benefiting from the use of the proposed formats, as well as limitations. Researchers have previously described an XML anchor schema definition in the paper From Anchor to XML (Rönnbäck, Regardt, Bergholtz, Johannesson and Wohed, 2010). We hope to expand upon this work by using metamodels to address a logical level of design in defining a XML schema profile similar to one previously employed to describe Unified Modeling Language (UML) class diagrams (Routlige, Bird, and Goodchild, 2002). Ultimately, the outcome should define a method for using XML schema to communicate with DW 2.0-compliant temporal data warehouses. REFERENCES 1. Date, C.J., Darwen, H. and Lorentzos, N. (2002) Temporal data and the relational model: A detailed investigation into the application of interval and relation theory to the problem of temporal database management, Morgan Kaufmann Publishers, Amsterdam. 2. Golfarelli, M. and Rizzi, S. (2009) Data warehouse design: Modern principles and methodologies, McGraw Hill, New York. 3. Inmon, W.H., Strauss, D. and Neushloss, G. (2008) DW 2.0: The architecture for the next generation of data warehousing, Morgan Kaufmann Publishers, Amsterdam. 4. Rönnbäck, L., Regardt, O., Bergholtz, M., Johannesson, P. and Wohed, P. (2010) Anchor modeling - agile information modeling in evolving data environments, Data & Knowledge Engineering, Rönnbäck, L., Regardt, O., Bergholtz, M., Johannesson, P. and Wohed, P. (2010) From anchor model to XML, 6. Routlige, N., Bird, L. and Goodchild, A. (2002) UML and XML Schema, 13 th Australian Database Conference, Melbourne, Australia, Snodgrass, R. and Ahn, I. (1986) Temporal databases, IEEE Computer, 19(9), Proceedings of the Southern Association for Information Systems Conference, Atlanta, GA, USA March 23 rd -24 th,
Trends in Data Warehouse Data Modeling: Data Vault and Anchor Modeling
Trends in Data Warehouse Data Modeling: Data Vault and Anchor Modeling Thanks for Attending! Roland Bouman, Leiden the Netherlands MySQL AB, Sun, Strukton, Pentaho (1 nov) Web- and Business Intelligence
IT2305 Database Systems I (Compulsory)
Database Systems I (Compulsory) INTRODUCTION This is one of the 4 modules designed for Semester 2 of Bachelor of Information Technology Degree program. CREDITS: 04 LEARNING OUTCOMES On completion of this
IT2304: Database Systems 1 (DBS 1)
: Database Systems 1 (DBS 1) (Compulsory) 1. OUTLINE OF SYLLABUS Topic Minimum number of hours Introduction to DBMS 07 Relational Data Model 03 Data manipulation using Relational Algebra 06 Data manipulation
Databases What the Specification Says
Databases What the Specification Says Describe flat files and relational databases, explaining the differences between them; Design a simple relational database to the third normal form (3NF), using entityrelationship
14 Databases. Source: Foundations of Computer Science Cengage Learning. Objectives After studying this chapter, the student should be able to:
14 Databases 14.1 Source: Foundations of Computer Science Cengage Learning Objectives After studying this chapter, the student should be able to: Define a database and a database management system (DBMS)
Data warehouse Architectures and processes
Database and data mining group, Data warehouse Architectures and processes DATA WAREHOUSE: ARCHITECTURES AND PROCESSES - 1 Database and data mining group, Data warehouse architectures Separation between
Temporal features in SQL:2011
Temporal features in SQL:2011 Krishna Kulkarni, Jan-Eike Michels (IBM Corporation) {krishnak, janeike}@us.ibm.com ABSTRACT SQL:2011 was published in December of 2011, replacing SQL:2008 as the most recent
How To Write A Diagram
Data Model ing Essentials Third Edition Graeme C. Simsion and Graham C. Witt MORGAN KAUFMANN PUBLISHERS AN IMPRINT OF ELSEVIER AMSTERDAM BOSTON LONDON NEW YORK OXFORD PARIS SAN DIEGO SAN FRANCISCO SINGAPORE
Databases in Organizations
The following is an excerpt from a draft chapter of a new enterprise architecture text book that is currently under development entitled Enterprise Architecture: Principles and Practice by Brian Cameron
Data Hierarchy. Traditional File based Approach. Hierarchy of Data for a Computer-Based File
Management Information Systems Data and Knowledge Management Dr. Shankar Sundaresan (Adapted from Introduction to IS, Rainer and Turban) LEARNING OBJECTIVES Recognize the importance of data, issues involved
Data Warehouse Design
Data Warehouse Design Modern Principles and Methodologies Matteo Golfarelli Stefano Rizzi Translated by Claudio Pagliarani Mc Grauu Hill New York Chicago San Francisco Lisbon London Madrid Mexico City
CDC UNIFIED PROCESS PRACTICES GUIDE
Purpose The purpose of this document is to provide guidance on the practice of Modeling and to describe the practice overview, requirements, best practices, activities, and key terms related to these requirements.
Data Modeling Basics
Information Technology Standard Commonwealth of Pennsylvania Governor's Office of Administration/Office for Information Technology STD Number: STD-INF003B STD Title: Data Modeling Basics Issued by: Deputy
2 Associating Facts with Time
TEMPORAL DATABASES Richard Thomas Snodgrass A temporal database (see Temporal Database) contains time-varying data. Time is an important aspect of all real-world phenomena. Events occur at specific points
Application Of Business Intelligence In Agriculture 2020 System to Improve Efficiency And Support Decision Making in Investments.
Application Of Business Intelligence In Agriculture 2020 System to Improve Efficiency And Support Decision Making in Investments Anuraj Gupta Department of Electronics and Communication Oriental Institute
Generating Enterprise Applications from Models
Generating Enterprise Applications from Models Vinay Kulkarni, R Venkatesh, Sreedhar Reddy Tata Research Development and Design Centre, 54, Industrial estate, Hadapsar, Pune, 411 013, INDIA { vinayk, rvenky,
Dimensional Modeling for Data Warehouse
Modeling for Data Warehouse Umashanker Sharma, Anjana Gosain GGS, Indraprastha University, Delhi Abstract Many surveys indicate that a significant percentage of DWs fail to meet business objectives or
Chapter 1: Introduction. Database Management System (DBMS) University Database Example
This image cannot currently be displayed. Chapter 1: Introduction Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Database Management System (DBMS) DBMS contains information
CHECKING AND VERIFYING TEMPORAL DATA VALIDITY USING VALID TIME TEMPORAL DIMENSION AND QUERIES IN ORACLE 12C
CHECKING AND VERIFYING TEMPORAL DATA VALIDITY USING VALID TIME TEMPORAL DIMENSION AND QUERIES IN ORACLE 12C ABSTRACT Jaypalsinh A. Gohil 1 and Dr. Prashant M. Dolia 2 1 Assistant Professor & Research Scholar,
Anchor Modeling. Agile Information Modeling in Evolving Data Environments. Resight, Kungsgatan 66, 101 30 Stockholm, Sweden. O.
Anchor Modeling Agile Information Modeling in Evolving Data Environments L. Rönnbäck Resight, Kungsgatan 66, 101 30 Stockholm, Sweden O. Regardt Teracom, Kaknästornet, 102 52 Stockholm, Sweden M. Bergholtz,
Data Warehouse: Introduction
Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,
Physical Database Design and Tuning
Chapter 20 Physical Database Design and Tuning Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley 1. Physical Database Design in Relational Databases (1) Factors that Influence
How To Improve Performance In A Database
Some issues on Conceptual Modeling and NoSQL/Big Data Tok Wang Ling National University of Singapore 1 Database Models File system - field, record, fixed length record Hierarchical Model (IMS) - fixed
Relational Database Basics Review
Relational Database Basics Review IT 4153 Advanced Database J.G. Zheng Spring 2012 Overview Database approach Database system Relational model Database development 2 File Processing Approaches Based on
PartJoin: An Efficient Storage and Query Execution for Data Warehouses
PartJoin: An Efficient Storage and Query Execution for Data Warehouses Ladjel Bellatreche 1, Michel Schneider 2, Mukesh Mohania 3, and Bharat Bhargava 4 1 IMERIR, Perpignan, FRANCE [email protected] 2
1. Physical Database Design in Relational Databases (1)
Chapter 20 Physical Database Design and Tuning Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley 1. Physical Database Design in Relational Databases (1) Factors that Influence
CS143 Notes: Normalization Theory
CS143 Notes: Normalization Theory Book Chapters (4th) Chapters 7.1-6, 7.8, 7.10 (5th) Chapters 7.1-6, 7.8 (6th) Chapters 8.1-6, 8.8 INTRODUCTION Main question How do we design good tables for a relational
AMSTERDAM BOSTON HEIDELBERG LONDON NEW YORK OXFORD PARIS SAN DIEGO SAN FRANCISCO SINGAPORE SYDNEY TOKYO
DW2.0 The Architecture for the Next Generation of Data Warehousing W. H. Inmon Forest Rim Technology Derek Strauss Gavroshe Genia Neushloss Gavroshe AMSTERDAM BOSTON HEIDELBERG LONDON NEW YORK OXFORD PARIS
Database Systems Concepts, Languages and Architectures
These slides are for use with Database Systems Concepts, Languages and Architectures Paolo Atzeni Stefano Ceri Stefano Paraboschi Riccardo Torlone To view these slides on-screen or with a projector use
A Generic Database Schema for CIDOC-CRM Data Management
A Generic Database Schema for CIDOC-CRM Data Management Kai Jannaschk 1, Claas Anders Rathje 1, Bernhard Thalheim 1 and Frank Förster 2 1 Christian-Albrechts-University at Kiel, Information Systems Engineering
Introduction. Chapter 1. Introducing the Database. Data vs. Information
Chapter 1 Objectives: to learn The difference between data and information What a database is, the various types of databases, and why they are valuable assets for decision making The importance of database
Comparing Microsoft SQL Server 2005 Replication and DataXtend Remote Edition for Mobile and Distributed Applications
Comparing Microsoft SQL Server 2005 Replication and DataXtend Remote Edition for Mobile and Distributed Applications White Paper Table of Contents Overview...3 Replication Types Supported...3 Set-up &
Data Warehousing Systems: Foundations and Architectures
Data Warehousing Systems: Foundations and Architectures Il-Yeol Song Drexel University, http://www.ischool.drexel.edu/faculty/song/ SYNONYMS None DEFINITION A data warehouse (DW) is an integrated repository
Database Design Overview. Conceptual Design ER Model. Entities and Entity Sets. Entity Set Representation. Keys
Database Design Overview Conceptual Design. The Entity-Relationship (ER) Model CS430/630 Lecture 12 Conceptual design The Entity-Relationship (ER) Model, UML High-level, close to human thinking Semantic
Oracle Data Integrator: Administration and Development
Oracle Data Integrator: Administration and Development What you will learn: In this course you will get an overview of the Active Integration Platform Architecture, and a complete-walk through of the steps
Metadata Management for Data Warehouse Projects
Metadata Management for Data Warehouse Projects Stefano Cazzella Datamat S.p.A. [email protected] Abstract Metadata management has been identified as one of the major critical success factor
A Model-based Software Architecture for XML Data and Metadata Integration in Data Warehouse Systems
Proceedings of the Postgraduate Annual Research Seminar 2005 68 A Model-based Software Architecture for XML and Metadata Integration in Warehouse Systems Abstract Wan Mohd Haffiz Mohd Nasir, Shamsul Sahibuddin
Data warehouse design
DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Database and data mining group, Data warehouse design DATA WAREHOUSE: DESIGN - 1 Risk factors Database
Multidimensional Modeling - Stocks
Bases de Dados e Data Warehouse 06 BDDW 2006/2007 Notice! Author " João Moura Pires ([email protected])! This material can be freely used for personal or academic purposes without any previous authorization
City University of Hong Kong. Information on a Course offered by Department of Computer Science with effect from Semester A in 2014 / 2015
City University of Hong Kong Information on a Course offered by Department of Computer Science with effect from Semester A in 2014 / 2015 Part I Course Title: Database Systems Course Code: CS3402 Course
CONCEPTUAL DATA VAULT MODEL
CONCEPTUAL DATA VAULT MODEL Vladan Jovanovic Georgia Southern University [email protected] Ivan Bojicic Belgrade University [email protected] ABSTRACT In this paper a novel conceptual data
Trivadis White Paper. Comparison of Data Modeling Methods for a Core Data Warehouse. Dani Schnider Adriano Martino Maren Eschermann
Trivadis White Paper Comparison of Data Modeling Methods for a Core Data Warehouse Dani Schnider Adriano Martino Maren Eschermann June 2014 Table of Contents 1. Introduction... 3 2. Aspects of Data Warehouse
Bridge from Entity Relationship modeling to creating SQL databases, tables, & relations
1 Topics for this week: 1. Good Design 2. Functional Dependencies 3. Normalization Readings for this week: 1. E&N, Ch. 10.1-10.6; 12.2 2. Quickstart, Ch. 3 3. Complete the tutorial at http://sqlcourse2.com/
A Data Warehouse Design for A Typical University Information System
(JCSCR) - ISSN 2227-328X A Data Warehouse Design for A Typical University Information System Youssef Bassil LACSC Lebanese Association for Computational Sciences Registered under No. 957, 2011, Beirut,
Data Warehousing and Data Mining
Data Warehousing and Data Mining Part I: Data Warehousing Gao Cong [email protected] Slides adapted from Man Lung Yiu and Torben Bach Pedersen Course Structure Business intelligence: Extract knowledge
Subject Description Form
Subject Description Form Subject Code Subject Title COMP417 Data Warehousing and Data Mining Techniques in Business and Commerce Credit Value 3 Level 4 Pre-requisite / Co-requisite/ Exclusion Objectives
DATABASE INTRODUCTION
Introduction The history of database system research is one of exceptional productivity and startling economic impact. We have learnt that from the days of file-based systems there are better ways to handle
Course Syllabus For Operations Management. Management Information Systems
For Operations Management and Management Information Systems Department School Year First Year First Year First Year Second year Second year Second year Third year Third year Third year Third year Third
An Approach for Facilating Knowledge Data Warehouse
International Journal of Soft Computing Applications ISSN: 1453-2277 Issue 4 (2009), pp.35-40 EuroJournals Publishing, Inc. 2009 http://www.eurojournals.com/ijsca.htm An Approach for Facilating Knowledge
Normalization in Database Design
in Database Design Marek Rychly [email protected] Strathmore University, @ilabafrica & Brno University of Technology, Faculty of Information Technology Advanced Databases and Enterprise Systems 14
CS 377 Database Systems. Database Design Theory and Normalization. Li Xiong Department of Mathematics and Computer Science Emory University
CS 377 Database Systems Database Design Theory and Normalization Li Xiong Department of Mathematics and Computer Science Emory University 1 Relational database design So far Conceptual database design
Using Master Data in Business Intelligence
helping build the smart business Using Master Data in Business Intelligence Colin White BI Research March 2007 Sponsored by SAP TABLE OF CONTENTS THE IMPORTANCE OF MASTER DATA MANAGEMENT 1 What is Master
Database Systems Introduction Dr P Sreenivasa Kumar
Database Systems Introduction Dr P Sreenivasa Kumar Professor CS&E Department I I T Madras 1 Introduction What is a Database? A collection of related pieces of data: Representing/capturing the information
Security Issues for the Semantic Web
Security Issues for the Semantic Web Dr. Bhavani Thuraisingham Program Director Data and Applications Security The National Science Foundation Arlington, VA On leave from The MITRE Corporation Bedford,
CHAPTER 6 DATABASE MANAGEMENT SYSTEMS. Learning Objectives
CHAPTER 6 DATABASE MANAGEMENT SYSTEMS Management Information Systems, 10 th edition, By Raymond McLeod, Jr. and George P. Schell 2007, Prentice Hall, Inc. 1 Learning Objectives Understand the hierarchy
A Metamodel for Master Data
A Metamodel for Master Data Baba Piprani 1 and Suneil Dham 2 1 MetaGlobal Systems, Canada 2 Office of the Superintendent of Financial Institutions, Canada [email protected], [email protected]
Relational Database Design: FD s & BCNF
CS145 Lecture Notes #5 Relational Database Design: FD s & BCNF Motivation Automatic translation from E/R or ODL may not produce the best relational design possible Sometimes database designers like to
META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING
META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING Ramesh Babu Palepu 1, Dr K V Sambasiva Rao 2 Dept of IT, Amrita Sai Institute of Science & Technology 1 MVR College of Engineering 2 [email protected]
Software Architecture Document
Software Architecture Document Natural Language Processing Cell Version 1.0 Natural Language Processing Cell Software Architecture Document Version 1.0 1 1. Table of Contents 1. Table of Contents... 2
POLAR IT SERVICES. Business Intelligence Project Methodology
POLAR IT SERVICES Business Intelligence Project Methodology Table of Contents 1. Overview... 2 2. Visualize... 3 3. Planning and Architecture... 4 3.1 Define Requirements... 4 3.1.1 Define Attributes...
COMHAIRLE NÁISIÚNTA NA NATIONAL COUNCIL FOR VOCATIONAL AWARDS PILOT. Consultative Draft Module Descriptor. Relational Database
COMHAIRLE NÁISIÚNTA NA gcáilíochtaí GAIRMOIDEACHAIS NATIONAL COUNCIL FOR VOCATIONAL AWARDS PILOT Consultative Draft Module Descriptor Relational Database Level 3 C30147 December 1998 1 Title Relational
SQL Server. 1. What is RDBMS?
SQL Server 1. What is RDBMS? Relational Data Base Management Systems (RDBMS) are database management systems that maintain data records and indices in tables. Relationships may be created and maintained
Fundamentals of Database System
Fundamentals of Database System Chapter 4 Normalization Fundamentals of Database Systems (Chapter 4) Page 1 Introduction To Normalization In general, the goal of a relational database design is to generate
Chapter 1: Introduction
Chapter 1: Introduction Database System Concepts, 5th Ed. See www.db book.com for conditions on re use Chapter 1: Introduction Purpose of Database Systems View of Data Database Languages Relational Databases
Data Modeling for Big Data
Data Modeling for Big Data by Jinbao Zhu, Principal Software Engineer, and Allen Wang, Manager, Software Engineering, CA Technologies In the Internet era, the volume of data we deal with has grown to terabytes
Contents RELATIONAL DATABASES
Preface xvii Chapter 1 Introduction 1.1 Database-System Applications 1 1.2 Purpose of Database Systems 3 1.3 View of Data 5 1.4 Database Languages 9 1.5 Relational Databases 11 1.6 Database Design 14 1.7
Course: CSC 222 Database Design and Management I (3 credits Compulsory)
Course: CSC 222 Database Design and Management I (3 credits Compulsory) Course Duration: Three hours per week for 15weeks with practical class (45 hours) As taught in 2010/2011 session Lecturer: Oladele,
Fragmentation and Data Allocation in the Distributed Environments
Annals of the University of Craiova, Mathematics and Computer Science Series Volume 38(3), 2011, Pages 76 83 ISSN: 1223-6934, Online 2246-9958 Fragmentation and Data Allocation in the Distributed Environments
Chapter 15 Basics of Functional Dependencies and Normalization for Relational Databases
Chapter 15 Basics of Functional Dependencies and Normalization for Relational Databases Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 15 Outline Informal Design Guidelines
Schema Design and Normal Forms Sid Name Level Rating Wage Hours
Entity-Relationship Diagram Schema Design and Sid Name Level Rating Wage Hours Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke 1 Database Management Systems, 2 nd Edition. R. Ramakrishnan
C# Cname Ccity.. P1# Date1 Qnt1 P2# Date2 P9# Date9 1 Codd London.. 1 21.01 20 2 23.01 2 Martin Paris.. 1 26.10 25 3 Deen London.. 2 29.
4. Normalisation 4.1 Introduction Suppose we are now given the task of designing and creating a database. How do we produce a good design? What relations should we have in the database? What attributes
CHAPTER : 5. MODELS & HYBRID METHODS
CHAPTER : 5. MODELS & HYBRID METHODS 5.1 Data Warehouse Modeling Data warehouse modeling is the process of designing the schemas of the detailed and summarised data of the data warehouse. The aim of data
Databases -Normalization III. (N Spadaccini 2010 and W Liu 2012) Databases - Normalization III 1 / 31
Databases -Normalization III (N Spadaccini 2010 and W Liu 2012) Databases - Normalization III 1 / 31 This lecture This lecture describes 3rd normal form. (N Spadaccini 2010 and W Liu 2012) Databases -
Database Schema Management
Whitemarsh Information Systems Corporation 2008 Althea Lane Bowie, Maryland 20716 Tele: 301-249-1142 Email: [email protected] Web: www.wiscorp.com Table of Contents 1. Objective...1 2. Topics Covered...2
Development and Management
Cloud Database Development and Management Lee Chao CRC Press Taylor & Francis Group Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Croup, an Informa business AN AUERBACH BOOK
Implementation of Model-View-Controller Architecture Pattern for Business Intelligence Architecture
Implementation of -- Architecture Pattern for Business Intelligence Architecture Medha Kalelkar Vidyalankar Institute of Technology, University of Mumbai, Mumbai, India Prathamesh Churi Lecturer, Department
Chapter 5. Learning Objectives. DW Development and ETL
Chapter 5 DW Development and ETL Learning Objectives Explain data integration and the extraction, transformation, and load (ETL) processes Basic DW development methodologies Describe real-time (active)
ISM 318: Database Systems. Objectives. Database. Dr. Hamid R. Nemati
ISM 318: Database Systems Dr. Hamid R. Nemati Department of Information Systems Operations Management Bryan School of Business Economics Objectives Underst the basics of data databases Underst characteristics
The Evolution of the Data Warehouse Systems in Recent Years
Jacek Maślankowski * The Evolution of the Data Warehouse Systems in Recent Years Introduction Although data warehouses are used in enterprises for a long time, they has evaluated recently. In the last
Chapter 1. Database Systems. Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel
Chapter 1 Database Systems Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel 1 In this chapter, you will learn: The difference between data and information What a
æ A collection of interrelated and persistent data èusually referred to as the database èdbèè.
CMPT-354-Han-95.3 Lecture Notes September 10, 1995 Chapter 1 Introduction 1.0 Database Management Systems 1. A database management system èdbmsè, or simply a database system èdbsè, consists of æ A collection
Model Driven Interoperability through Semantic Annotations using SoaML and ODM
Model Driven Interoperability through Semantic Annotations using SoaML and ODM JiuCheng Xu*, ZhaoYang Bai*, Arne J.Berre*, Odd Christer Brovig** *SINTEF, Pb. 124 Blindern, NO-0314 Oslo, Norway (e-mail:
TopBraid Insight for Life Sciences
TopBraid Insight for Life Sciences In the Life Sciences industries, making critical business decisions depends on having relevant information. However, queries often have to span multiple sources of information.
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria
Rotorcraft Health Management System (RHMS)
AIAC-11 Eleventh Australian International Aerospace Congress Rotorcraft Health Management System (RHMS) Robab Safa-Bakhsh 1, Dmitry Cherkassky 2 1 The Boeing Company, Phantom Works Philadelphia Center
Theory of Relational Database Design and Normalization
Theory of Relational Database Design and Normalization (Based on Chapter 14 and some part of Chapter 15 in Fundamentals of Database Systems by Elmasri and Navathe, Ed. 3) 1 Informal Design Guidelines for
Conceptual Design Using the Entity-Relationship (ER) Model
Conceptual Design Using the Entity-Relationship (ER) Model Module 5, Lectures 1 and 2 Database Management Systems, R. Ramakrishnan 1 Overview of Database Design Conceptual design: (ER Model is used at
