Challenges in digital preservation: Relational databases

Mark Brogan and Justin Brown
School of Computer and Information Science
Edith Cowan University
Perth, Western Australia 6050

Abstract

Chen (2001) coined the term "digital preservation paradox" to describe a philosophy of preservation management that recognises change as a cornerstone of managing digital records for permanence. Paradox also surrounds use of the terms "structured" and "unstructured". As more and more so-called unstructured information sources are generated from structured repositories, preservation planning for unstructured sources must expand to encompass the structured information sources from which they are created. This paper reviews strategies and methods for database preservation, including issues and methods at enterprise level. A case study based on a widely used XML normalisation tool (MS Access) is used to investigate the advantages and disadvantages of XML normalisation as a preservation strategy.

The rise of the structured information source

In computing, the document has undergone profound changes since the emergence of information technology for business applications in the 1960s. In the early stage of its evolution, the document was unitary, proprietary and dumb. The open systems movement challenged the proprietary character of the document and led to a concept of the document as interoperable and accessible outside the creating application context. The unitary character of the document was also challenged by the Web revolution of the 1990s, which created the phenomenon of the compound or so-called virtual document (Reinhardt, 1994). As the importance of metadata descriptions of documents became established in the 1990s, documents acquired metadata wrappers describing authorship, content and change history and ceased to be dumb. These changes describe trajectories in document engineering over the past thirty years.
Since 2000, the ideas of content re-use and re-purposing have further shaped the evolution of the document. As earlier developments in document engineering have added structure to the document, so too has today's content management trajectory. This is most plainly seen in convergence on Extensible Markup Language (XML) as the underlying technology of what were once unstructured types. For example, with Microsoft Office 2003, Microsoft made the transition to XML as the technology foundation of its Office productivity suite. By using XML format, an organisation can:

set up an environment for authors to create documents with a consistent look and feel, while at the same time facilitating re-use of content. The Office Open XML format enables an organisation to define templates using an XML schema that is most suitable for their business requirements. This schema can consist of tags that correspond to various sections of a document, such as <Executive Summary/>, <MainBody/>, and <Conclusion/>. Re-use of content is possible because the content is automatically tagged with the appropriate XML code and can be programmatically processed for document assembly, distribution, and conversion (Microsoft, 2006, pp. 15-16).

Parallel developments in the open source community have delivered Open Document Format (ODF), a competing open, XML-based file format for electronic office documents such as spreadsheets, charts, presentations and word processing documents. Together these developments have placed what were once unstructured information sources on a structured trajectory.

Cognitive dissonance: Structured and unstructured information sources

While the user experience of documents continues to suggest their unitary, unstructured character, the perception of types as being either structured or unstructured is increasingly misleading. Such a perception disguises the real trend towards increasingly structured documents, many of which now emanate from technologies more appropriately described as databases or XML databases. Current trends in document engineering are poorly reflected in records programs and digital preservation, where the abiding concern continues to be unstructured types. So why isn't the current trajectory in document engineering recognised? Some of the responsibility for this can be sheeted home to the distinctions made by information and information systems managers between data, information and records. For example, many records managers believe that their sole purpose is to manage records.
Innocent enough, but since records are considered a species of document, and databases repositories of data, the records program is often defined specifically to exclude databases. As risk management discourse increasingly emphasises the importance of data retention policy, records managers and other Information Management (IM) professionals are being drawn reluctantly into a dialogue with Information Systems (IS) professionals about structured information sources. Further, as the locus of recordkeeping increasingly switches to structured information sources, realignment of the records program will have to take place, particularly since information systems professionals regard data retention as a minefield and are actively seeking IM perspectives in this area.

Foundation concepts in database archiving

As the focus of retention policy expands to encompass databases, IM and IS professionals are actively contemplating methods and tools for database archiving. Review of the literature shows that database archiving is discarding some of its early ambiguity and taking shape around the core concept of long term retention of usable digital memory as a core component of corporate

governance (Gartner, 2006). To understand these developments and to successfully deploy business solutions, it is important to understand foundation concepts in database archiving.

These days most databases are relational. A relational database stores data in a series of related tables. Each table consists of columns and rows. These entities correspond to the user view of a database. Other frames concern the developer and physical layer views of the database. These are described in Table 1. The left hand column of the table describes the physical or file processing model of the database; the middle and right hand columns describe the logical model:

File Processing Environment    Relational developer db    Relational db user
File                           Relation                   Table
Record                         Tuple                      Row
Field                          Attribute                  Column

Table 1- Relational database frames

Relations, tuples and attributes are concepts from the systems analysis domain that may be encountered in project documentation, but are also commonly used in connection with the theory of databases. In the implementation of databases, these theoretical constructs are translated respectively as tables, rows and columns, which describe the user view. At the file processing level, tables are equivalent to files, rows to records and columns to fields. Relationships between tables are defined so as to create rules by which data manipulation in one table causes corresponding changes in another. This governance of behaviour is a means by which integrity is ensured within a relational database. A table relationship is created by establishing a common key between two or more tables. Where a key first appears it is known as a primary key. Where it appears again to establish a relationship it is known as a foreign key.

As part of the evolution of IS thinking about database archiving, the IS view has moved to a notion of archiving familiar to IM professionals.
Gartner (2006) refers to db archiving as: a critical component of Information Life Cycle Management; a tool for implementing data retention policy and meeting compliance requirements; and a key component of corporate governance that enables the retention of usable digital memory for long periods.

This is quite a departure from earlier IS thinking, where archiving was identified with hierarchical storage tiering (HSM etc.), backing up or copying data (data management), or taking data offline. The discussion so far is sufficient to identify how and why database archiving must differ from other forms of archiving. Database archiving differs from file-level archiving because data is stored in tables, and the rows and columns within those tables are all tightly linked. Unlike files, which are self-contained, information in a database is found in rows and columns that depend on other rows and columns. Primary and foreign keys are an example. Consequently, a piece of information in a row or a column cannot be selectively taken out of the database and moved to an archive with standard archiving tools. Further, the meaning and evidential value of information in databases can often only be assessed in relation to various metadata sources that describe data organisation. Such sources include data dictionaries that describe rows, columns and data types, and Entity-Relationship (E-R) diagrams. Structured information sources are also highly system dependent and may undergo near constant content change. If archiving structured information sources is different from archiving unstructured sources, what possibilities exist for database archiving?

Framing database archiving

On reflection, the following possibilities exist regarding the record(s) within a database (Digital Preservation Testbed, 2003): the complete database system (database, Relational Database Management System (RDBMS), and application) together constitutes the digital record; the database is the digital record; a single row of data stored in a database table (i.e. a tuple) is the digital record; data distributed over a number of tables constitutes the digital record; information in the database as displayed onscreen by the application forms the digital record.
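The dependency between rows described above, and the difference between the single-row frame and the multi-table frame, can be sketched in a few lines of Python. This is only an illustration and is not taken from the paper: it uses SQLite through Python's standard sqlite3 module as a stand-in RDBMS, and the patient and screening tables, their columns and their values are hypothetical.

```python
import sqlite3

# Hypothetical two-table schema; table and column names are illustrative only.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled
conn.executescript("""
    CREATE TABLE patient (
        patient_id INTEGER PRIMARY KEY,
        name       TEXT NOT NULL
    );
    CREATE TABLE screening (
        screening_id INTEGER PRIMARY KEY,
        patient_id   INTEGER NOT NULL REFERENCES patient(patient_id),
        screened_on  TEXT NOT NULL,
        result       TEXT NOT NULL
    );
    INSERT INTO patient VALUES (1, 'A. Example');
    INSERT INTO screening VALUES (10, 1, '2007-03-01', 'clear');
""")

# Frame: a single row as the record. It carries a key value, not the patient.
single_row = conn.execute(
    "SELECT * FROM screening WHERE screening_id = 10").fetchone()

# Frame: data distributed over tables, reassembled through the key relationship.
joined_row = conn.execute("""
    SELECT p.name, s.screened_on, s.result
    FROM screening AS s
    JOIN patient AS p ON p.patient_id = s.patient_id
    WHERE s.screening_id = 10
""").fetchone()

# Taking the parent row out while a dependent row remains is rejected.
try:
    conn.execute("DELETE FROM patient WHERE patient_id = 1")
    parent_removed = True
except sqlite3.IntegrityError:
    parent_removed = False

print(single_row)
print(joined_row)
print(parent_removed)
```

The single row is interpretable only alongside the table it points to, which is why a file-style tool that lifts one row or one table out of the database tends to break either the record's meaning or the database's integrity.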
There is no magic wand that can be applied to determine which of these frames of the record is best matched to discovered value(s). There are methodologies that purport to help, such as DIRKS (NAA, 2001), but archivists' understanding of appraisal informs the decision making process. Conclusions from appraisal lead to an understanding of the most appropriate frame and the selection of preservation tactics.

Methods and tools for RDBMS archiving

Methods and tools for db archiving assume particular appraisal and disposition outcomes. The following is a survey of methods and tools, considered independently of appraisal outcome. Two methods are distinguished in the literature: active and inactive archiving.

I. Active Archiving

Moving (archiving) records from one RDBMS table to another table located in the same or a different database instance, while preserving query and other core archival functions, defines an active archiving approach. Microsoft is an exponent of this, the simplest method of database archiving (Microsoft, 2007). With active archiving, records are moved from an existing table to another table with the same structure and organisation as the source table. Structured Query Language (SQL) is used to select, copy and delete records from the source table. The user or application selectively switches context (the database connection) for historical queries, directing them to the archive instance. Figure 1 describes this process:

[Figure 1- Active archiving: an archive table has the same structure and organisation as the source table; SQL is used to move data from the source table to the archive table.]

The main advantage of such an approach is that it preserves user views and core functionality. By removing inactive records from the current domain of the database, information retrieval efficiency is also improved. Disadvantages of proprietary active archiving include additional licensing costs and the deferral of long term digital preservation planning.

II. Inactive Archiving (Migration)

Inactive archiving is the migration (archiving) of records from an existing system to a newer hardware/software environment based on another system. Migration to a new system when an existing relational database system enters the legacy phase of its working life is a very common form of database archiving. For example, records might be migrated (archived) to: a content management system (CMS); a new database host (e.g. Access->SQL Server); or

a new version of the current database. Since most enterprise RDBMS support backwards compatibility and interoperability, first generation migration can be unproblematic. However, as the underlying information architecture changes over time, subsequent migration may involve loss of fidelity and function. For this reason migration is not usually thought of as a long term preservation solution for complex database systems.

III. Inactive Archiving (XML Normalisation)

The disadvantages of active archiving and migration suggest a role for open standards, particularly where long term preservation is contemplated and preservation of database behaviours is not mission critical. XML normalisation, where records are migrated to XML, has arisen as the standards based approach of choice. Typically XML normalisation is performed as an end of life cycle activity, when databases enter a so-called legacy phase. XML normalisation involves no licensing costs and is highly vendor independent. A trade-off is that XML normalisation results in a flat file format and the loss of database behaviours such as queries, reports and user views. Because the preservation of referential integrity is problematic with XML normalisation, it is generally not applied as part of an active archiving strategy. Figure 2 describes the consequences of XML normalisation for a case study health care database for breast cancer screening:

Figure 2- Inactive Archiving (XML Normalisation)

The products of normalisation typically consist of an XML schema that represents the data model and an XML file that contains the table row data. Importation of normalised archive data into a newer production environment usually requires the writing of importation scripts that work by extracting values based on the node tree described in the XML schema.

Normalisation case study

Particularly for long term preservation, XML normalisation involves clear advantages compared with the other methods discussed. But how good is

normalisation as an archiving method? This question is best answered via case study with current generation normalisation tools. Beginning with MS Access 2003, Microsoft has supported XML normalisation as a migration pathway for Access databases. As a case study and precursor to a broader investigation, the authors undertook an XML normalisation of the MS Access 2003 database referred to in Figure 2 and evaluated the results. The criteria used were those originally developed by the Digital Preservation Testbed Project. These were: authenticity; reliability (data integrity); completeness; and digital characteristics comprising context, content, structure, appearance, behaviour and metadata.

Authenticity

This criterion was deemed to be satisfied if sufficient evidence of authorship existed to establish the provenance of the database, the provenance of records held and their use history. In our trial database, this design feature was not supported. Experience gained suggested that if these features had been supported, then normalisation might have been used successfully to capture the provenance and use history of records in the database.

Reliability (Data integrity)

This criterion was deemed to be satisfied if the integrity of data had been preserved. On inspection, the criterion was found to be satisfied with reservation. The Date/Time type was translated as a text string inclusive of a time stamp not found in record values.

Completeness

This condition was also satisfied subject to reservation. All tables and columns were successfully exported as parent and child nodes in XML. However, relationships between tables were lost. Prima facie, there is no way that table relationships can be represented with fidelity in XML.

Structure

Similarly, this criterion was satisfied subject to reservation. Foreign key values were often translated as numeric (foreign key) rather than text values. A project dictionary would therefore have been required to understand the translation.
The root node itself was correctly translated as a complex type and all parent/child relationships were found to be correct.

Appearance

An XML document viewed in a parser has a very different appearance from a database. XSL transformation might have been used to improve performance against this criterion. However, if recordness is contained within RDBMS-resident views, XML is not a good solution to this problem.
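The Completeness and Structure observations above can be illustrated with a small sketch of table-to-XML normalisation. It is not the MS Access exporter evaluated in the case study: it assumes SQLite and Python's standard library as stand-ins, and the normalise_to_xml helper, the dataroot element name, and the clinic and patients tables are hypothetical names chosen for illustration.

```python
import sqlite3
import xml.etree.ElementTree as ET


def normalise_to_xml(conn, tables):
    """Export each named table as row elements under a single root node."""
    root = ET.Element("dataroot")
    for table in tables:
        cursor = conn.execute(f'SELECT * FROM "{table}"')
        columns = [description[0] for description in cursor.description]
        for row in cursor:
            row_el = ET.SubElement(root, table)
            for column, value in zip(columns, row):
                # Every value is written as text; the database type is not carried over.
                ET.SubElement(row_el, column).text = "" if value is None else str(value)
    return root


conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE clinic (clinic_id INTEGER PRIMARY KEY, clinic_name TEXT);
    CREATE TABLE patients (
        patient_id INTEGER PRIMARY KEY,
        surname    TEXT,
        clinic_id  INTEGER REFERENCES clinic(clinic_id)
    );
    INSERT INTO clinic VALUES (3, 'North Clinic');
    INSERT INTO patients VALUES (1, 'Example', 3);
""")

root = normalise_to_xml(conn, ["clinic", "patients"])
xml_text = ET.tostring(root, encoding="unicode")
print(xml_text)

# The foreign key survives only as a bare number inside the patient row;
# nothing in the output says that it refers to the clinic table.
exported_fk = root.find("patients/clinic_id").text
```

In this sketch every table and column is exported, but the reader of the XML needs outside documentation (the project dictionary mentioned under Structure) to know that the number in clinic_id points at a clinic row.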

Behaviour

Key aspects of behaviour such as input masks and validation rules were not translated during XML conversion. Likewise, no method exists for translation of SQL queries and reports. A conclusion from evaluation against this criterion is that XML normalisation is not well suited to the preservation of database behaviour.

Metadata

The principal metadata outcome from the conversion was an XML Schema describing the vocabulary and structure of a valid patients.xml file. Data types and cardinality were often successfully captured. Primary keys were mostly correctly identified. Table, row and column structure was successfully translated as complex and simple types in correct hierarchical (parent/child) relationship.

Figure 3- Breast Cancer Screening Schema

Subject to the reservations described, the case study delivered a strong endorsement of XML normalisation. Digital Preservation Testbed (2003, p. 33) concluded that XML is the most effective strategy for the durable preservation of databases. XML is highly capable of representing the context, content, and structure of databases.

Enterprise application and tools

While XML offers a durable way of storing the content of an RDBMS as well as its structure, not all enterprise solutions offer an out of the box functionality set that allows data to be moved from the operational to the archival mode. As an example, neither of Microsoft's flagship database solutions, MS SQL Server 2000 and 2005, offers the ability to "Save As" to an XML structure directly. While these products allow for extraction of query results directly to XML output, and even the ability to store XML in a new xml datatype column (Juday, 2007), neither provides the ability to save table or view data to a structured XML document with an accompanying XSD schema. To achieve such functionality within the context of MS SQL Server 2000/2005 requires exportation of data to an

intermediate storage solution, such as Microsoft Access, from which it can then be exported to a structured XML file. In the case study, an observed problem with this approach was structural degradation and loss of fidelity in terms of original RDBMS formats, specifically datatypes and relational links. The left hand side of Figure 4 below shows a source table of data (in Design view) in MS SQL Server. The figure on the right shows the same table design once it has been exported to Microsoft Access.

Figure 4- MS SQL Server 2000 to MS Access

As can be seen, export involves a loss of fidelity in terms of datatypes. Microsoft Access offers only simplistic support for datatypes. For example, the primary key identifier with its associated auto-incrementing counter has been lost on the ProductID column. Perhaps more importantly, as can be seen in Figure 5 below, the relational structure of the MS SQL Server 2000 database is lost during the translation to the Microsoft Access format:

Figure 5- Relational structure and XML Normalisation

In some respects, the loss of datatype information during translation from an enterprise solution to a more basic desktop environment may not be crucial, especially if the source data is not totally reliant on custom or proprietary datatypes for the operation of any linked applications or business logic. However, the loss of relational structure could be a more pressing issue where the original structure of the database needs to be re-instated. In some ways this may not even be an issue, in that MS SQL Server will not import relational structures from a Microsoft Access database anyway, as the Data Definition Language (DDL) implementation is not compatible.

Conclusion

Microsoft SQL Server and Microsoft Access have been used as examples in this discussion of enterprise tools in that together they provide some interesting possibilities for database to XML archiving. Microsoft Access provides strong import and export facilities for XML and XML schema, with the added ability to attach XSL style sheets to exported XML documents for ready display in almost any modern web browser. MS Access datatypes and relational structures can be captured to XML and schema and then re-instated to their original form from the XML archive. However, the limitation of such functionality is that it is an

isolated implementation, with only MS Access data being truly exportable/importable to and from an XML archive. While most of the current enterprise level RDBMS solutions, such as MS SQL Server, Oracle 10g (Murthy, Liu, Krishnaprasad et al., 2005) and MySQL, have varying levels of internal XML support, none exports directly to XML with any level of fidelity for content and structure. This leaves information managers and the organisations that they serve in the position of having to look to third-party vendors to provide the tools for implementing the RDBMS->XML->RDBMS cycle, a cycle described by the need to archive to XML but re-instate back to the enterprise environment. The alternative solution is an asynchronous archiving paradigm, where data is archived to XML with the intention that the data later be retrievable and readable, but not able to be re-instated into the RDBMS from which it came.

References

Chen, Su-Shing (2001, March). Perspectives: The paradox of digital preservation. IEEE Computer

Digital Preservation Testbed. (2003). From digital volatility to digital permanence: preserving databases. Retrieved 10 July, 2007 from

Gartner (2006). Archiving: Technology overview. Gartner Research. no. G

Juday, J. (2007). The fundamentals of the SQL server 2005 XML datatype. [Electronic document]. Available:

Microsoft (2006). Enterprise content management: Breaking the barriers to broad user adoption. Retrieved 11 October 2007 from: pdf

Microsoft (2007). Periodically archived records in an Access database. Retrieved 20 July 2007 from:

Murthy, R, Liu, Z, Krishnaprasad, M, Chandrasekar, S, Tran, A, Sedlar, E, Florescu, D, Kotsovolos, S, Agarwal, N, Arora, V, Krishnamurthy, V. (2005). Towards an enterprise XML architecture. Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data. pp

National Archives of Australia (2001). epermanence: DIRKS a strategic approach to managing business information. Retrieved 11 October 2007 from:

Reinhardt, A. (1994, August). Managing the new document. Byte. Retrieved 11 October 2007 from:


Chapter 1: Introduction

Chapter 1: Introduction Chapter 1: Introduction Database System Concepts, 5th Ed. See www.db book.com for conditions on re use Chapter 1: Introduction Purpose of Database Systems View of Data Database Languages Relational Databases

More information

Data Modeling for Big Data

Data Modeling for Big Data Data Modeling for Big Data by Jinbao Zhu, Principal Software Engineer, and Allen Wang, Manager, Software Engineering, CA Technologies In the Internet era, the volume of data we deal with has grown to terabytes

More information

Archiving and Backup - The Basics

Archiving and Backup - The Basics Introduction Backup and archiving are useful tools for managing data, especially volumes of ageing data. Agency policies for retention and access necessitate that these tools should be used to implement

More information

Tutorial on Relational Database Design

Tutorial on Relational Database Design Tutorial on Relational Database Design Introduction Relational database was proposed by Edgar Codd (of IBM Research) around 1969. It has since become the dominant database model for commercial applications

More information

Choosing A CMS. Enterprise CMS. Web CMS. Online and beyond. Best-of-Breed Content Management Systems. 1300 762 912 info@ares.com.

Choosing A CMS. Enterprise CMS. Web CMS. Online and beyond. Best-of-Breed Content Management Systems. 1300 762 912 info@ares.com. Web CMS Enterprise CMS 1300 762 912 info@ares.com.au Sydney 275 alfred st north sydney nsw 2060 Melbourne 21 burwood road hawthorn vic 3122 Choosing A CMS Best-of-Breed Content Management Systems Brisbane

More information

SQL Databases Course. by Applied Technology Research Center. This course provides training for MySQL, Oracle, SQL Server and PostgreSQL databases.

SQL Databases Course. by Applied Technology Research Center. This course provides training for MySQL, Oracle, SQL Server and PostgreSQL databases. SQL Databases Course by Applied Technology Research Center. 23 September 2015 This course provides training for MySQL, Oracle, SQL Server and PostgreSQL databases. Oracle Topics This Oracle Database: SQL

More information

Chapter 14: Databases and Database Management Systems

Chapter 14: Databases and Database Management Systems 15 th Edition Understanding Computers Today and Tomorrow Comprehensive Chapter 14: Databases and Database Management Systems Deborah Morley Charles S. Parker Copyright 2015 Cengage Learning Learning Objectives

More information

Database as a Service (DaaS) Version 1.02

Database as a Service (DaaS) Version 1.02 Database as a Service (DaaS) Version 1.02 Table of Contents Database as a Service (DaaS) Overview... 4 Database as a Service (DaaS) Benefit... 4 Feature Description... 4 Database Types / Supported Versions...

More information

metaengine DataConnect For SharePoint 2007 Configuration Guide

metaengine DataConnect For SharePoint 2007 Configuration Guide metaengine DataConnect For SharePoint 2007 Configuration Guide metaengine DataConnect for SharePoint 2007 Configuration Guide (2.4) Page 1 Contents Introduction... 5 Installation and deployment... 6 Installation...

More information

Life Cycle of Records

Life Cycle of Records Discard Create Inactive Life Cycle of Records Current Retain Use Semi-current Records Management Policy April 2014 Document title Records Management Policy April 2014 Document author and department Responsible

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Chapter 5 Foundations of Business Intelligence: Databases and Information Management 5.1 Copyright 2011 Pearson Education, Inc. Student Learning Objectives How does a relational database organize data,

More information

Federated, Generic Configuration Management for Engineering Data

Federated, Generic Configuration Management for Engineering Data Federated, Generic Configuration Management for Engineering Data Dr. Rainer Romatka Boeing GPDIS_2013.ppt 1 Presentation Outline I Summary Introduction Configuration Management Overview CM System Requirements

More information

EFFECTIVE STORAGE OF XBRL DOCUMENTS

EFFECTIVE STORAGE OF XBRL DOCUMENTS EFFECTIVE STORAGE OF XBRL DOCUMENTS An Oracle & UBmatrix Whitepaper June 2007 Page 1 Introduction Today s business world requires the ability to report, validate, and analyze business information efficiently,

More information

Managing Third Party Databases and Building Your Data Warehouse

Managing Third Party Databases and Building Your Data Warehouse Managing Third Party Databases and Building Your Data Warehouse By Gary Smith Software Consultant Embarcadero Technologies Tech Note INTRODUCTION It s a recurring theme. Companies are continually faced

More information

Introduction to XML Applications

Introduction to XML Applications EMC White Paper Introduction to XML Applications Umair Nauman Abstract: This document provides an overview of XML Applications. This is not a comprehensive guide to XML Applications and is intended for

More information

Digital Archiving Survey

Digital Archiving Survey Digital Archiving Survey Background information Under the Public Records Act 2002 (the Act), public authorities have a responsibility to ensure that digital records under their control remain accessible

More information

Databases in Organizations

Databases in Organizations The following is an excerpt from a draft chapter of a new enterprise architecture text book that is currently under development entitled Enterprise Architecture: Principles and Practice by Brian Cameron

More information

Files. Files. Files. Files. Files. File Organisation. What s it all about? What s in a file?

Files. Files. Files. Files. Files. File Organisation. What s it all about? What s in a file? Files What s it all about? Information being stored about anything important to the business/individual keeping the files. The simple concepts used in the operation of manual files are often a good guide

More information

Karl Lum Partner, LabKey Software klum@labkey.com. Evolution of Connectivity in LabKey Server

Karl Lum Partner, LabKey Software klum@labkey.com. Evolution of Connectivity in LabKey Server Karl Lum Partner, LabKey Software klum@labkey.com Evolution of Connectivity in LabKey Server Connecting Data to LabKey Server Lowering the barrier to connect scientific data to LabKey Server Increased

More information

Databases What the Specification Says

Databases What the Specification Says Databases What the Specification Says Describe flat files and relational databases, explaining the differences between them; Design a simple relational database to the third normal form (3NF), using entityrelationship

More information

Introduction to Computing. Lectured by: Dr. Pham Tran Vu t.v.pham@cse.hcmut.edu.vn

Introduction to Computing. Lectured by: Dr. Pham Tran Vu t.v.pham@cse.hcmut.edu.vn Introduction to Computing Lectured by: Dr. Pham Tran Vu t.v.pham@cse.hcmut.edu.vn Databases The Hierarchy of Data Keys and Attributes The Traditional Approach To Data Management Database A collection of

More information

Data Warehouses in the Path from Databases to Archives

Data Warehouses in the Path from Databases to Archives Data Warehouses in the Path from Databases to Archives Gabriel David FEUP / INESC-Porto This position paper describes a research idea submitted for funding at the Portuguese Research Agency. Introduction

More information

Long-term Archiving of Relational Databases with Chronos

Long-term Archiving of Relational Databases with Chronos First International Workshop on Database Preservation (PresDB'07) 23 March 2007, at the UK Digital Curation Centre and the Database Group in the School of Informatics, University of Edinburgh Long-term

More information

Management Update: Powerhouse Vendors Implement Document Management

Management Update: Powerhouse Vendors Implement Document Management IGG-10302002-04 G. Landers, K. Shegda, D. Logan Article 30 October 2002 Management Update: Powerhouse Vendors Implement Document Management Interest within enterprises is growing in the management of unstructured

More information

Release 2.1 of SAS Add-In for Microsoft Office Bringing Microsoft PowerPoint into the Mix ABSTRACT INTRODUCTION Data Access

Release 2.1 of SAS Add-In for Microsoft Office Bringing Microsoft PowerPoint into the Mix ABSTRACT INTRODUCTION Data Access Release 2.1 of SAS Add-In for Microsoft Office Bringing Microsoft PowerPoint into the Mix Jennifer Clegg, SAS Institute Inc., Cary, NC Eric Hill, SAS Institute Inc., Cary, NC ABSTRACT Release 2.1 of SAS

More information

Deploying a distributed data storage system on the UK National Grid Service using federated SRB

Deploying a distributed data storage system on the UK National Grid Service using federated SRB Deploying a distributed data storage system on the UK National Grid Service using federated SRB Manandhar A.S., Kleese K., Berrisford P., Brown G.D. CCLRC e-science Center Abstract As Grid enabled applications

More information

Real-time Data Replication

Real-time Data Replication Real-time Data Replication from Oracle to other databases using DataCurrents WHITEPAPER Contents Data Replication Concepts... 2 Real time Data Replication... 3 Heterogeneous Data Replication... 4 Different

More information

B.Sc (Computer Science) Database Management Systems UNIT-V

B.Sc (Computer Science) Database Management Systems UNIT-V 1 B.Sc (Computer Science) Database Management Systems UNIT-V Business Intelligence? Business intelligence is a term used to describe a comprehensive cohesive and integrated set of tools and process used

More information

State Records Office Guideline. Management of Digital Records

State Records Office Guideline. Management of Digital Records State Records Office Guideline Management of Digital Records An Information Management Guideline for State Organizations Version 2 January 2015 www.sro.wa.gov.au Contents GLOSSARY... 2 PURPOSE... 5 BACKGROUND...

More information

Chapter 1: Introduction. Database Management System (DBMS) University Database Example

Chapter 1: Introduction. Database Management System (DBMS) University Database Example This image cannot currently be displayed. Chapter 1: Introduction Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Database Management System (DBMS) DBMS contains information

More information

EDG Project: Database Management Services

EDG Project: Database Management Services EDG Project: Database Management Services Leanne Guy for the EDG Data Management Work Package EDG::WP2 Leanne.Guy@cern.ch http://cern.ch/leanne 17 April 2002 DAI Workshop Presentation 1 Information in

More information

Vendor briefing Business Intelligence and Analytics Platforms Gartner 15 capabilities

Vendor briefing Business Intelligence and Analytics Platforms Gartner 15 capabilities Vendor briefing Business Intelligence and Analytics Platforms Gartner 15 capabilities April, 2013 gaddsoftware.com Table of content 1. Introduction... 3 2. Vendor briefings questions and answers... 3 2.1.

More information

Business Benefits From Microsoft SQL Server Business Intelligence Solutions How Can Business Intelligence Help You? PTR Associates Limited

Business Benefits From Microsoft SQL Server Business Intelligence Solutions How Can Business Intelligence Help You? PTR Associates Limited Business Benefits From Microsoft SQL Server Business Intelligence Solutions How Can Business Intelligence Help You? www.ptr.co.uk Business Benefits From Microsoft SQL Server Business Intelligence (September

More information

IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance

IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance Data Sheet IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance Overview Multidimensional analysis is a powerful means of extracting maximum value from your corporate

More information

COURSE NAME: Database Management. TOPIC: Database Design LECTURE 3. The Database System Life Cycle (DBLC) The database life cycle contains six phases;

COURSE NAME: Database Management. TOPIC: Database Design LECTURE 3. The Database System Life Cycle (DBLC) The database life cycle contains six phases; COURSE NAME: Database Management TOPIC: Database Design LECTURE 3 The Database System Life Cycle (DBLC) The database life cycle contains six phases; 1 Database initial study. Analyze the company situation.

More information

Digital Archives Migration Methodology. A structured approach to the migration of digital records

Digital Archives Migration Methodology. A structured approach to the migration of digital records Digital Archives Migration Methodology A structured approach to the migration of digital records Published July 2014 1 Table of contents Executive summary... 3 What is the Digital Archives Migration Methodology?...

More information

Application Express Web Application Development

Application Express Web Application Development Application Express Web Application Development Agenda What is Oracle Application Express Demonstration Features and benefits Customer examples Conclusion Next steps Q&A Does Your Organization: Use spreadsheets

More information

Toad for Data Analysts, Tips n Tricks

Toad for Data Analysts, Tips n Tricks Toad for Data Analysts, Tips n Tricks or Things Everyone Should Know about TDA Just what is Toad for Data Analysts? Toad is a brand at Quest. We have several tools that have been built explicitly for developers

More information

Microsoft SQL Server for Oracle DBAs Course 40045; 4 Days, Instructor-led

Microsoft SQL Server for Oracle DBAs Course 40045; 4 Days, Instructor-led Microsoft SQL Server for Oracle DBAs Course 40045; 4 Days, Instructor-led Course Description This four-day instructor-led course provides students with the knowledge and skills to capitalize on their skills

More information

IBM DB2 XML support. How to Configure the IBM DB2 Support in oxygen

IBM DB2 XML support. How to Configure the IBM DB2 Support in oxygen Table of Contents IBM DB2 XML support About this Tutorial... 1 How to Configure the IBM DB2 Support in oxygen... 1 Database Explorer View... 3 Table Explorer View... 5 Editing XML Content of the XMLType

More information

High-Volume Data Warehousing in Centerprise. Product Datasheet

High-Volume Data Warehousing in Centerprise. Product Datasheet High-Volume Data Warehousing in Centerprise Product Datasheet Table of Contents Overview 3 Data Complexity 3 Data Quality 3 Speed and Scalability 3 Centerprise Data Warehouse Features 4 ETL in a Unified

More information

Microsoft SharePoint and Records Management Compliance

Microsoft SharePoint and Records Management Compliance Microsoft SharePoint and Records Management Compliance White Paper Revision: 2 Date created: 20 February 2015 Principal author: Nigel Carruthers-Taylor, Principal, icognition Reference: 15/678 Summary

More information

KMx Enterprise: Integration Overview for Member Account Synchronization and Single Signon

KMx Enterprise: Integration Overview for Member Account Synchronization and Single Signon KMx Enterprise: Integration Overview for Member Account Synchronization and Single Signon KMx Enterprise includes two api s for integrating user accounts with an external directory of employee or other

More information

The Next Frontier. for Records Managers. Retention and Disposition of Structured Data:

The Next Frontier. for Records Managers. Retention and Disposition of Structured Data: Retention and Disposition of Structured Data: The Next Frontier for Records Managers Establishing a relationship with IT managers and learning about the basics of system technology will help the records

More information

XML Processing and Web Services. Chapter 17

XML Processing and Web Services. Chapter 17 XML Processing and Web Services Chapter 17 Textbook to be published by Pearson Ed 2015 in early Pearson 2014 Fundamentals of http://www.funwebdev.com Web Development Objectives 1 XML Overview 2 XML Processing

More information

Oracle Database 11g SQL

Oracle Database 11g SQL AO3 - Version: 2 19 June 2016 Oracle Database 11g SQL Oracle Database 11g SQL AO3 - Version: 2 3 days Course Description: This course provides the essential SQL skills that allow developers to write queries

More information

Search help. More on Office.com: images templates

Search help. More on Office.com: images templates Page 1 of 14 Access 2010 Home > Access 2010 Help and How-to > Getting started Search help More on Office.com: images templates Access 2010: database tasks Here are some basic database tasks that you can

More information

ibolt V3.2 Release Notes

ibolt V3.2 Release Notes ibolt V3.2 Release Notes Welcome to ibolt V3.2, which has been designed to deliver an easy-touse, flexible, and cost-effective business integration solution. This document highlights the new and enhanced

More information

Legacy Data Migration: DIY Might Leave You DOA

Legacy Data Migration: DIY Might Leave You DOA Legacy Data Migration: DIY Might Leave You DOA By Wayne Lashley, Chief Business Development Officer White Paper 1 In any application migration/renewal project, data migration is 4. Capture of all source

More information

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data INFO 1500 Introduction to IT Fundamentals 5. Database Systems and Managing Data Resources Learning Objectives 1. Describe how the problems of managing data resources in a traditional file environment are

More information

Working With Templates in Web Publisher. Contributed by Paul O Mahony Developer Program

Working With Templates in Web Publisher. Contributed by Paul O Mahony Developer Program Working With Templates in Web Publisher Contributed by Paul O Mahony Developer Program Overview... 3 Template Options... 3 Web Publisher Editor Templates... 3 Advanced Content Editor... 3 ewebeditpro +

More information

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya Chapter 6 Basics of Data Integration Fundamentals of Business Analytics Learning Objectives and Learning Outcomes Learning Objectives 1. Concepts of data integration 2. Needs and advantages of using data

More information

CHAPTER 6 DATABASE MANAGEMENT SYSTEMS. Learning Objectives

CHAPTER 6 DATABASE MANAGEMENT SYSTEMS. Learning Objectives CHAPTER 6 DATABASE MANAGEMENT SYSTEMS Management Information Systems, 10 th edition, By Raymond McLeod, Jr. and George P. Schell 2007, Prentice Hall, Inc. 1 Learning Objectives Understand the hierarchy

More information

A Framework for Data Migration between Various Types of Relational Database Management Systems

A Framework for Data Migration between Various Types of Relational Database Management Systems A Framework for Data Migration between Various Types of Relational Database Management Systems Ahlam Mohammad Al Balushi Sultanate of Oman, International Maritime College Oman ABSTRACT Data Migration is

More information

General principles and architecture of Adlib and Adlib API. Petra Otten Manager Customer Support

General principles and architecture of Adlib and Adlib API. Petra Otten Manager Customer Support General principles and architecture of Adlib and Adlib API Petra Otten Manager Customer Support Adlib Database management program, mainly for libraries, museums and archives 1600 customers in app. 30 countries

More information

Technologies for a CERIF XML based CRIS

Technologies for a CERIF XML based CRIS Technologies for a CERIF XML based CRIS Stefan Bärisch GESIS-IZ, Bonn, Germany Abstract The use of XML as a primary storage format as opposed to data exchange raises a number of questions regarding the

More information

Developing and Implementing Web Applications with Microsoft Visual C#.NET and Microsoft Visual Studio.NET

Developing and Implementing Web Applications with Microsoft Visual C#.NET and Microsoft Visual Studio.NET Unit 39: Developing and Implementing Web Applications with Microsoft Visual C#.NET and Microsoft Visual Studio.NET Learning Outcomes A candidate following a programme of learning leading to this unit will

More information

AHDS Digital Preservation Glossary

AHDS Digital Preservation Glossary AHDS Digital Preservation Glossary Final version prepared by Raivo Ruusalepp Estonian Business Archives, Ltd. January 2003 Table of Contents 1. INTRODUCTION...1 2. PROVENANCE AND FORMAT...1 3. SCOPE AND

More information

Top 10 Oracle SQL Developer Tips and Tricks

Top 10 Oracle SQL Developer Tips and Tricks Top 10 Oracle SQL Developer Tips and Tricks December 17, 2013 Marc Sewtz Senior Software Development Manager Oracle Application Express Oracle America Inc., New York, NY The following is intended to outline

More information

Executive Summary WHO SHOULD READ THIS PAPER?

Executive Summary WHO SHOULD READ THIS PAPER? The Business Value of Business Intelligence in SharePoint 2010 Executive Summary SharePoint 2010 is The Business Collaboration Platform for the Enterprise & the Web that enables you to connect & empower

More information

SQL Server Master Data Services A Point of View

SQL Server Master Data Services A Point of View SQL Server Master Data Services A Point of View SUBRAHMANYA V SENIOR CONSULTANT SUBRAHMANYA.VENKATAGIRI@WIPRO.COM Abstract Is Microsoft s Master Data Services an answer for low cost MDM solution? Will

More information

An Oracle White Paper June 2013. Migrating Applications and Databases with Oracle Database 12c

An Oracle White Paper June 2013. Migrating Applications and Databases with Oracle Database 12c An Oracle White Paper June 2013 Migrating Applications and Databases with Oracle Database 12c Disclaimer The following is intended to outline our general product direction. It is intended for information

More information

DELAWARE PUBLIC ARCHIVES POLICY STATEMENT AND GUIDELINES MODEL GUIDELINES FOR ELECTRONIC RECORDS

DELAWARE PUBLIC ARCHIVES POLICY STATEMENT AND GUIDELINES MODEL GUIDELINES FOR ELECTRONIC RECORDS DELAWARE PUBLIC ARCHIVES POLICY STATEMENT AND GUIDELINES MODEL GUIDELINES FOR ELECTRONIC RECORDS STATEMENT OF PURPOSE The Delaware Public Archives (DPA) has issued "Model Guidelines for Electronic Records"

More information

History of Database Systems

History of Database Systems History of Database Systems By Kaushalya Dharmarathna(030087) Sandun Weerasinghe(040417) Early Manual System Before-1950s Data was stored as paper records. Lot of man power involved. Lot of time was wasted.

More information

CHAPTER 5: BUSINESS ANALYTICS

CHAPTER 5: BUSINESS ANALYTICS Chapter 5: Business Analytics CHAPTER 5: BUSINESS ANALYTICS Objectives The objectives are: Describe Business Analytics. Explain the terminology associated with Business Analytics. Describe the data warehouse

More information

The Power of Analysis Framework

The Power of Analysis Framework All too often, users must create real-time planning and analysis reports with static and inconsistent sources of information. Data is locked in an Excel spreadsheet or a rigidly customized application

More information

Content Management Systems: Drupal Vs Jahia

Content Management Systems: Drupal Vs Jahia Content Management Systems: Drupal Vs Jahia Mrudula Talloju Department of Computing and Information Sciences Kansas State University Manhattan, KS 66502. mrudula@ksu.edu Abstract Content Management Systems

More information

Structure in documents: an introduction

Structure in documents: an introduction Structure in documents: an introduction Structure in documents: an introduction Being an introduction to the use of databases and markup languages to help the designer make electronic and paper documents

More information

Sisense. Product Highlights. www.sisense.com

Sisense. Product Highlights. www.sisense.com Sisense Product Highlights Introduction Sisense is a business intelligence solution that simplifies analytics for complex data by offering an end-to-end platform that lets users easily prepare and analyze

More information

Introduction to Databases

Introduction to Databases Page 1 of 5 Introduction to Databases An introductory example What is a database? Why do we need Database Management Systems? The three levels of data abstraction What is a Database Management System?

More information

Advantages of XML as a data model for a CRIS

Advantages of XML as a data model for a CRIS Advantages of XML as a data model for a CRIS Patrick Lay, Stefan Bärisch GESIS-IZ, Bonn, Germany Summary In this paper, we present advantages of using a hierarchical, XML 1 -based data model as the basis

More information