Database preservation toolkit:
|
|
|
- Dortha Barnett
- 10 years ago
- Views:
Transcription
1 Nov , 2014, Lisbon, Portugal Database preservation toolkit: a flexible tool to normalize and give access to databases DLM Forum: Making the Information Governance Landscape in Europe José Carlos Ramalho [email protected] KEEP SOLUTIONS // University of Minho
2 KEEP SOLUTIONS: Projects DigitArq, CRAV (2003..[ ]) RODA (2006..[2008- [) RCAAP (2008- ) PPA (2009) Open source: RODA, KOHA, DSpace, Moodle, etc. Scientific research (european projects) SCAPE: Large scale preservation 4C: the cost for curation e- ark: presented by Kuldar This work was partially supported by the SCAPE Project. The SCAPE project is co- funded by the European Union under FP7 ICT (Grant Agreement number ). 2
3 Database Preservation Toolkit Developed within RODA project Now stand-alone open-source project Imports and exports between DBMS and DB formats Supports preservation formats: DBML, SIARD
4 The Past: RODA: Repositório de Objectos Digitais Autênticos José Carlos Ramalho Miguel Ferreira Rui Castro Luis Faria Francisco Barbedo Luis Corujo Cecília Henriques
5 SCAPE What is RODA? A digital repository created for archives, with the following features: Long term preservation and Autenticity Standards based (OAIS, EAD, PREMIS, METS, etc.) Following TRAC demands (Trustworthy Repositories Audit & Certification) Security policies Service Oriented Architecture (SOA) Nice and simple design Open source DGArq and University of Minho 5
6 Context RODA ( ) Metadata management (EAD) Digital Object management (...) Digital Preservation Policies and protocols National Project (Archives National Board) CRiB: Preservation Services Digital Repositories ( ) Distributed Migration Service Migration Service supported by knowledge base phd Thesis / U. Minho (Miguel Ferreira)
7 Context RODA ( ) Metadata management (EAD) Digital Object management (...) Digital Preservation Policies and protocols National Project (Archives National Board) CRiB: Preservation Services Digital Repositories ( ) Distributed Migration Service Migration Service supported by knowledge base phd Thesis / U. Minho (Miguel Ferreira)
8 7 OAIS ISO 14721
9 Architecture 8
10 8 Architecture Being replaced
11 Data Model 9
12 Data Model 9
13 Data Model 9
14 Data Model 9
15 Data Model 9
16 Digital Object Classes 10
17 Digital Object Classes 10
18 10 Digital Object Classes Normalization
19 10 Digital Object Classes Normalization Significant properties
20 Ingest 11
21 SIP structure 12
22 12 SIP structure Object class and format
23 12 SIP structure Object class and format Place in classification plan
24 12 SIP structure Object class and format Place in classification plan Descriptive metadata
25 12 SIP structure Object class and format Place in classification plan Descriptive metadata Preservation metadata
26 12 SIP structure Object class and format Place in classification plan Descriptive metadata Preservation metadata Technical metadata
27 12 SIP structure Object class and format Place in classification plan Descriptive metadata Preservation metadata Technical metadata Representation files
28 12 SIP structure Object class and format Place in classification plan Descriptive metadata Preservation metadata Technical metadata Representation files Identification of root file
29 Databases
30 Databases How do we keep archived databases readable and usable in the long term (at acceptable cost)?
31 Databases How do we keep archived databases readable and usable in the long term (at acceptable cost)? How do we separate the data from a specific database management environment?
32 Databases How do we keep archived databases readable and usable in the long term (at acceptable cost)? How do we separate the data from a specific database management environment? How can we preserve the original data semantics and structure?
33 Databases How do we keep archived databases readable and usable in the long term (at acceptable cost)? How do we separate the data from a specific database management environment? How can we preserve the original data semantics and structure? How can we preserve authenticity and provenance of databases?
34 Databases How do we keep archived databases readable and usable in the long term (at acceptable cost)? How do we separate the data from a specific database management environment? How can we preserve the original data semantics and structure? How can we preserve authenticity and provenance of databases? How can we preserve data while it continues to evolve? (here we can split the problem in two: operational databases and frozen databases)
35 Databases How do we keep archived databases readable and usable in the long term (at acceptable cost)? How do we separate the data from a specific database management environment? How can we preserve the original data semantics and structure? How can we preserve authenticity and provenance of databases? How can we preserve data while it continues to evolve? (here we can split the problem in two: operational databases and frozen databases) How can we have efficient preservation frameworks, while retaining the ability to query different database versions?
36 Databases How do we keep archived databases readable and usable in the long term (at acceptable cost)? How do we separate the data from a specific database management environment? How can we preserve the original data semantics and structure? How can we preserve authenticity and provenance of databases? How can we preserve data while it continues to evolve? (here we can split the problem in two: operational databases and frozen databases) How can we have efficient preservation frameworks, while retaining the ability to query different database versions? How can multi-user online access be provided to hundreds of archived databases containing terabytes of data?
37 Databases How do we keep archived databases readable and usable in the long term (at acceptable cost)? How do we separate the data from a specific database management environment? How can we preserve the original data semantics and structure? How can we preserve authenticity and provenance of databases? How can we preserve data while it continues to evolve? (here we can split the problem in two: operational databases and frozen databases) How can we have efficient preservation frameworks, while retaining the ability to query different database versions? How can multi-user online access be provided to hundreds of archived databases containing terabytes of data? Can we move from a centralized model to a distributed, redundant model of database preservation?
38 Databases How do we keep archived databases readable and usable in the long term (at acceptable cost)? How do we separate the data from a specific database management environment? How can we preserve the original data semantics and structure? How can we preserve authenticity and provenance of databases? How can we preserve data while it continues to evolve? (here we can split the problem in two: operational databases and frozen databases) How can we have efficient preservation frameworks, while retaining the ability to query different database versions? How can multi-user online access be provided to hundreds of archived databases containing terabytes of data? Can we move from a centralized model to a distributed, redundant model of database preservation? General big questions...
39 Databases
40 Databases What are the salient features of a database that should be preserved? (significant properties...)
41 Databases What are the salient features of a database that should be preserved? (significant properties...) What are the different stages in the database preservation s life cycle?
42 Databases What are the salient features of a database that should be preserved? (significant properties...) What are the different stages in the database preservation s life cycle? What documentation is preserved together with a database, and in what format?
43 Databases What are the salient features of a database that should be preserved? (significant properties...) What are the different stages in the database preservation s life cycle? What documentation is preserved together with a database, and in what format? What are the legal encumbrances on database preservation?
44 Databases What are the salient features of a database that should be preserved? (significant properties...) What are the different stages in the database preservation s life cycle? What documentation is preserved together with a database, and in what format? What are the legal encumbrances on database preservation? What can be learned from traditional archival appraisal for the selection of databases for preservation?
45 Databases What are the salient features of a database that should be preserved? (significant properties...) What are the different stages in the database preservation s life cycle? What documentation is preserved together with a database, and in what format? What are the legal encumbrances on database preservation? What can be learned from traditional archival appraisal for the selection of databases for preservation? To what extent can the preservation strategies, and procedural policies developed by archivists be adapted for databases?
46 Databases What are the salient features of a database that should be preserved? (significant properties...) What are the different stages in the database preservation s life cycle? What documentation is preserved together with a database, and in what format? What are the legal encumbrances on database preservation? What can be learned from traditional archival appraisal for the selection of databases for preservation? To what extent can the preservation strategies, and procedural policies developed by archivists be adapted for databases? How can we mesure the quality of preservation strategies when they are applied to data- bases? What are DB significant properties? (quality assurance...)
47 Databases What are the salient features of a database that should be preserved? (significant properties...) What are the different stages in the database preservation s life cycle? What documentation is preserved together with a database, and in what format? What are the legal encumbrances on database preservation? What can be learned from traditional archival appraisal for the selection of databases for preservation? To what extent can the preservation strategies, and procedural policies developed by archivists be adapted for databases? How can we mesure the quality of preservation strategies when they are applied to data- bases? What are DB significant properties? (quality assurance...) Technical questions...
48 Databases: goals How do we store them? How do we access them?
49 Databases: goals How do we store them? How do we access them? RODA questions...
50 Databases
51 Databases Normal evolution path: Data => Structure => Semantics
52 Databases Normal evolution path: Data => Structure => Semantics First prototype: Data Structure Only frozen DBs
53 Databases Normal evolution path: Data? Data => Structure => Semantics First prototype: Data Structure Only frozen DBs
54 Databases Normal evolution path: Data? Data => Structure => Semantics Structure? First prototype: Data Structure Only frozen DBs
55 Databases Normal evolution path: Data? Data => Structure => Semantics Structure? Views? First prototype: Data Structure Only frozen DBs
56 Databases Normal evolution path: Data? Data => Structure => Semantics Structure? Views? Reports? First prototype: Data Structure Only frozen DBs
57 Databases Normal evolution path: Data? Data => Structure => Semantics Structure? Views? Reports? First prototype: Data Stored Procedures? Structure Only frozen DBs
58 Databases Normal evolution path: Data? Data => Structure => Semantics Structure? Views? Reports? First prototype: Data Stored Procedures?... Structure Only frozen DBs
59 HOW? The need for an intermediate representation Input Formats (M) Output Formats (N)
60 HOW? The need for an intermediate representation Input Formats (M) Output Formats (N) M*N migrators
61 IR: DBML Input Formats (M) Output Formats (N) DBML
62 IR: DBML Input Formats (M) Output Formats (N) DBML For each new output or input format you only need to code one new migrator.
63 DBML design Principles Hardware independent; Software independent; Easy to process; Descriptive; It should be possible to add metadata; It should be possible to add semantics;
64 DBML design Principles Hardware independent; Software independent; Easy to process; Descriptive; It should be possible to add metadata; It should be possible to add semantics; XML was the obvious choice
65 SIP Structure (DB example) KEEPS Employees
66 SIP Structure (DB example) Manifest KEEPS Employees
67 SIP Structure (DB example) Manifest Descriptive Metadata (EAD): - producer - colection -... DBML: - Data; - Structure; - Other metadata. KEEPS Employees
68 SIP Structure (DB example) Manifest Binaries Descriptive Metadata (EAD): - producer - colection -... DBML: - Data; - Structure; - Other metadata. KEEPS Employees
69 SIP Structure (DB example) Technical Metadata: - color - dimensions -... Manifest Descriptive Metadata (EAD): - producer - colection -... Binaries DBML: - Data; - Structure; - Other metadata. KEEPS Employees
70 SIP Structure (DB example) Technical Metadata: - color - dimensions -... Manifest Descriptive Metadata (EAD): - producer - colection -... Binaries DBML: - Data; - Structure; - Other metadata. KEEPS Employees
71 SIP Structure (DB example) Technical Metadata: - color - dimensions -... Manifest Descriptive Metadata (EAD): - producer - colection -... Binaries DBML: - Data; - Structure; - Other metadata. KEEPS Employees
72 SIP Structure (DB example) Technical Metadata: - color - dimensions -... Manifest Descriptive Metadata (EAD): - producer - colection -... Binaries DBML: - Data; - Structure; - Other metadata. KEEPS Employees
73 SIP Structure (DB example) Technical Metadata: - color - dimensions -... Manifest Descriptive Metadata (EAD): - producer - colection -... Binaries DBML: - Data; - Structure; - Other metadata. KEEPS Employees
74 SIP Structure (DB example) Technical Metadata: - color - dimensions -... Manifest Descriptive Metadata (EAD): - producer - colection -... Binaries DBML: - Data; - Structure; - Other metadata. KEEPS Employees
75 SIP Structure (DB example) Technical Metadata: - color - dimensions -... Manifest Binaries Compressed File Descriptive Metadata (EAD): - producer - colection -... DBML: - Data; - Structure; - Other metadata. KEEPS Employees
76 DBML
77 DBML: Structure
78 DBML: Data
79 Databases: RODA answers How do we store them? DBML + binaries + technical metadata How do we access them? PhpMyAdmin (hacked version)
80 Databases: RODA answers How do we store them? DBML + binaries + technical metadata How do we access them? PhpMyAdmin (hacked version) RODA answers...
81 Some problems Extracting data is easy: SELECT * FROM... Extracting the structure is not: DBMS protect this information; Each DBMS stores it differently; Different versions of the same DBMS can also act differently; We have to prepare/hack the DBMS.
82 Open Planets: Database Archiving (Copenhagen 2012) Two approaches emerged: Preserve the database and the environment allowing authentic access to the information and any accompanying applications Extract / migrate the raw data and table structure from the original database
83 Open Planets: Database Archiving (Copenhagen 2012) Two approaches emerged: Preserve the database and the environment allowing authentic access to the information and any accompanying applications Relies on emulation strategy Extract / migrate the raw data and table structure from the original database
84 Open Planets: Database Archiving (Copenhagen 2012) Two approaches emerged: Preserve the database and the environment allowing authentic access to the information and any accompanying applications Relies on emulation strategy Extract / migrate the raw data and table structure from the original database Relies on migration strategy
85 Open Planets: Database Archiving (Copenhagen 2012) Two approaches emerged: Preserve the database and the environment allowing authentic access to the information and any accompanying applications Relies on emulation strategy Extract / migrate the raw data and table structure from the original database RODA+DBML Relies on migration strategy
86 Open Planets: Database Archiving (Copenhagen 2012) Two approaches emerged: Preserve the database and the environment allowing authentic access to the information and any accompanying applications Relies on emulation strategy Extract / migrate the raw data and table structure from the original database RODA+DBML SIARD Relies on migration strategy
87 Open Planets: Database Archiving (Copenhagen 2012) Two approaches emerged: Preserve the database and the environment allowing authentic access to the information and any accompanying applications Relies on emulation strategy Extract / migrate the raw data and table structure from the original database RODA+DBML SIARD SIARD-DK Relies on migration strategy
88 Open Planets: Database Archiving (Copenhagen 2012) Two approaches emerged: Preserve the database and the environment allowing authentic access to the information and any accompanying applications Relies on emulation strategy Extract / migrate the raw data and table structure from the original database RODA+DBML SIARD SIARD-DK ADDML Relies on migration strategy
89 DBML vs SIARD DBML SIARD Data Yes (no segmentation) Yes (with segmentation) Structure Yes (XML) Yes (XSD) Stored Procedures No Yes (XML) Triggers No Yes (XML) Views No Yes (XML)
90 db-preservation-toolkit architecture
91 db-preservation-toolkit architecture ADDML
92 db-preservation-toolkit architecture ADDML ADDML
93 Pre-ingest / Production / SIP creation
94 Pre-ingest / Production / SIP creation new
95 Pre-ingest / Production / SIP creation new new
96 Pre-ingest / Production / SIP creation new new ADDML
97 Pre-ingest / Production / SIP creation new new ADDML to be implemented
98 Access
99 Access Working on RDF viewer, we don t need to stay stuck in the relational model
100 New prototype Msc thesis: Database Convert Tool: exploring SIARD OWL model development Triple Store Implementation RDF endpoint implementation SPARQL based access
101 Future plan: Fast database viewer
102 Recent work Phd thesis: Graph view dissemination, relational model reverse engineering, algorithm development (from relational to graph model) [ 1822/25655] Msc thesis: SIARD support (DBML replacement) Msc thesis: RDF representation for databases
103 Questions? José Carlos Ramalho Engineer / Researcher / Teacher [email protected] / [email protected] ARQUIVOS BIBLIOTECAS MUSEUS
Database Preservation Toolkit: a flexible tool to normalize and give access to databases
Database Preservation Toolkit: a flexible tool to normalize and give access to databases José Carlos Ramalho University of Minho [email protected] Luis Faria KEEP SOLUTIONS Lda [email protected] Miguel Coutada
European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project
European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project Janet Delve, University of Portsmouth Kuldar Aas, National Archives of Estonia Rainer Schmidt, Austrian Institute
Why archiving erecords influences the creation of erecords. Martin Stürzlinger scopepartner Vienna, Austria
Why archiving erecords influences the creation of erecords Martin Stürzlinger scopepartner Vienna, Austria Electronic Records In a Productive System Created Used Changed Deleted In an Archival System No
Digital Preservation. OAIS Reference Model
Digital Preservation OAIS Reference Model Stephan Strodl, Andreas Rauber Institut für Softwaretechnik und Interaktive Systeme TU Wien http://www.ifs.tuwien.ac.at/dp Aim OAIS model Understanding the functionality
Data Warehouses in the Path from Databases to Archives
Data Warehouses in the Path from Databases to Archives Gabriel David FEUP / INESC-Porto This position paper describes a research idea submitted for funding at the Portuguese Research Agency. Introduction
José Carlos Ramalho [email protected] [email protected]
José Carlos Ramalho [email protected] [email protected] 1 Database migration CLI José Carlos Ramalho [email protected] [email protected] 1 Intermediate Representation DBML; Hardware and Software independent; It can
2009 ikeep Ltd, Morgenstrasse 129, CH-3018 Bern, Switzerland (www.ikeep.com, [email protected])
CSP CHRONOS Compliance statement for ISO 14721:2003 (Open Archival Information System Reference Model) 2009 ikeep Ltd, Morgenstrasse 129, CH-3018 Bern, Switzerland (www.ikeep.com, [email protected]) The international
Interagency Science Working Group. National Archives and Records Administration
Interagency Science Working Group 1 National Archives and Records Administration Establishing Trustworthy Digital Repositories: A Discussion Guide Based on the ISO Open Archival Information System (OAIS)
INTEGRATING RECORDS SYSTEMS WITH DIGITAL ARCHIVES CURRENT STATUS AND WAY FORWARD
INTEGRATING RECORDS SYSTEMS WITH DIGITAL ARCHIVES CURRENT STATUS AND WAY FORWARD National Archives of Estonia Kuldar As National Archives of Sweden Karin Bredenberg University of Portsmouth Janet Delve
SEVENTH FRAMEWORK PROGRAMME THEME ICT -1-4.1 Digital libraries and technology-enhanced learning
Briefing paper: Value of software agents in digital preservation Ver 1.0 Dissemination Level: Public Lead Editor: NAE 2010-08-10 Status: Draft SEVENTH FRAMEWORK PROGRAMME THEME ICT -1-4.1 Digital libraries
DIGITAL ARCHIVES & PRESERVATION SYSTEMS
DIGITAL ARCHIVES & PRESERVATION SYSTEMS Part 1 Overview (part 1 of 7) Kari R. Smith, MIT Institute Archives Session Overview Digital archives and digital preservation systems. These open source tools are
RCAAP: Building and maintaining a national repository network
RCAAP: Building and maintaining a national repository network José Carvalho [email protected] Eloy Rodrigues [email protected] Pedro Príncipe [email protected] Ricardo Saraiva [email protected]
Semantic Exploration of Archived Product Lifecycle Metadata under Schema and Instance Evolution
Semantic Exploration of Archived Lifecycle Metadata under Schema and Instance Evolution Jörg Brunsmann Faculty of Mathematics and Computer Science, University of Hagen, D-58097 Hagen, Germany [email protected]
Long-term Archiving of Relational Databases with Chronos
First International Workshop on Database Preservation (PresDB'07) 23 March 2007, at the UK Digital Curation Centre and the Database Group in the School of Informatics, University of Edinburgh Long-term
Databases in Organizations
The following is an excerpt from a draft chapter of a new enterprise architecture text book that is currently under development entitled Enterprise Architecture: Principles and Practice by Brian Cameron
Ex Libris Rosetta: A Digital Preservation System Product Description
Ex Libris Rosetta: A Digital Preservation System Product Description CONFIDENTIAL INFORMATION The information herein is the property of Ex Libris Ltd. or its affiliates and any misuse or abuse will result
A DICOM-based Software Infrastructure for Data Archiving
A DICOM-based Software Infrastructure for Data Archiving Dhaval Dalal, Julien Jomier, and Stephen R. Aylward Computer-Aided Diagnosis and Display Lab The University of North Carolina at Chapel Hill, Department
Questionnaire on Digital Preservation in Local Authority Archive Services
Questionnaire on Digital Preservation in Local Authority Archive Services A - Digital Preservation Planning 1. Would you describe your Archive Service as: Actively seeking digital material Reacting to
bigdata Managing Scale in Ontological Systems
Managing Scale in Ontological Systems 1 This presentation offers a brief look scale in ontological (semantic) systems, tradeoffs in expressivity and data scale, and both information and systems architectural
Trends and solutions for archiving in pharmaceutical industry. Didier Coyman / Ömer Yilmaz
Trends and solutions for archiving in pharmaceutical industry. Didier Coyman / Ömer Yilmaz 2013 Waters Corporation 1 Why? Regulatory requirements Storage capacity Legal Company Ethic 2013 Waters Corporation
Achieving a Step Change in Digital Preservation Capability
Essential Guide Achieving a Step Change in Digital Preservation Capability An assessment of Preservica using the Digital Preservation Capability Maturity Model (DPCMM) Executive Summary Nearly every organization
Metadata Repositories in Health Care. Discussion Paper
Health Care and Informatics Review Online, 2008, 12(3), pp 37-44, Published online at www.hinz.org.nz ISSN 1174-3379 Metadata Repositories in Health Care Discussion Paper Dr Karolyn Kerr [email protected]
A generic approach for data integration using RDF, OWL and XML
A generic approach for data integration using RDF, OWL and XML Miguel A. Macias-Garcia, Victor J. Sosa-Sosa, and Ivan Lopez-Arevalo Laboratory of Information Technology (LTI) CINVESTAV-TAMAULIPAS Km 6
Long-term archiving and preservation planning
Long-term archiving and preservation planning Workflow in digital preservation Hilde van Wijngaarden Head, Digital Preservation Department National Library of the Netherlands The Challenge: Long-term Preservation
Technical concepts of kopal. Tobias Steinke, Deutsche Nationalbibliothek June 11, 2007, Berlin
Technical concepts of kopal Tobias Steinke, Deutsche Nationalbibliothek June 11, 2007, Berlin 1 Overview Project kopal Ideas Organisation Results Technical concepts DIAS kolibri Models of reusability 2
A Grid Architecture for Manufacturing Database System
Database Systems Journal vol. II, no. 2/2011 23 A Grid Architecture for Manufacturing Database System Laurentiu CIOVICĂ, Constantin Daniel AVRAM Economic Informatics Department, Academy of Economic Studies
A Selection of Questions from the. Stewardship of Digital Assets Workshop Questionnaire
A Selection of Questions from the Stewardship of Digital Assets Workshop Questionnaire SECTION A: Institution Information What year did your institution begin creating digital resources? What year did
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Organization of VizieR's Catalogs Archival
Organization of VizieR's Catalogs Archival Organization of VizieR's Catalogs Archival Table of Contents Foreword...2 Environment applied to VizieR archives...3 The archive... 3 The producer...3 The user...3
Information Management Advice 18 - Managing records in business systems Part 1: Checklist for decommissioning business systems
Information Management Advice 18 - Managing records in business systems Part 1: Checklist for decommissioning business systems Introduction Agencies have systems which hold business information, such as
The challenges of becoming a Trusted Digital Repository
The challenges of becoming a Trusted Digital Repository Annemieke de Jong is Preservation Officer at the Netherlands Institute for Sound and Vision (NISV) in Hilversum. She is responsible for setting out
TRANSFoRm: Vision of a learning healthcare system
TRANSFoRm: Vision of a learning healthcare system Vasa Curcin, Imperial College London Theo Arvanitis, University of Birmingham Derek Corrigan, Royal College of Surgeons Ireland TRANSFoRm is partially
Conceptualizing Policy-Driven Repository Interoperability (PoDRI) Using irods and Fedora
Conceptualizing Policy-Driven Repository Interoperability (PoDRI) Using irods and Fedora David Pcolar Carolina Digital Repository (CDR) [email protected] Alexandra Chassanoff School of Information &
Digital Assets Repository 3.0. PASIG User Group Conference Noha Adly Bibliotheca Alexandrina
Digital Assets Repository 3.0 PASIG User Group Conference Noha Adly Bibliotheca Alexandrina DAR 3.0 DAR manages the full lifecycle of a digital asset: its creation, ingestion, metadata management, storage,
Digital preservation a European perspective
Digital preservation a European perspective Pat Manson Head of Unit European Commission DG Information Society and Media Cultural Heritage and Technology Enhanced Learning Outline The digital preservation
UIMA and WebContent: Complementary Frameworks for Building Semantic Web Applications
UIMA and WebContent: Complementary Frameworks for Building Semantic Web Applications Gaël de Chalendar CEA LIST F-92265 Fontenay aux Roses [email protected] 1 Introduction The main data sources
MultiMimsy database extractions and OAI repositories at the Museum of London
MultiMimsy database extractions and OAI repositories at the Museum of London Mia Ridge Museum Systems Team Museum of London [email protected] Scope Extractions from the MultiMimsy 2000/MultiMimsy
Preservation Action: What, how and when? Hilde van Wijngaarden Head, Digital Preservation Department National Library of the Netherlands
: What, how and when? Hilde van Wijngaarden Head, Digital Preservation Department National Library of the Netherlands What is preservation action? Execution of a strategy to regain or improve access to
Preserving French Scientific data
Preserving French Scientific data Marion MASSOL (CINES) [email protected] DARIAH General VCC Meeting November 28 th, 29 th, 30 th 2012 AGENDA 1. Preserving data: our mission and strategy 4. The file
dati.culturaitalia.it a Pilot Project of CulturaItalia dedicated to Linked Open Data
dati.culturaitalia.it a Pilot Project of CulturaItalia dedicated to Linked Open Data www.culturaitalia.it Rosa Caffo, Director of Central Institute for the Union Catalogue of Italian Libraries (MiBACT)
EPrints Preservation Update
EPrints Preservation Update EPrints Preservation New EPrints Preservation Team Overseeing integration of preservation into EPrints software as well as involvement with preservation projects Preserv2 Ended
Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing
Digital Stewardship Education at the Graduate School of Library & Information Science, Simmons College
Digital Stewardship Education at the Graduate School of Library & Information Science, Simmons College Martha Mahard and Ross Harvey Graduate School of Library & Information Science Simmons College Boston,
North Carolina Digital Preservation Policy. April 2014
North Carolina Digital Preservation Policy April 2014 North Carolina Digital Preservation Policy Page 1 Executive Summary The North Carolina Digital Preservation Policy (Policy) governs the operation,
Database Preservation Case Study: Review
Database Preservation Case Study: Review Mette van Essen, Maurice de Rooij, Bill Roberts, Maurice van den Dobbelsteen National Archives of the Netherlands 12 July 2011 Introduction As part of the PLANETS
Second EUDAT Conference, October 2013 Data Management Plans and Certification Motivation: increasing importance of Data Management Planning
Second EUDAT Conference, October 2013 Data Management Plans and Certification Motivation: increasing importance of Data Management Planning Simon Lambert Scientific Computing Department STFC Rutherford
Digital Archiving Policy
Digital Archiving Policy Imprint Text: Innovation and Preservation Unit; editor: Marguérite Bos, Digital Archiving Service; translation: Sonja Bonin, www.sonjabonin.com; cover: Julien Boudaud, www.jboudaud.fr;
Adding Robust Digital Asset Management to Oracle s Storage Archive Manager (SAM)
Adding Robust Digital Asset Management to Oracle s Storage Archive Manager (SAM) Oracle's Sun Storage Archive Manager (SAM) self-protecting file system software reduces operating costs by providing data
DDI Lifecycle: Moving Forward Status of the Development of DDI 4. Joachim Wackerow Technical Committee, DDI Alliance
DDI Lifecycle: Moving Forward Status of the Development of DDI 4 Joachim Wackerow Technical Committee, DDI Alliance Should I Wait for DDI 4? No! DDI Lifecycle 4 is a long development process DDI Lifecycle
Geospatial Data Stewardship at an Interdisciplinary Data Center
Geospatial Data Stewardship at an Interdisciplinary Data Center Robert R. Downs, PhD Senior Digital Archivist and Senior Staff Associate Officer or Research Acting Head of Cyberinfrastructure and Informatics
Introduction. Chapter 1. Introducing the Database. Data vs. Information
Chapter 1 Objectives: to learn The difference between data and information What a database is, the various types of databases, and why they are valuable assets for decision making The importance of database
Chapter 5. Warehousing, Data Acquisition, Data. Visualization
Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives
D5.3.2b Automatic Rigorous Testing Components
ICT Seventh Framework Programme (ICT FP7) Grant Agreement No: 318497 Data Intensive Techniques to Boost the Real Time Performance of Global Agricultural Data Infrastructures D5.3.2b Automatic Rigorous
UniGR Workshop: Big Data «The challenge of visualizing big data»
Dept. ISC Informatics, Systems & Collaboration UniGR Workshop: Big Data «The challenge of visualizing big data» Dr Ir Benoît Otjacques Deputy Scientific Director ISC The Future is Data-based Can we help?
Cloud Service Contracts: An Issue of Trust
Cloud Service Contracts: An Issue of Trust Marie Demoulin Assistant Professor Université de Montréal École de Bibliothéconomie et des Sciences de l Information (EBSI) itrust 2d International Symposium,
BUSINESS REQUIREMENTS SPECIFICATION (BRS) Business domain: Archiving and records management. Transfer of digital records
BUSINESS REQUIREMENTS SPECIFICATION (BRS) Business domain: Archiving and records management Business process: Transfer of digital records Document identification: Title: Transfer of digital records Trade
Core Enterprise Services, SOA, and Semantic Technologies: Supporting Semantic Interoperability
Core Enterprise, SOA, and Semantic Technologies: Supporting Semantic Interoperability in a Network-Enabled Environment 2011 SOA & Semantic Technology Symposium 13-14 July 2011 Sven E. Kuehne [email protected]
GetLOD - Linked Open Data and Spatial Data Infrastructures
GetLOD - Linked Open Data and Spatial Data Infrastructures W3C Linked Open Data LOD2014 Roma, 20-21 February 2014 Stefano Pezzi, Massimo Zotti, Giovanni Ciardi, Massimo Fustini Agenda Context Geoportal
HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM. Aniket Bochare - [email protected]. CMSC 601 - Presentation
HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM Aniket Bochare - [email protected] CMSC 601 - Presentation Date-04/25/2011 AGENDA Introduction and Background Framework Heterogeneous
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database System in Energy Data Management
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database System in Energy Data Management Zhan Liu, Fabian Cretton, Anne Le Calvé, Nicole Glassey, Alexandre Cotting, Fabrice Chapuis
DIGITAL ARCHIVING DISCUSSION PAPER: INFORMING AN APPROACH TO THE LONG-TERM MANAGEMENT AND PRESERVATION OF GOVERNMENT DIGITAL RECORDS
DIGITAL ARCHIVING DISCUSSION PAPER: INFORMING AN APPROACH TO THE LONG-TERM MANAGEMENT AND PRESERVATION OF GOVERNMENT DIGITAL RECORDS May 2010 Queensland State Archives Department of Science, Information
Tibiscus University, Timişoara
PDF/A standard for long term archiving Ramona Vasilescu Tibiscus University, Timişoara ABSTRACT. PDF/A is defined by ISO 19005-1 as a file format based on PDF format. The standard provides a mechanism
Federated, Generic Configuration Management for Engineering Data
Federated, Generic Configuration Management for Engineering Data Dr. Rainer Romatka Boeing GPDIS_2013.ppt 1 Presentation Outline I Summary Introduction Configuration Management Overview CM System Requirements
Cloud and Big Data Standardisation
Cloud and Big Data Standardisation EuroCloud Symposium ICS Track: Standards for Big Data in the Cloud 15 October 2013, Luxembourg Yuri Demchenko System and Network Engineering Group, University of Amsterdam
Master s Program in Information Systems
The University of Jordan King Abdullah II School for Information Technology Department of Information Systems Master s Program in Information Systems 2006/2007 Study Plan Master Degree in Information Systems
DIGITAL PRESERVATION AT THE U.S. GOVERNMENT PRINTING OFFICE: WHITE PAPER. Version 2.0. 9 July 2008 UNITED STATES GOVERNMENT PRINTING OFFICE
DIGITAL PRESERVATION AT THE U.S. GOVERNMENT PRINTING OFFICE: WHITE PAPER Version 2.0 9 July 2008 Record of Changes Version Description Of Change Revision Date Author Number 1.0 Baseline Document July 12,
Digital Archiving at the Swiss Federal Archives (SFA)
Swiss Federal Archives SFA Dr. Krystyna W. Ohnesorge Archivstrasse 24, CH-3003 Bern Phone +41 31 32 45827, Fax +41 31 32 27823 [email protected] www.bar.admin.ch itopia ag corporate information
Semantic Interoperability
Ivan Herman Semantic Interoperability Olle Olsson Swedish W3C Office Swedish Institute of Computer Science (SICS) Stockholm Apr 27 2011 (2) Background Stockholm Apr 27, 2011 (2) Trends: from
The Visualization Pipeline
The Visualization Pipeline Conceptual perspective Implementation considerations Algorithms used in the visualization Structure of the visualization applications Contents The focus is on presenting the
Archival Data Format Requirements
Archival Data Format Requirements July 2004 The Royal Library, Copenhagen, Denmark The State and University Library, Århus, Denmark Main author: Steen S. Christensen The Royal Library Postbox 2149 1016
DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories
DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories MacKenzie Smith, Associate Director for Technology Massachusetts Institute of Technology Libraries, Cambridge,
Creating the ultimate digital insurance
By Piql AS Rev. 140625 Creating the ultimate digital insurance Behind the curtains Ensuring future access Digital preservation is about ensuring future access to digital material. When long-term access
The Ontological Approach for SIEM Data Repository
The Ontological Approach for SIEM Data Repository Igor Kotenko, Olga Polubelova, and Igor Saenko Laboratory of Computer Science Problems, Saint-Petersburg Institute for Information and Automation of Russian
Chapter 434-662 WAC PRESERVATION OF ELECTRONIC PUBLIC RECORDS
Chapter 434-662 WAC PRESERVATION OF ELECTRONIC PUBLIC RECORDS WAC 434-662-010 Purpose. Pursuant to the provisions of chapters 40.14 and 42.56 RCW, and RCW 43.105.250, the rules contained in this chapter
Best Archiving Practice Guidance
Best Archiving Practice Guidance This document has been published under the auspices of the EU Telematics Implementation Group - electronic submissions (TIGes) Please note that this document has been published
