Sustainable Digital Information. Corporate Memory

Size: px
Start display at page:

Download "Sustainable Digital Information. Corporate Memory"

Transcription

1 Sustainable Digital Information Corporate Memory 10 September 2009

2 Family video/pictures next generation 2

3 LongRec DATA = DIGITAL ACCESS THROUGH AEONS 3+ year project, research and case studies - DNV R&I lead, 10 partners - Start October 2006, end Overall budget 27,6 MNOK, Norwegian Research Council grant 9.2 MNOK - 3 PhD theses in work 3

4 LongRec DATA = Digital Access Through Aeons Rea d Trust Find Understand 4

5 Project partners Nasjonalbiblioteket Norsk Regnesentral Utenriksdep. Riksarkivet InterPARES 3: Brønnøysundregistrene ICRI (Interdisciplinary Centre for Law and ICT), Katholieke Universiteit Leuven 5

6 Work Packages READ TRUST FIND UNDERSTAND COMPLIANCE 2 topics across packages 6

7 Information / Data Volume Trends READ Volume Explosion 2 stacks of books from Earth to Pluto Earliest written record Gutenberg London Library ( books) Internet Explosion 12 stacks of books from Earth to Sun 0.01% is stored on paper 3000 BC

8 Trends READ Storage shortage Peta Bytes info storage : Data created will be three times amount of available storage. Lots of data will be for immediate consumption only 8

9 Different medium over years If it s not digital, it s not accessible 9

10 National Library of Norway Current state of digitalization: 5% Total volume when today s collections are digitalized ( 2018) Estimated total volume: 37 Petabyte Estimated number of files: Percentage of completed digitalization 20 % 23,2% Hardware Support: 3 (4) years only!! 15 % 10 % 5 % 0 % 6.0% 2.2% 0.1% text images sound video 10

11 Migration Calculator Migration Calculator 6 parameters: - Size of digital objects - read/write bandwidth - read/write access time - file processing speed - network transfer bandwidth - the number of replicas. Two basic models: - Basic Migration - Migration with processing Two extensions: - Replication and verification 11

12 Time (s) Findings migration pilot doing verification by another CPU, the migration time will not be increased ,0 8000,0 6000,0 4000,0 2000,0 0, # folder MB-V-2CPU MB MB-V-1CPU 12

13 time (s) Findings migration pilot When doing migration with processing, the migration time will be decreased by running multiple processes the number of processes 13

14 Further work within READ (PhD) 1. Very large amounts of data in preservation systems (Calculator) 2. Lack of a comprehensive migration framework (Framework) 3. What metadata should be preserved for migration? (Metadata) 14 14

15 Information / Data Volume Trends within Trust More and more trusted Documents are getting digitized 3000 BC Time 15

16 Trusted Information life cycle 1) TRUSTED RECORD MANAGEMEMT The authenticity, integrity and completeness of records 2) TRUSTED DIGITAL REPOSITORY The sustained integrity of records after acquisition Reliable storage Continued usability and readability Controlled access 3) TRUSTED TRANSFER The sustained authenticity of records after migration, conversion and reformatting (verification challenges) Digital Repository 16

17 Record Management and trusted transfer 17

18 Digital Repository 18

19 Further work (PhD) General Trust model Guidelines to secure trust within a repository Checklist to secure trust within a repository Critical metadata to secure trust within a repository 19

20 Information / Data Volume Trends FIND Volume Explosion 90% of all data is unstructured (pictures, video, s, blogs, ) - no data model, no meta data 70% of all data belongs to individuals and are de-centralized stored - Video, Photos, web pages, ect Massive growth in multimedia information, less in textual information 2 stacks of books from Earth to Pluto Earliest written record Gutenberg London Library ( books) Internet Explosion 12 stacks of books from Earth to Sun 0.01% is stored on paper 3000 BC

21 Overview of the ECDL paper Problem: Due to decentralized nature and the lack of standards for date/time, it is difficult to find accurate and trustworthy timestamp for web documents. For a given document with uncertain timestamp, can the contents be used to determine the timestamp with a sufficiently high confidence? Let s me see This document is probably originated in 850 A.C. with 95% confidence. I found a bible-like document. But I have no idea when it was created? You should ask Guru! 21

22 Using Temporal Language Models for Document Dating Previous Work Temporal Language Models A non-time stamped document Partition Word 1999 tsunami tsunami Thailand 1999 Japan 1999 tidal wave 2004 tsunami 2004 Thailand Similarity Scores Score(1999) = earthquake Score(2004) = = 2 Most likely timestamp is Nattiya Kanhabua and Kjetil Nørvåg (Norwegian University of Science and Technology) 1/2

23 Find - Time traveler 23

24 FIND - Research questions The main research question: How to improve the quality of search in a document archive using temporal information? Q1. How to handle large number of documents retrieved? Q2. How to search with awareness of language changes? Q3. How to rank search results wrt. temporal information? 24

25 he impact of communication on language change High: many external contacts heterogeneous participants frequent exchange English Intensity of communication Low : Icelandic few external contacts homogeneous participants infrequent exchange Low: slow adoption of foreign words conservative and predictable orthographic system Speed of language change 25 High: high turn-over of words flexible, volatile orthographic system

26 The impact of globalization on changing information needs High: many international subcontractors complex, heterogenous customer base internationalized technological development DNV 2009 Low : Degree of globalized business limited dependency on business partners no transnational, few local business partners homogenous and stable customer needs Low: slow development of new knowledge stable, predictable information needs A local carpenter producing furniture in Copenhagen, 1930 Degree of changes in information 26 High: high change rate of knowledge volatile information needs

27 Timeline information changes Time zoom scroll-bar Primary Records Secondary data Decision support 27

28 How can semantics add value to information management? Maintenance of Metadata over time Maintenance of Master data over time Verification that your information structure is good over time Automatic maintenance like with Retention Etc. 28

29 Further work UNDERSTAND Information Governance regime Master data Search vs semantic technology supplement or overlap Assessing an organization's maturity in preserving semantic value of information assets More on Evolution 29

30 Increased laws and regulations Compliance Trends EUs 8 directive /EuroSox CDBA Enron MiFiD Solvency II Data Protection Act Norsk SOX Basel II HIPPA arkivlov GLBA FISMA FOIA

31 Input in 3 areas Toolbox - Maturity model related to compliance issues Retention - automatically Compliance tool the dream 31

32 Compliance Toolbox: Maturity related to Compliance 1. Corporate governance 2. Information governance 3. Information maturity 32

33 Toolbox 1: Managing Risks of Corporate Governance Corporate Governance Compliance Risks 33

34 Toolbox 2: 10 steps Information Governance 1. Decide Vision 2. Ensure corporate Management involvement 3. Create policies 4. Design guidelines and best practices 5. Define responsibilities 6. Assign IM Professionals 7. Identify the important information objects 8. Create an information architecture 9. Define required IM services incl SLA 10. Link the Metrics to companies goals and then monitor it VISION HOW WHAT 34

35 Toolbox 3 Information Maturity CMMI EIM Maturity Model (GARTNER) IBM Data Governance Council Maturity Model Aiim 35

36 Compliance retention/automatically Frame conditions: 1. Retention policy 2. Virtual organization 3. Decide which level you need retention - Information Category - Information type etc. RM Application: 1. An event trigger an retention application - Platform is disused - New or changed law - Updated retention policy etc 2. Which of the retention rules in the rule repository to apply - Dependent on: - Type of event - Type of object - Type of information object 3. The RM Application generates a set of information objects that need to be: - reviewed for retention workflow/information owner - updated retention date - or automatically deleted 36

37 Workflow Engine Basic Scenario 1: Managing retention for a given event DNVs Retention Manager 1. Submit trigging event 2. Make query & select documents 3. Select rules & calculate actions 4. Review, monitor, decide 5. Take action Retention Ontology Thing Engine for automatic deletion or update of metadata Interface for submitting events ProductionObject Platform Employee EventType Rule Condition DocType RuleContent NormalFlow OfTime EventType RIM DisuseOf Platform User Document Owner Process Contract TerminationOfE mployment Type Interface for reviewing retention candidates Interface for manual deletion or update of metadata Meridio Rule Repository Disuse platform Oseberg Employee Smith leaves the company Identify all documents related to Oseberg Identify all documents related to Smith Calculate all rules related to disuse of a platform AND the types of documents related to Oseberg Calculate all rules related to termination of employment and types of documents related to Smith 37 Find RIM for Oseberg Find RIM HR Review Oseberg documents Delete Osebergreports, but keep contract Set deletion date for Smithdocuments to Jan 1, 2019

38 Compliance Tool? Manual handling Performed by a tool External Law changes (region) Industry law changes Specific data Eks personal data DNV Broker ( lobing ) Tool? Relevant? JA Which internal Policy are involved? Nei Continue to Keep me updated Internal Review With feedback Decision by responsible Incorporate into the business Local storage? Retention time Information category/-type Information owner 38

39 Findings across work packages Cost Factors Commercialization 39

40 General cost factors in Digital preservation - Calculator OAIS Monitoring costs Security control costs Staff costs Maintenanc e costs Infrastructu re costs Standards costs Technology costs General cost factors Training costs Quality assurance costs Organisatio n costs Retention costs Selection costs Access costs 40

41 Development Path DNV 3rd party information management services DNV Trusted Online Service Information risk Management Digital Safe Information Maturity Assessment Service

42 Digital Safe Main functionality Long-term storage - Migration, conversion etc Compliance - Retention do automatically etc Digital Safe Trust - Fingerprints 3rd party role 42

43 Sleep like a baby - Assists you to secure sustainable records 43

44 Safeguarding life, property and the environment 44

LongRec All rights reserved. This publication or parts thereof may not be reproduced or transmitted in any form or by any means, including

LongRec All rights reserved. This publication or parts thereof may not be reproduced or transmitted in any form or by any means, including LongRec All rights reserved. This publication or parts thereof may not be reproduced or transmitted in any form or by any means, including photocopying or recording, without reference to the source. Slide

More information

Archiving Systems. Uwe M. Borghoff Universität der Bundeswehr München Fakultät für Informatik Institut für Softwaretechnologie. uwe.borghoff@unibw.

Archiving Systems. Uwe M. Borghoff Universität der Bundeswehr München Fakultät für Informatik Institut für Softwaretechnologie. uwe.borghoff@unibw. Archiving Systems Uwe M. Borghoff Universität der Bundeswehr München Fakultät für Informatik Institut für Softwaretechnologie uwe.borghoff@unibw.de Decision Process Reference Models Technologies Use Cases

More information

Auto-Classification for Document Archiving and Records Declaration

Auto-Classification for Document Archiving and Records Declaration Auto-Classification for Document Archiving and Records Declaration Josemina Magdalen, Architect, IBM November 15, 2013 Agenda IBM / ECM/ Content Classification for Document Archiving and Records Management

More information

Assessment of RLG Trusted Digital Repository Requirements

Assessment of RLG Trusted Digital Repository Requirements Assessment of RLG Trusted Digital Repository Requirements Reagan W. Moore San Diego Supercomputer Center 9500 Gilman Drive La Jolla, CA 92093-0505 01 858 534 5073 moore@sdsc.edu ABSTRACT The RLG/NARA trusted

More information

The Way to SOA Concept, Architectural Components and Organization

The Way to SOA Concept, Architectural Components and Organization The Way to SOA Concept, Architectural Components and Organization Eric Scholz Director Product Management Software AG Seite 1 Goals of business and IT Business Goals Increase business agility Support new

More information

Certified Information Professional 2016 Update Outline

Certified Information Professional 2016 Update Outline Certified Information Professional 2016 Update Outline Introduction The 2016 revision to the Certified Information Professional certification helps IT and information professionals demonstrate their ability

More information

Interagency Science Working Group. National Archives and Records Administration

Interagency Science Working Group. National Archives and Records Administration Interagency Science Working Group 1 National Archives and Records Administration Establishing Trustworthy Digital Repositories: A Discussion Guide Based on the ISO Open Archival Information System (OAIS)

More information

Digital preservation a European perspective

Digital preservation a European perspective Digital preservation a European perspective Pat Manson Head of Unit European Commission DG Information Society and Media Cultural Heritage and Technology Enhanced Learning Outline The digital preservation

More information

Summary Table of Contents

Summary Table of Contents Summary Table of Contents Preface VII For whom is this book intended? What is its topical scope? Summary of its organization. Suggestions how to read it. Part I: Why We Need Long-term Digital Preservation

More information

Cloud Service Contracts: An Issue of Trust

Cloud Service Contracts: An Issue of Trust Cloud Service Contracts: An Issue of Trust Marie Demoulin Assistant Professor Université de Montréal École de Bibliothéconomie et des Sciences de l Information (EBSI) itrust 2d International Symposium,

More information

Module 8 Digital Libraries and Open Access

Module 8 Digital Libraries and Open Access Module 8 Digital Libraries and Open Access Lesson 2 How is a Digital Library Built? UNESCO EIPICT MODULE 8. LESSON 2 1 Why is there a need for a Digital Library? Digital libraries o Widen access to valuable

More information

Hadoop & its Usage at Facebook

Hadoop & its Usage at Facebook Hadoop & its Usage at Facebook Dhruba Borthakur Project Lead, Hadoop Distributed File System dhruba@apache.org Presented at the Storage Developer Conference, Santa Clara September 15, 2009 Outline Introduction

More information

Digital Preservation: the need for an open source digital archival and preservation system for small to medium sized collections,

Digital Preservation: the need for an open source digital archival and preservation system for small to medium sized collections, Digital Preservation: the need for an open source digital archival and preservation system for small to medium sized collections, Kevin Bradley ABSTRACT: Though the solution to all of the problems of digital

More information

Cloud Archive & Long Term Preservation Challenges and Best Practices

Cloud Archive & Long Term Preservation Challenges and Best Practices Cloud Archive & Long Term Preservation Challenges and Best Practices Chad Thibodeau, Cleversafe, Inc. Sebastian Zangaro, HP Author: Chad Thibodeau, Cleversafe, Inc. Author: Sebastian Zangaro, HP SNIA Legal

More information

SQL Server Master Data Services A Point of View

SQL Server Master Data Services A Point of View SQL Server Master Data Services A Point of View SUBRAHMANYA V SENIOR CONSULTANT SUBRAHMANYA.VENKATAGIRI@WIPRO.COM Abstract Is Microsoft s Master Data Services an answer for low cost MDM solution? Will

More information

Archive and Preservation in the Cloud - Business Case, Challenges and Best Practices. Chad Thibodeau, Cleversafe, Inc. Sebastian Zangaro, HP

Archive and Preservation in the Cloud - Business Case, Challenges and Best Practices. Chad Thibodeau, Cleversafe, Inc. Sebastian Zangaro, HP Archive and Preservation in the Cloud - Business Case, Challenges and Best Chad Thibodeau, Cleversafe, Inc. Sebastian Zangaro, HP SNIA Legal Notice The material contained in this tutorial is copyrighted

More information

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007 Data Management in an International Data Grid Project Timur Chabuk 04/09/2007 Intro LHC opened in 2005 several Petabytes of data per year data created at CERN distributed to Regional Centers all over the

More information

ENTERPRISE DOCUMENTS & RECORD MANAGEMENT

ENTERPRISE DOCUMENTS & RECORD MANAGEMENT ENTERPRISE DOCUMENTS & RECORD MANAGEMENT DOCWAY PLATFORM ENTERPRISE DOCUMENTS & RECORD MANAGEMENT 1 DAL SITO WEB OLD XML DOCWAY DETAIL DOCWAY Platform, based on ExtraWay Technology Native XML Database,

More information

ARCHIVING FOR DATA PROTECTION IN THE MODERN DATA CENTER. Tony Walker, Dell, Inc. Molly Rector, Spectra Logic

ARCHIVING FOR DATA PROTECTION IN THE MODERN DATA CENTER. Tony Walker, Dell, Inc. Molly Rector, Spectra Logic ARCHIVING FOR DATA PROTECTION IN THE MODERN DATA CENTER Tony Walker, Dell, Inc. Molly Rector, Spectra Logic SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA unless otherwise

More information

CORNWELL Consultants in Management and IT

CORNWELL Consultants in Management and IT Costing EDRM Programmes Andy Rothwell & Richard House CORNWELL Consultants in Management and IT Aim To provide you with an overview of the cost drivers for Electronic Document & Records Management (EDRM)

More information

HPSS Best Practices. Erich Thanhardt Bill Anderson Marc Genty B

HPSS Best Practices. Erich Thanhardt Bill Anderson Marc Genty B HPSS Best Practices Erich Thanhardt Bill Anderson Marc Genty B Overview Idea is to Look Under the Hood of HPSS to help you better understand Best Practices Expose you to concepts, architecture, and tape

More information

Miguel Ortiz, Sr. Systems Engineer. Globanet

Miguel Ortiz, Sr. Systems Engineer. Globanet Miguel Ortiz, Sr. Systems Engineer Globanet Agenda Who is Globanet? Archiving Processes and Standards How Does Data Archiving Help Data Management? Data Archiving to Meet Downstream ediscovery Needs Timely

More information

E-learning and Student Management System: toward an integrated and consistent learning process

E-learning and Student Management System: toward an integrated and consistent learning process E-learning and Student Management System: toward an integrated and consistent learning process Matteo Bertazzo 1, Franca Fiumana 2 1 CINECA, Information and Knowledge Management Services Department, via

More information

Information Governance

Information Governance Information Governance & Extended Content Solutions 2013 SOUND FAMILIAR? How do we connect our information together? How do we manage multiple un-integrated repositories of documents? Our users don t know

More information

Object Storage A Dell Point of View

Object Storage A Dell Point of View Object Storage A Dell Point of View Dell Product Group 1 THIS POINT OF VIEW PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND TECHNICAL INACCURACIES. THE CONTENT IS PROVIDED

More information

Enterprise Content Management. Image from http://webbuildinginfo.com/wp-content/uploads/ecm.jpg. José Borbinha

Enterprise Content Management. Image from http://webbuildinginfo.com/wp-content/uploads/ecm.jpg. José Borbinha Enterprise Content Management Image from http://webbuildinginfo.com/wp-content/uploads/ecm.jpg José Borbinha ECM? Let us start with the help of a professional organization http://www.aiim.org http://www.aiim.org/about

More information

A Business Case for Enterprise Content Integration using Ontology-based Content Analytics

A Business Case for Enterprise Content Integration using Ontology-based Content Analytics A Business Case for Enterprise Content Integration using Ontology-based Content Analytics Edward Curry 1, Bill McDaniel 1, Dmitry Shingarev 1, Milena C. Caires 1, Mark Leyden 1, Sean O Riain 1, Karl Flannery

More information

Achieving a Step Change in Digital Preservation Capability

Achieving a Step Change in Digital Preservation Capability Essential Guide Achieving a Step Change in Digital Preservation Capability An assessment of Preservica using the Digital Preservation Capability Maturity Model (DPCMM) Executive Summary Nearly every organization

More information

AIIM & ASSUREON AN ASSUREON BRIEF

AIIM & ASSUREON AN ASSUREON BRIEF SOLUTIONBRIEF AIIM & ASSUREON AN ASSUREON BRIEF AIIM (Association for Information and Image Management) is the global community of information professionals. Their mission is to help organizations thrive

More information

Applying the OAIS standard to CCLRC s British Atmospheric Data Centre and the Atlas Petabyte Storage Service

Applying the OAIS standard to CCLRC s British Atmospheric Data Centre and the Atlas Petabyte Storage Service Applying the OAIS standard to CCLRC s British Atmospheric Centre and the Atlas Petabyte Storage Service Corney, D.R., De Vere, M., Folkes, T., Giaretta, D., Kleese van Dam, K., Lawrence, B. N., Pepler,

More information

A Best Practice Guide to Archiving Persistent Data: How archiving is a vital tool as part of a data center cost savings exercise

A Best Practice Guide to Archiving Persistent Data: How archiving is a vital tool as part of a data center cost savings exercise WHITE PAPER A Best Practice Guide to Archiving Persistent Data: How archiving is a vital tool as part of a data center cost savings exercise NOTICE This White Paper may contain proprietary information

More information

IBM Enterprise Content Management (ECM)

IBM Enterprise Content Management (ECM) IBM Enterprise Content Management (ECM) Vesna Ilic IBM ECM Tech Pre-Sales Manager SEA Region Vesna.ilic@si.ibm.com Ahmed Shanab IBM ECM Sales Manager MEEP & SEA Region ashanab@eg.ibm.com Today s Objectives

More information

INFORMATION GOVERNANCE FOR PRIVACY COMPLIANCE

INFORMATION GOVERNANCE FOR PRIVACY COMPLIANCE Access and Privacy Conference Edmonton, June 13, 2012 Rick Klumpenhouwer, MA, MAS, CIAPP-M Partner, Cenera INFORMATION GOVERNANCE FOR PRIVACY COMPLIANCE Course Objectives Understand the principles of information

More information

Database preservation toolkit:

Database preservation toolkit: Nov. 12-14, 2014, Lisbon, Portugal Database preservation toolkit: a flexible tool to normalize and give access to databases DLM Forum: Making the Information Governance Landscape in Europe José Carlos

More information

Carestream Information Management Solutions. Managing the explosion in patient information

Carestream Information Management Solutions. Managing the explosion in patient information Managing the explosion in patient information Carestream Information Management Solutions Carestream Information Management Solutions The right information in the right place at the right time from the

More information

Apache Hadoop FileSystem and its Usage in Facebook

Apache Hadoop FileSystem and its Usage in Facebook Apache Hadoop FileSystem and its Usage in Facebook Dhruba Borthakur Project Lead, Apache Hadoop Distributed File System dhruba@apache.org Presented at Indian Institute of Technology November, 2010 http://www.facebook.com/hadoopfs

More information

Digital Libraries and Content Management

Digital Libraries and Content Management Digital Libraries and Content Management Database Research Group, University of Rostock 4th European IBM Content Manager and Media Workshop, September 2002, Essen 0. Overview 1. Content Management Systems

More information

Capacity Plan. Template. Version X.x October 11, 2012

Capacity Plan. Template. Version X.x October 11, 2012 Template Version X.x October 11, 2012 This is an integral part of infrastructure and deployment planning. It supports the goal of optimum provisioning of resources and services by aligning them to business

More information

The Department for Business, Innovation and Skills IMA Action Plan PRIORITY RECOMMENDATIONS

The Department for Business, Innovation and Skills IMA Action Plan PRIORITY RECOMMENDATIONS PRIORITY RECOMMENDATIONS R1 BIS to elevate the profile of information risk in support of KIM strategy aims for the protection, management and exploitation of information. This would be supported by: Establishing

More information

Data Governance Best Practice

Data Governance Best Practice Data Governance Best Practice Business Connexion Michelle Grimley Senior Manager EIM +27 (0)11 266 6499 Michelle.Grimley@bcx.co.za Inri Möller Master Data Manager +27 (0)11 266 5146 Inri.Möller@bcx.co.za

More information

State of Florida ELECTRONIC RECORDKEEPING STRATEGIC PLAN. January 2010 December 2012 DECEMBER 31, 2009

State of Florida ELECTRONIC RECORDKEEPING STRATEGIC PLAN. January 2010 December 2012 DECEMBER 31, 2009 State of Florida ELECTRONIC RECORDKEEPING STRATEGIC PLAN January 2010 December 2012 DECEMBER 31, 2009 Florida Department of State State Library and Archives of Florida 850.245.6750 http://dlis.dos.state.fl.us/recordsmanagers

More information

Object Storage A Fresh Approach to Long-Term File Storage

Object Storage A Fresh Approach to Long-Term File Storage Object Storage A Fresh Approach to Long-Term File Storage A Dell Technical White Paper Dell Product Group 1 THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND

More information

What We ll Cover. Defensible Disposal of Records and Information Litigation Holds Information Governance the future of records management programs

What We ll Cover. Defensible Disposal of Records and Information Litigation Holds Information Governance the future of records management programs What We ll Cover Foundations of Records and Information Management Creating a Defensible Retention Schedule Paper v. Electronic Records Organization and Retrieval of Records and Information Records Management

More information

in the Cloud - What To Do and What Not To Do Chad Thibodeau / Cleversafe Sebastian Zangaro / HP

in the Cloud - What To Do and What Not To Do Chad Thibodeau / Cleversafe Sebastian Zangaro / HP Digital PRESENTATION Data Archive TITLE and GOES Preservation HERE in the Cloud - What To Do and What Not To Do Chad Thibodeau / Cleversafe Sebastian Zangaro / HP SNIA Legal Notice The material contained

More information

Diagram 1: Islands of storage across a digital broadcast workflow

Diagram 1: Islands of storage across a digital broadcast workflow XOR MEDIA CLOUD AQUA Big Data and Traditional Storage The era of big data imposes new challenges on the storage technology industry. As companies accumulate massive amounts of data from video, sound, database,

More information

EUDAT. Towards a pan-european Collaborative Data Infrastructure

EUDAT. Towards a pan-european Collaborative Data Infrastructure EUDAT Towards a pan-european Collaborative Data Infrastructure Damien Lecarpentier CSC-IT Center for Science, Finland EISCAT User Meeting, Uppsala,6 May 2013 2 Exponential growth Data trends Zettabytes

More information

Nevada 2013. October 3, 2013 8:00 5:00

Nevada 2013. October 3, 2013 8:00 5:00 Nevada 2013 E-Records Forum October 3, 2013 8:00 5:00 The E-Records Forum brings stakeholders together from various governmental entities to discuss shared interests and concerns about the creation, management,

More information

Certified Information Professional (CIP) Certification Maintenance Form http://www.aiim.org/certification

Certified Information Professional (CIP) Certification Maintenance Form http://www.aiim.org/certification Certified Information Professional (CIP) Certification Maintenance Form http://www.aiim.org/certification Name: Title: Company: Address: City: State/Province: ZIP/Postal Code: Country: Email Address: Telephone:

More information

What happens when Big Data and Master Data come together?

What happens when Big Data and Master Data come together? What happens when Big Data and Master Data come together? Jeremy Pritchard Master Data Management fgdd 1 What is Master Data? Master data is data that is shared by multiple computer systems. The Information

More information

Demographics QUESTIONS COMMENTS

Demographics QUESTIONS COMMENTS DOI SURVEY Name: Bureau: Department: Location: Telephone: Email: Date of Interview: Defining Requirements for an Electronic Records Management Solution A series of fact finding questions will be presented

More information

Information Management

Information Management G i Information Management Information Management Planning March 2005 Produced by Information Management Branch Open Government Service Alberta 3 rd Floor, Commerce Place 10155 102 Street Edmonton, Alberta,

More information

Enterprise Content Management with Microsoft SharePoint

Enterprise Content Management with Microsoft SharePoint Enterprise Content Management with Microsoft SharePoint Overview of ECM Services and Features in Microsoft Office SharePoint Server 2007 and Windows SharePoint Services 3.0. A KnowledgeLake, Inc. White

More information

EMC arhiviranje. Lilijana Pelko Primož Golob. Sarajevo, 16.10.2008. Copyright 2008 EMC Corporation. All rights reserved.

EMC arhiviranje. Lilijana Pelko Primož Golob. Sarajevo, 16.10.2008. Copyright 2008 EMC Corporation. All rights reserved. EMC arhiviranje Lilijana Pelko Primož Golob Sarajevo, 16.10.2008 1 Agenda EMC Today Reasons to archive EMC Centera EMC EmailXtender EMC DiskXtender Use cases 2 EMC Strategic Acquisitions: Strengthen and

More information

Introduction. 1. Name of your organisation: 2. Country (of your organisation): Page 2

Introduction. 1. Name of your organisation: 2. Country (of your organisation): Page 2 Introduction 1. Name of your organisation: 2. Country (of your organisation): 6 Page 2 Policies and Procedures The following questions address the policies and procedures regarding data management (acquisition,

More information

la conception et l'exploitation d'un système électroniques

la conception et l'exploitation d'un système électroniques Philippe NEW WORK ITEM PROPOSAL SC3 MARTIN 171 1 Date of presentation Reference number 2008/07/29 (to be given by the Secretariat) Proposer ISO/TC / SC N Secretariat 170 A proposal for a new work item

More information

Hadoop & its Usage at Facebook

Hadoop & its Usage at Facebook Hadoop & its Usage at Facebook Dhruba Borthakur Project Lead, Hadoop Distributed File System dhruba@apache.org Presented at the The Israeli Association of Grid Technologies July 15, 2009 Outline Architecture

More information

Apache Hadoop FileSystem Internals

Apache Hadoop FileSystem Internals Apache Hadoop FileSystem Internals Dhruba Borthakur Project Lead, Apache Hadoop Distributed File System dhruba@apache.org Presented at Storage Developer Conference, San Jose September 22, 2010 http://www.facebook.com/hadoopfs

More information

Archiving A Dell Point of View

Archiving A Dell Point of View Archiving A Dell Point of View Dell Product Group 1 THIS POINT OF VIEW PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND TECHNICAL INACCURACIES. THE CONTENT IS PROVIDED

More information

Salesforce Certified Data Architecture and Management Designer. Study Guide. Summer 16 TRAINING & CERTIFICATION

Salesforce Certified Data Architecture and Management Designer. Study Guide. Summer 16 TRAINING & CERTIFICATION Salesforce Certified Data Architecture and Management Designer Study Guide Summer 16 Contents SECTION 1. PURPOSE OF THIS STUDY GUIDE... 2 SECTION 2. ABOUT THE SALESFORCE CERTIFIED DATA ARCHITECTURE AND

More information

XpoLog Center Suite Log Management & Analysis platform

XpoLog Center Suite Log Management & Analysis platform XpoLog Center Suite Log Management & Analysis platform Summary: 1. End to End data management collects and indexes data in any format from any machine / device in the environment. 2. Logs Monitoring -

More information

Chapter 7. Using Hadoop Cluster and MapReduce

Chapter 7. Using Hadoop Cluster and MapReduce Chapter 7 Using Hadoop Cluster and MapReduce Modeling and Prototyping of RMS for QoS Oriented Grid Page 152 7. Using Hadoop Cluster and MapReduce for Big Data Problems The size of the databases used in

More information

EII - ETL - EAI What, Why, and How!

EII - ETL - EAI What, Why, and How! IBM Software Group EII - ETL - EAI What, Why, and How! Tom Wu 巫 介 唐, wuct@tw.ibm.com Information Integrator Advocate Software Group IBM Taiwan 2005 IBM Corporation Agenda Data Integration Challenges and

More information

Archival Data Format Requirements

Archival Data Format Requirements Archival Data Format Requirements July 2004 The Royal Library, Copenhagen, Denmark The State and University Library, Århus, Denmark Main author: Steen S. Christensen The Royal Library Postbox 2149 1016

More information

DASCOSA: Database Support for Computational Science Applications. Kjetil Nørvåg Norwegian University of Science and Technology Trondheim, Norway

DASCOSA: Database Support for Computational Science Applications. Kjetil Nørvåg Norwegian University of Science and Technology Trondheim, Norway DASCOSA: Database Support for Computational Science Applications Kjetil Nørvåg Norwegian University of Science and Technology Trondheim, Norway Outline Background/context: Databases & Grids Requirements

More information

Proposal No. P16/9921 Records Management Platform

Proposal No. P16/9921 Records Management Platform Answers to Vendor Questions Questions are in black, Answers are in red 1. Please expand on the types of restrictions PCCCD is interested in. Provide what kinds of restrictions your system has the ability

More information

Reclaiming Primary Storage with Managed Server HSM

Reclaiming Primary Storage with Managed Server HSM White Paper Reclaiming Primary Storage with Managed Server HSM November, 2013 RECLAIMING PRIMARY STORAGE According to Forrester Research Inc., the total amount of data warehoused by enterprises is doubling

More information

Best Archiving Practice Guidance

Best Archiving Practice Guidance Best Archiving Practice Guidance This document has been published under the auspices of the EU Telematics Implementation Group - electronic submissions (TIGes) Please note that this document has been published

More information

GEOG 482/582 : GIS Data Management. Lesson 10: Enterprise GIS Data Management Strategies GEOG 482/582 / My Course / University of Washington

GEOG 482/582 : GIS Data Management. Lesson 10: Enterprise GIS Data Management Strategies GEOG 482/582 / My Course / University of Washington GEOG 482/582 : GIS Data Management Lesson 10: Enterprise GIS Data Management Strategies Overview Learning Objective Questions: 1. What are challenges for multi-user database environments? 2. What is Enterprise

More information

How To Manage An Electronic Discovery Project

How To Manage An Electronic Discovery Project Optim The Rise of E-Discovery Presenter: Betsy J. Walker, MBA WW Product Marketing Manager What is E-Discovery? E-Discovery (also called Discovery) refers to any process in which electronic data is sought,

More information

The Key Elements of Digital Asset Management

The Key Elements of Digital Asset Management The Key Elements of Digital Asset Management The last decade has seen an enormous growth in the amount of digital content, stored on both public and private computer systems. This content ranges from professionally

More information

Veritas Enterprise Vault.cloud for Microsoft Office 365

Veritas Enterprise Vault.cloud for Microsoft Office 365 TM Veritas Enterprise Vault.cloud for Microsoft Office 365 Assume control over your information ecosystem Benefits at a glance Satisfies email retention requirements by journaling an immutable copy of

More information

Elastic Application Platform for Market Data Real-Time Analytics. for E-Commerce

Elastic Application Platform for Market Data Real-Time Analytics. for E-Commerce Elastic Application Platform for Market Data Real-Time Analytics Can you deliver real-time pricing, on high-speed market data, for real-time critical for E-Commerce decisions? Market Data Analytics applications

More information

GeoGrid Project and Experiences with Hadoop

GeoGrid Project and Experiences with Hadoop GeoGrid Project and Experiences with Hadoop Gong Zhang and Ling Liu Distributed Data Intensive Systems Lab (DiSL) Center for Experimental Computer Systems Research (CERCS) Georgia Institute of Technology

More information

Semantic and Organisational Interoperability Issues in Public Sector in Norway

Semantic and Organisational Interoperability Issues in Public Sector in Norway Semicolon Semantic and Organisational Interoperability Issues in Public Sector in Norway Terje Grimstad, project leader, Semicolon 1 Introduction Full electronic interoperability between public and private

More information

IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures

IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF Introduction

More information

Agenda. You are not in the business to manage records

Agenda. You are not in the business to manage records Global Records and Information Management Risk: Proactive and Practical Approaches to Effective Records Management September 16, 2014 Maura Dunn, MLS, CRM Lee Karas, MBA Agenda Drivers for your Records

More information

DIGITAL PRESERVATION AT THE U.S. GOVERNMENT PRINTING OFFICE: WHITE PAPER. Version 2.0. 9 July 2008 UNITED STATES GOVERNMENT PRINTING OFFICE

DIGITAL PRESERVATION AT THE U.S. GOVERNMENT PRINTING OFFICE: WHITE PAPER. Version 2.0. 9 July 2008 UNITED STATES GOVERNMENT PRINTING OFFICE DIGITAL PRESERVATION AT THE U.S. GOVERNMENT PRINTING OFFICE: WHITE PAPER Version 2.0 9 July 2008 Record of Changes Version Description Of Change Revision Date Author Number 1.0 Baseline Document July 12,

More information

ERA Challenges. Draft Discussion Document for ACERA: 10/7/30

ERA Challenges. Draft Discussion Document for ACERA: 10/7/30 ERA Challenges Draft Discussion Document for ACERA: 10/7/30 ACERA asked for information about how NARA defines ERA completion. We have a list of functions that we would like for ERA to perform that we

More information

Electronic Records Management, Preservation, and Best Practices in Indiana Government

Electronic Records Management, Preservation, and Best Practices in Indiana Government Electronic Records Management, Preservation, and Best Practices in Indiana Government Indiana Commission on Public Records Today s Agenda Updates and Initiatives from ICPR Retention Requirements for Electronic

More information

Why archiving erecords influences the creation of erecords. Martin Stürzlinger scopepartner Vienna, Austria

Why archiving erecords influences the creation of erecords. Martin Stürzlinger scopepartner Vienna, Austria Why archiving erecords influences the creation of erecords Martin Stürzlinger scopepartner Vienna, Austria Electronic Records In a Productive System Created Used Changed Deleted In an Archival System No

More information

Semantic Exploration of Archived Product Lifecycle Metadata under Schema and Instance Evolution

Semantic Exploration of Archived Product Lifecycle Metadata under Schema and Instance Evolution Semantic Exploration of Archived Lifecycle Metadata under Schema and Instance Evolution Jörg Brunsmann Faculty of Mathematics and Computer Science, University of Hagen, D-58097 Hagen, Germany joerg.brunsmann@fernuni-hagen.de

More information

How Does the Cloud Fit into Active Archiving. Active Archive Alliance Panel

How Does the Cloud Fit into Active Archiving. Active Archive Alliance Panel How Does the Cloud Fit into Active Archiving Active Archive Alliance Panel Active Archive Alliance Founded 2010 The Active Archive Alliance is a multi-vendor association aimed at evolving new technologies

More information

www.basho.com Technical Overview Simple, Scalable, Object Storage Software

www.basho.com Technical Overview Simple, Scalable, Object Storage Software www.basho.com Technical Overview Simple, Scalable, Object Storage Software Table of Contents Table of Contents... 1 Introduction & Overview... 1 Architecture... 2 How it Works... 2 APIs and Interfaces...

More information

Knowledge as a Service for Agriculture Domain

Knowledge as a Service for Agriculture Domain Knowledge as a Service for Agriculture Domain Asanee Kawtrakul Abstract Three key issues for providing knowledge services are how to improve the access of unstructured and scattered information for the

More information

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and

More information

Data Grid Landscape And Searching

Data Grid Landscape And Searching Or What is SRB Matrix? Data Grid Automation Arun Jagatheesan et al., University of California, San Diego VLDB Workshop on Data Management in Grids Trondheim, Norway, 2-3 September 2005 SDSC Storage Resource

More information

Hitachi Content Platform. Andrej Gursky, Solutions Consultant May 2015

Hitachi Content Platform. Andrej Gursky, Solutions Consultant May 2015 Hitachi Content Platform Andrej Gursky, Solutions Consultant May 2015 What Is Object Storage? Aggregate, manage, protect and use content Just like we move photos from devices to a PC Hard to use on the

More information

Archive strategy for electronic records Peter Fæster Nielsen, Novo Nordisk 13. May 2014

Archive strategy for electronic records Peter Fæster Nielsen, Novo Nordisk 13. May 2014 Digitale arkiver - Vedligeholdelse, tilgængelighed og forskning 13. maj 2014 på Rigsarkivet Archive strategy for electronic records Peter Fæster Nielsen, Novo Nordisk 13. May 2014 Peter Fæster Nielsen,

More information

Data Mining Governance for Service Oriented Architecture

Data Mining Governance for Service Oriented Architecture Data Mining Governance for Service Oriented Architecture Ali Beklen Software Group IBM Turkey Istanbul, TURKEY alibek@tr.ibm.com Turgay Tugay Bilgin Dept. of Computer Engineering Maltepe University Istanbul,

More information

Long Term Knowledge Retention and Preservation

Long Term Knowledge Retention and Preservation Long Term Knowledge Retention and Preservation Aziz Bouras University of Lyon, DISP Laboratory France abdelaziz.bouras@univ-lyon2.fr Recent years: How should digital 3D data and multimedia information

More information

Taming Big Data Storage with Crossroads Systems StrongBox

Taming Big Data Storage with Crossroads Systems StrongBox BRAD JOHNS CONSULTING L.L.C Taming Big Data Storage with Crossroads Systems StrongBox Sponsored by Crossroads Systems 2013 Brad Johns Consulting L.L.C Table of Contents Taming Big Data Storage with Crossroads

More information

Common Operating-System Components

Common Operating-System Components Common Operating-System Components Process Management Main Memory Management File Management I/O System Management Secondary Management Protection System Oct-03 1 Process Management A process is a program

More information

Transition Guidelines: Managing legacy data and information. November 2013 v.1.0

Transition Guidelines: Managing legacy data and information. November 2013 v.1.0 Transition Guidelines: Managing legacy data and information November 2013 v.1.0 Document Control Document history Date Version No. Description Author October 2013 November 2013 0.1 Draft Department of

More information

Electronic Recordkeeping

Electronic Recordkeeping Electronic Recordkeeping 1 Agenda 1. Provide Overview of Electronic Records in Government. 2. Provide Definitions for Understanding ERK. 3. Describe Objectives of ERK. 4. Identify Critical Success Factors

More information

Investment Bank Case Study: Leveraging MarkLogic for Records Retention and Investigation

Investment Bank Case Study: Leveraging MarkLogic for Records Retention and Investigation Investment Bank Case Study: Leveraging MarkLogic for Records Retention and Investigation 2014 MarkLogic. All rights reserved. Reproduction of this white paper by any means is strictly prohibited. TABLE

More information

Record Retention and Digital Asset Management Tim Shinkle Perpetual Logic, LLC

Record Retention and Digital Asset Management Tim Shinkle Perpetual Logic, LLC Record Retention and Digital Asset Management Tim Shinkle Perpetual Logic, LLC 1 Agenda Definitions Electronic Records Management EDMS and ERM ECM Objectives Benefits Legal and Regulatory Requirements

More information

Document Management. Introduction. CAE DS Product data management, document data management systems and concurrent engineering

Document Management. Introduction. CAE DS Product data management, document data management systems and concurrent engineering Document Management Introduction Document Management aims to manage organizational information expressed in form of electronic documents. Documents in this context can be of any format text, pictures or

More information

Introduction to Cloud Computing

Introduction to Cloud Computing Introduction to Cloud Computing Cloud Computing I (intro) 15 319, spring 2010 2 nd Lecture, Jan 14 th Majd F. Sakr Lecture Motivation General overview on cloud computing What is cloud computing Services

More information

Long-term Archiving of Relational Databases with Chronos

Long-term Archiving of Relational Databases with Chronos First International Workshop on Database Preservation (PresDB'07) 23 March 2007, at the UK Digital Curation Centre and the Database Group in the School of Informatics, University of Edinburgh Long-term

More information