Sustainable Digital Information. Corporate Memory
|
|
- Abel McKenzie
- 7 years ago
- Views:
Transcription
1 Sustainable Digital Information Corporate Memory 10 September 2009
2 Family video/pictures next generation 2
3 LongRec DATA = DIGITAL ACCESS THROUGH AEONS 3+ year project, research and case studies - DNV R&I lead, 10 partners - Start October 2006, end Overall budget 27,6 MNOK, Norwegian Research Council grant 9.2 MNOK - 3 PhD theses in work 3
4 LongRec DATA = Digital Access Through Aeons Rea d Trust Find Understand 4
5 Project partners Nasjonalbiblioteket Norsk Regnesentral Utenriksdep. Riksarkivet InterPARES 3: Brønnøysundregistrene ICRI (Interdisciplinary Centre for Law and ICT), Katholieke Universiteit Leuven 5
6 Work Packages READ TRUST FIND UNDERSTAND COMPLIANCE 2 topics across packages 6
7 Information / Data Volume Trends READ Volume Explosion 2 stacks of books from Earth to Pluto Earliest written record Gutenberg London Library ( books) Internet Explosion 12 stacks of books from Earth to Sun 0.01% is stored on paper 3000 BC
8 Trends READ Storage shortage Peta Bytes info storage : Data created will be three times amount of available storage. Lots of data will be for immediate consumption only 8
9 Different medium over years If it s not digital, it s not accessible 9
10 National Library of Norway Current state of digitalization: 5% Total volume when today s collections are digitalized ( 2018) Estimated total volume: 37 Petabyte Estimated number of files: Percentage of completed digitalization 20 % 23,2% Hardware Support: 3 (4) years only!! 15 % 10 % 5 % 0 % 6.0% 2.2% 0.1% text images sound video 10
11 Migration Calculator Migration Calculator 6 parameters: - Size of digital objects - read/write bandwidth - read/write access time - file processing speed - network transfer bandwidth - the number of replicas. Two basic models: - Basic Migration - Migration with processing Two extensions: - Replication and verification 11
12 Time (s) Findings migration pilot doing verification by another CPU, the migration time will not be increased ,0 8000,0 6000,0 4000,0 2000,0 0, # folder MB-V-2CPU MB MB-V-1CPU 12
13 time (s) Findings migration pilot When doing migration with processing, the migration time will be decreased by running multiple processes the number of processes 13
14 Further work within READ (PhD) 1. Very large amounts of data in preservation systems (Calculator) 2. Lack of a comprehensive migration framework (Framework) 3. What metadata should be preserved for migration? (Metadata) 14 14
15 Information / Data Volume Trends within Trust More and more trusted Documents are getting digitized 3000 BC Time 15
16 Trusted Information life cycle 1) TRUSTED RECORD MANAGEMEMT The authenticity, integrity and completeness of records 2) TRUSTED DIGITAL REPOSITORY The sustained integrity of records after acquisition Reliable storage Continued usability and readability Controlled access 3) TRUSTED TRANSFER The sustained authenticity of records after migration, conversion and reformatting (verification challenges) Digital Repository 16
17 Record Management and trusted transfer 17
18 Digital Repository 18
19 Further work (PhD) General Trust model Guidelines to secure trust within a repository Checklist to secure trust within a repository Critical metadata to secure trust within a repository 19
20 Information / Data Volume Trends FIND Volume Explosion 90% of all data is unstructured (pictures, video, s, blogs, ) - no data model, no meta data 70% of all data belongs to individuals and are de-centralized stored - Video, Photos, web pages, ect Massive growth in multimedia information, less in textual information 2 stacks of books from Earth to Pluto Earliest written record Gutenberg London Library ( books) Internet Explosion 12 stacks of books from Earth to Sun 0.01% is stored on paper 3000 BC
21 Overview of the ECDL paper Problem: Due to decentralized nature and the lack of standards for date/time, it is difficult to find accurate and trustworthy timestamp for web documents. For a given document with uncertain timestamp, can the contents be used to determine the timestamp with a sufficiently high confidence? Let s me see This document is probably originated in 850 A.C. with 95% confidence. I found a bible-like document. But I have no idea when it was created? You should ask Guru! 21
22 Using Temporal Language Models for Document Dating Previous Work Temporal Language Models A non-time stamped document Partition Word 1999 tsunami tsunami Thailand 1999 Japan 1999 tidal wave 2004 tsunami 2004 Thailand Similarity Scores Score(1999) = earthquake Score(2004) = = 2 Most likely timestamp is Nattiya Kanhabua and Kjetil Nørvåg (Norwegian University of Science and Technology) 1/2
23 Find - Time traveler 23
24 FIND - Research questions The main research question: How to improve the quality of search in a document archive using temporal information? Q1. How to handle large number of documents retrieved? Q2. How to search with awareness of language changes? Q3. How to rank search results wrt. temporal information? 24
25 he impact of communication on language change High: many external contacts heterogeneous participants frequent exchange English Intensity of communication Low : Icelandic few external contacts homogeneous participants infrequent exchange Low: slow adoption of foreign words conservative and predictable orthographic system Speed of language change 25 High: high turn-over of words flexible, volatile orthographic system
26 The impact of globalization on changing information needs High: many international subcontractors complex, heterogenous customer base internationalized technological development DNV 2009 Low : Degree of globalized business limited dependency on business partners no transnational, few local business partners homogenous and stable customer needs Low: slow development of new knowledge stable, predictable information needs A local carpenter producing furniture in Copenhagen, 1930 Degree of changes in information 26 High: high change rate of knowledge volatile information needs
27 Timeline information changes Time zoom scroll-bar Primary Records Secondary data Decision support 27
28 How can semantics add value to information management? Maintenance of Metadata over time Maintenance of Master data over time Verification that your information structure is good over time Automatic maintenance like with Retention Etc. 28
29 Further work UNDERSTAND Information Governance regime Master data Search vs semantic technology supplement or overlap Assessing an organization's maturity in preserving semantic value of information assets More on Evolution 29
30 Increased laws and regulations Compliance Trends EUs 8 directive /EuroSox CDBA Enron MiFiD Solvency II Data Protection Act Norsk SOX Basel II HIPPA arkivlov GLBA FISMA FOIA
31 Input in 3 areas Toolbox - Maturity model related to compliance issues Retention - automatically Compliance tool the dream 31
32 Compliance Toolbox: Maturity related to Compliance 1. Corporate governance 2. Information governance 3. Information maturity 32
33 Toolbox 1: Managing Risks of Corporate Governance Corporate Governance Compliance Risks 33
34 Toolbox 2: 10 steps Information Governance 1. Decide Vision 2. Ensure corporate Management involvement 3. Create policies 4. Design guidelines and best practices 5. Define responsibilities 6. Assign IM Professionals 7. Identify the important information objects 8. Create an information architecture 9. Define required IM services incl SLA 10. Link the Metrics to companies goals and then monitor it VISION HOW WHAT 34
35 Toolbox 3 Information Maturity CMMI EIM Maturity Model (GARTNER) IBM Data Governance Council Maturity Model Aiim 35
36 Compliance retention/automatically Frame conditions: 1. Retention policy 2. Virtual organization 3. Decide which level you need retention - Information Category - Information type etc. RM Application: 1. An event trigger an retention application - Platform is disused - New or changed law - Updated retention policy etc 2. Which of the retention rules in the rule repository to apply - Dependent on: - Type of event - Type of object - Type of information object 3. The RM Application generates a set of information objects that need to be: - reviewed for retention workflow/information owner - updated retention date - or automatically deleted 36
37 Workflow Engine Basic Scenario 1: Managing retention for a given event DNVs Retention Manager 1. Submit trigging event 2. Make query & select documents 3. Select rules & calculate actions 4. Review, monitor, decide 5. Take action Retention Ontology Thing Engine for automatic deletion or update of metadata Interface for submitting events ProductionObject Platform Employee EventType Rule Condition DocType RuleContent NormalFlow OfTime EventType RIM DisuseOf Platform User Document Owner Process Contract TerminationOfE mployment Type Interface for reviewing retention candidates Interface for manual deletion or update of metadata Meridio Rule Repository Disuse platform Oseberg Employee Smith leaves the company Identify all documents related to Oseberg Identify all documents related to Smith Calculate all rules related to disuse of a platform AND the types of documents related to Oseberg Calculate all rules related to termination of employment and types of documents related to Smith 37 Find RIM for Oseberg Find RIM HR Review Oseberg documents Delete Osebergreports, but keep contract Set deletion date for Smithdocuments to Jan 1, 2019
38 Compliance Tool? Manual handling Performed by a tool External Law changes (region) Industry law changes Specific data Eks personal data DNV Broker ( lobing ) Tool? Relevant? JA Which internal Policy are involved? Nei Continue to Keep me updated Internal Review With feedback Decision by responsible Incorporate into the business Local storage? Retention time Information category/-type Information owner 38
39 Findings across work packages Cost Factors Commercialization 39
40 General cost factors in Digital preservation - Calculator OAIS Monitoring costs Security control costs Staff costs Maintenanc e costs Infrastructu re costs Standards costs Technology costs General cost factors Training costs Quality assurance costs Organisatio n costs Retention costs Selection costs Access costs 40
41 Development Path DNV 3rd party information management services DNV Trusted Online Service Information risk Management Digital Safe Information Maturity Assessment Service
42 Digital Safe Main functionality Long-term storage - Migration, conversion etc Compliance - Retention do automatically etc Digital Safe Trust - Fingerprints 3rd party role 42
43 Sleep like a baby - Assists you to secure sustainable records 43
44 Safeguarding life, property and the environment 44
LongRec All rights reserved. This publication or parts thereof may not be reproduced or transmitted in any form or by any means, including
LongRec All rights reserved. This publication or parts thereof may not be reproduced or transmitted in any form or by any means, including photocopying or recording, without reference to the source. Slide
More informationArchiving Systems. Uwe M. Borghoff Universität der Bundeswehr München Fakultät für Informatik Institut für Softwaretechnologie. uwe.borghoff@unibw.
Archiving Systems Uwe M. Borghoff Universität der Bundeswehr München Fakultät für Informatik Institut für Softwaretechnologie uwe.borghoff@unibw.de Decision Process Reference Models Technologies Use Cases
More informationAuto-Classification for Document Archiving and Records Declaration
Auto-Classification for Document Archiving and Records Declaration Josemina Magdalen, Architect, IBM November 15, 2013 Agenda IBM / ECM/ Content Classification for Document Archiving and Records Management
More informationAssessment of RLG Trusted Digital Repository Requirements
Assessment of RLG Trusted Digital Repository Requirements Reagan W. Moore San Diego Supercomputer Center 9500 Gilman Drive La Jolla, CA 92093-0505 01 858 534 5073 moore@sdsc.edu ABSTRACT The RLG/NARA trusted
More informationThe Way to SOA Concept, Architectural Components and Organization
The Way to SOA Concept, Architectural Components and Organization Eric Scholz Director Product Management Software AG Seite 1 Goals of business and IT Business Goals Increase business agility Support new
More informationCertified Information Professional 2016 Update Outline
Certified Information Professional 2016 Update Outline Introduction The 2016 revision to the Certified Information Professional certification helps IT and information professionals demonstrate their ability
More informationInteragency Science Working Group. National Archives and Records Administration
Interagency Science Working Group 1 National Archives and Records Administration Establishing Trustworthy Digital Repositories: A Discussion Guide Based on the ISO Open Archival Information System (OAIS)
More informationDigital preservation a European perspective
Digital preservation a European perspective Pat Manson Head of Unit European Commission DG Information Society and Media Cultural Heritage and Technology Enhanced Learning Outline The digital preservation
More informationSummary Table of Contents
Summary Table of Contents Preface VII For whom is this book intended? What is its topical scope? Summary of its organization. Suggestions how to read it. Part I: Why We Need Long-term Digital Preservation
More informationCloud Service Contracts: An Issue of Trust
Cloud Service Contracts: An Issue of Trust Marie Demoulin Assistant Professor Université de Montréal École de Bibliothéconomie et des Sciences de l Information (EBSI) itrust 2d International Symposium,
More informationModule 8 Digital Libraries and Open Access
Module 8 Digital Libraries and Open Access Lesson 2 How is a Digital Library Built? UNESCO EIPICT MODULE 8. LESSON 2 1 Why is there a need for a Digital Library? Digital libraries o Widen access to valuable
More informationHadoop & its Usage at Facebook
Hadoop & its Usage at Facebook Dhruba Borthakur Project Lead, Hadoop Distributed File System dhruba@apache.org Presented at the Storage Developer Conference, Santa Clara September 15, 2009 Outline Introduction
More informationDigital Preservation: the need for an open source digital archival and preservation system for small to medium sized collections,
Digital Preservation: the need for an open source digital archival and preservation system for small to medium sized collections, Kevin Bradley ABSTRACT: Though the solution to all of the problems of digital
More informationCloud Archive & Long Term Preservation Challenges and Best Practices
Cloud Archive & Long Term Preservation Challenges and Best Practices Chad Thibodeau, Cleversafe, Inc. Sebastian Zangaro, HP Author: Chad Thibodeau, Cleversafe, Inc. Author: Sebastian Zangaro, HP SNIA Legal
More informationSQL Server Master Data Services A Point of View
SQL Server Master Data Services A Point of View SUBRAHMANYA V SENIOR CONSULTANT SUBRAHMANYA.VENKATAGIRI@WIPRO.COM Abstract Is Microsoft s Master Data Services an answer for low cost MDM solution? Will
More informationArchive and Preservation in the Cloud - Business Case, Challenges and Best Practices. Chad Thibodeau, Cleversafe, Inc. Sebastian Zangaro, HP
Archive and Preservation in the Cloud - Business Case, Challenges and Best Chad Thibodeau, Cleversafe, Inc. Sebastian Zangaro, HP SNIA Legal Notice The material contained in this tutorial is copyrighted
More informationData Management in an International Data Grid Project. Timur Chabuk 04/09/2007
Data Management in an International Data Grid Project Timur Chabuk 04/09/2007 Intro LHC opened in 2005 several Petabytes of data per year data created at CERN distributed to Regional Centers all over the
More informationENTERPRISE DOCUMENTS & RECORD MANAGEMENT
ENTERPRISE DOCUMENTS & RECORD MANAGEMENT DOCWAY PLATFORM ENTERPRISE DOCUMENTS & RECORD MANAGEMENT 1 DAL SITO WEB OLD XML DOCWAY DETAIL DOCWAY Platform, based on ExtraWay Technology Native XML Database,
More informationARCHIVING FOR DATA PROTECTION IN THE MODERN DATA CENTER. Tony Walker, Dell, Inc. Molly Rector, Spectra Logic
ARCHIVING FOR DATA PROTECTION IN THE MODERN DATA CENTER Tony Walker, Dell, Inc. Molly Rector, Spectra Logic SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA unless otherwise
More informationCORNWELL Consultants in Management and IT
Costing EDRM Programmes Andy Rothwell & Richard House CORNWELL Consultants in Management and IT Aim To provide you with an overview of the cost drivers for Electronic Document & Records Management (EDRM)
More informationHPSS Best Practices. Erich Thanhardt Bill Anderson Marc Genty B
HPSS Best Practices Erich Thanhardt Bill Anderson Marc Genty B Overview Idea is to Look Under the Hood of HPSS to help you better understand Best Practices Expose you to concepts, architecture, and tape
More informationMiguel Ortiz, Sr. Systems Engineer. Globanet
Miguel Ortiz, Sr. Systems Engineer Globanet Agenda Who is Globanet? Archiving Processes and Standards How Does Data Archiving Help Data Management? Data Archiving to Meet Downstream ediscovery Needs Timely
More informationE-learning and Student Management System: toward an integrated and consistent learning process
E-learning and Student Management System: toward an integrated and consistent learning process Matteo Bertazzo 1, Franca Fiumana 2 1 CINECA, Information and Knowledge Management Services Department, via
More informationInformation Governance
Information Governance & Extended Content Solutions 2013 SOUND FAMILIAR? How do we connect our information together? How do we manage multiple un-integrated repositories of documents? Our users don t know
More informationObject Storage A Dell Point of View
Object Storage A Dell Point of View Dell Product Group 1 THIS POINT OF VIEW PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND TECHNICAL INACCURACIES. THE CONTENT IS PROVIDED
More informationEnterprise Content Management. Image from http://webbuildinginfo.com/wp-content/uploads/ecm.jpg. José Borbinha
Enterprise Content Management Image from http://webbuildinginfo.com/wp-content/uploads/ecm.jpg José Borbinha ECM? Let us start with the help of a professional organization http://www.aiim.org http://www.aiim.org/about
More informationA Business Case for Enterprise Content Integration using Ontology-based Content Analytics
A Business Case for Enterprise Content Integration using Ontology-based Content Analytics Edward Curry 1, Bill McDaniel 1, Dmitry Shingarev 1, Milena C. Caires 1, Mark Leyden 1, Sean O Riain 1, Karl Flannery
More informationAchieving a Step Change in Digital Preservation Capability
Essential Guide Achieving a Step Change in Digital Preservation Capability An assessment of Preservica using the Digital Preservation Capability Maturity Model (DPCMM) Executive Summary Nearly every organization
More informationAIIM & ASSUREON AN ASSUREON BRIEF
SOLUTIONBRIEF AIIM & ASSUREON AN ASSUREON BRIEF AIIM (Association for Information and Image Management) is the global community of information professionals. Their mission is to help organizations thrive
More informationApplying the OAIS standard to CCLRC s British Atmospheric Data Centre and the Atlas Petabyte Storage Service
Applying the OAIS standard to CCLRC s British Atmospheric Centre and the Atlas Petabyte Storage Service Corney, D.R., De Vere, M., Folkes, T., Giaretta, D., Kleese van Dam, K., Lawrence, B. N., Pepler,
More informationA Best Practice Guide to Archiving Persistent Data: How archiving is a vital tool as part of a data center cost savings exercise
WHITE PAPER A Best Practice Guide to Archiving Persistent Data: How archiving is a vital tool as part of a data center cost savings exercise NOTICE This White Paper may contain proprietary information
More informationIBM Enterprise Content Management (ECM)
IBM Enterprise Content Management (ECM) Vesna Ilic IBM ECM Tech Pre-Sales Manager SEA Region Vesna.ilic@si.ibm.com Ahmed Shanab IBM ECM Sales Manager MEEP & SEA Region ashanab@eg.ibm.com Today s Objectives
More informationINFORMATION GOVERNANCE FOR PRIVACY COMPLIANCE
Access and Privacy Conference Edmonton, June 13, 2012 Rick Klumpenhouwer, MA, MAS, CIAPP-M Partner, Cenera INFORMATION GOVERNANCE FOR PRIVACY COMPLIANCE Course Objectives Understand the principles of information
More informationDatabase preservation toolkit:
Nov. 12-14, 2014, Lisbon, Portugal Database preservation toolkit: a flexible tool to normalize and give access to databases DLM Forum: Making the Information Governance Landscape in Europe José Carlos
More informationCarestream Information Management Solutions. Managing the explosion in patient information
Managing the explosion in patient information Carestream Information Management Solutions Carestream Information Management Solutions The right information in the right place at the right time from the
More informationApache Hadoop FileSystem and its Usage in Facebook
Apache Hadoop FileSystem and its Usage in Facebook Dhruba Borthakur Project Lead, Apache Hadoop Distributed File System dhruba@apache.org Presented at Indian Institute of Technology November, 2010 http://www.facebook.com/hadoopfs
More informationDigital Libraries and Content Management
Digital Libraries and Content Management Database Research Group, University of Rostock 4th European IBM Content Manager and Media Workshop, September 2002, Essen 0. Overview 1. Content Management Systems
More informationCapacity Plan. Template. Version X.x October 11, 2012
Template Version X.x October 11, 2012 This is an integral part of infrastructure and deployment planning. It supports the goal of optimum provisioning of resources and services by aligning them to business
More informationThe Department for Business, Innovation and Skills IMA Action Plan PRIORITY RECOMMENDATIONS
PRIORITY RECOMMENDATIONS R1 BIS to elevate the profile of information risk in support of KIM strategy aims for the protection, management and exploitation of information. This would be supported by: Establishing
More informationData Governance Best Practice
Data Governance Best Practice Business Connexion Michelle Grimley Senior Manager EIM +27 (0)11 266 6499 Michelle.Grimley@bcx.co.za Inri Möller Master Data Manager +27 (0)11 266 5146 Inri.Möller@bcx.co.za
More informationState of Florida ELECTRONIC RECORDKEEPING STRATEGIC PLAN. January 2010 December 2012 DECEMBER 31, 2009
State of Florida ELECTRONIC RECORDKEEPING STRATEGIC PLAN January 2010 December 2012 DECEMBER 31, 2009 Florida Department of State State Library and Archives of Florida 850.245.6750 http://dlis.dos.state.fl.us/recordsmanagers
More informationObject Storage A Fresh Approach to Long-Term File Storage
Object Storage A Fresh Approach to Long-Term File Storage A Dell Technical White Paper Dell Product Group 1 THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND
More informationWhat We ll Cover. Defensible Disposal of Records and Information Litigation Holds Information Governance the future of records management programs
What We ll Cover Foundations of Records and Information Management Creating a Defensible Retention Schedule Paper v. Electronic Records Organization and Retrieval of Records and Information Records Management
More informationin the Cloud - What To Do and What Not To Do Chad Thibodeau / Cleversafe Sebastian Zangaro / HP
Digital PRESENTATION Data Archive TITLE and GOES Preservation HERE in the Cloud - What To Do and What Not To Do Chad Thibodeau / Cleversafe Sebastian Zangaro / HP SNIA Legal Notice The material contained
More informationDiagram 1: Islands of storage across a digital broadcast workflow
XOR MEDIA CLOUD AQUA Big Data and Traditional Storage The era of big data imposes new challenges on the storage technology industry. As companies accumulate massive amounts of data from video, sound, database,
More informationEUDAT. Towards a pan-european Collaborative Data Infrastructure
EUDAT Towards a pan-european Collaborative Data Infrastructure Damien Lecarpentier CSC-IT Center for Science, Finland EISCAT User Meeting, Uppsala,6 May 2013 2 Exponential growth Data trends Zettabytes
More informationNevada 2013. October 3, 2013 8:00 5:00
Nevada 2013 E-Records Forum October 3, 2013 8:00 5:00 The E-Records Forum brings stakeholders together from various governmental entities to discuss shared interests and concerns about the creation, management,
More informationCertified Information Professional (CIP) Certification Maintenance Form http://www.aiim.org/certification
Certified Information Professional (CIP) Certification Maintenance Form http://www.aiim.org/certification Name: Title: Company: Address: City: State/Province: ZIP/Postal Code: Country: Email Address: Telephone:
More informationWhat happens when Big Data and Master Data come together?
What happens when Big Data and Master Data come together? Jeremy Pritchard Master Data Management fgdd 1 What is Master Data? Master data is data that is shared by multiple computer systems. The Information
More informationDemographics QUESTIONS COMMENTS
DOI SURVEY Name: Bureau: Department: Location: Telephone: Email: Date of Interview: Defining Requirements for an Electronic Records Management Solution A series of fact finding questions will be presented
More informationInformation Management
G i Information Management Information Management Planning March 2005 Produced by Information Management Branch Open Government Service Alberta 3 rd Floor, Commerce Place 10155 102 Street Edmonton, Alberta,
More informationEnterprise Content Management with Microsoft SharePoint
Enterprise Content Management with Microsoft SharePoint Overview of ECM Services and Features in Microsoft Office SharePoint Server 2007 and Windows SharePoint Services 3.0. A KnowledgeLake, Inc. White
More informationEMC arhiviranje. Lilijana Pelko Primož Golob. Sarajevo, 16.10.2008. Copyright 2008 EMC Corporation. All rights reserved.
EMC arhiviranje Lilijana Pelko Primož Golob Sarajevo, 16.10.2008 1 Agenda EMC Today Reasons to archive EMC Centera EMC EmailXtender EMC DiskXtender Use cases 2 EMC Strategic Acquisitions: Strengthen and
More informationIntroduction. 1. Name of your organisation: 2. Country (of your organisation): Page 2
Introduction 1. Name of your organisation: 2. Country (of your organisation): 6 Page 2 Policies and Procedures The following questions address the policies and procedures regarding data management (acquisition,
More informationla conception et l'exploitation d'un système électroniques
Philippe NEW WORK ITEM PROPOSAL SC3 MARTIN 171 1 Date of presentation Reference number 2008/07/29 (to be given by the Secretariat) Proposer ISO/TC / SC N Secretariat 170 A proposal for a new work item
More informationHadoop & its Usage at Facebook
Hadoop & its Usage at Facebook Dhruba Borthakur Project Lead, Hadoop Distributed File System dhruba@apache.org Presented at the The Israeli Association of Grid Technologies July 15, 2009 Outline Architecture
More informationApache Hadoop FileSystem Internals
Apache Hadoop FileSystem Internals Dhruba Borthakur Project Lead, Apache Hadoop Distributed File System dhruba@apache.org Presented at Storage Developer Conference, San Jose September 22, 2010 http://www.facebook.com/hadoopfs
More informationArchiving A Dell Point of View
Archiving A Dell Point of View Dell Product Group 1 THIS POINT OF VIEW PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND TECHNICAL INACCURACIES. THE CONTENT IS PROVIDED
More informationSalesforce Certified Data Architecture and Management Designer. Study Guide. Summer 16 TRAINING & CERTIFICATION
Salesforce Certified Data Architecture and Management Designer Study Guide Summer 16 Contents SECTION 1. PURPOSE OF THIS STUDY GUIDE... 2 SECTION 2. ABOUT THE SALESFORCE CERTIFIED DATA ARCHITECTURE AND
More informationXpoLog Center Suite Log Management & Analysis platform
XpoLog Center Suite Log Management & Analysis platform Summary: 1. End to End data management collects and indexes data in any format from any machine / device in the environment. 2. Logs Monitoring -
More informationChapter 7. Using Hadoop Cluster and MapReduce
Chapter 7 Using Hadoop Cluster and MapReduce Modeling and Prototyping of RMS for QoS Oriented Grid Page 152 7. Using Hadoop Cluster and MapReduce for Big Data Problems The size of the databases used in
More informationEII - ETL - EAI What, Why, and How!
IBM Software Group EII - ETL - EAI What, Why, and How! Tom Wu 巫 介 唐, wuct@tw.ibm.com Information Integrator Advocate Software Group IBM Taiwan 2005 IBM Corporation Agenda Data Integration Challenges and
More informationArchival Data Format Requirements
Archival Data Format Requirements July 2004 The Royal Library, Copenhagen, Denmark The State and University Library, Århus, Denmark Main author: Steen S. Christensen The Royal Library Postbox 2149 1016
More informationDASCOSA: Database Support for Computational Science Applications. Kjetil Nørvåg Norwegian University of Science and Technology Trondheim, Norway
DASCOSA: Database Support for Computational Science Applications Kjetil Nørvåg Norwegian University of Science and Technology Trondheim, Norway Outline Background/context: Databases & Grids Requirements
More informationProposal No. P16/9921 Records Management Platform
Answers to Vendor Questions Questions are in black, Answers are in red 1. Please expand on the types of restrictions PCCCD is interested in. Provide what kinds of restrictions your system has the ability
More informationReclaiming Primary Storage with Managed Server HSM
White Paper Reclaiming Primary Storage with Managed Server HSM November, 2013 RECLAIMING PRIMARY STORAGE According to Forrester Research Inc., the total amount of data warehoused by enterprises is doubling
More informationBest Archiving Practice Guidance
Best Archiving Practice Guidance This document has been published under the auspices of the EU Telematics Implementation Group - electronic submissions (TIGes) Please note that this document has been published
More informationGEOG 482/582 : GIS Data Management. Lesson 10: Enterprise GIS Data Management Strategies GEOG 482/582 / My Course / University of Washington
GEOG 482/582 : GIS Data Management Lesson 10: Enterprise GIS Data Management Strategies Overview Learning Objective Questions: 1. What are challenges for multi-user database environments? 2. What is Enterprise
More informationHow To Manage An Electronic Discovery Project
Optim The Rise of E-Discovery Presenter: Betsy J. Walker, MBA WW Product Marketing Manager What is E-Discovery? E-Discovery (also called Discovery) refers to any process in which electronic data is sought,
More informationThe Key Elements of Digital Asset Management
The Key Elements of Digital Asset Management The last decade has seen an enormous growth in the amount of digital content, stored on both public and private computer systems. This content ranges from professionally
More informationVeritas Enterprise Vault.cloud for Microsoft Office 365
TM Veritas Enterprise Vault.cloud for Microsoft Office 365 Assume control over your information ecosystem Benefits at a glance Satisfies email retention requirements by journaling an immutable copy of
More informationElastic Application Platform for Market Data Real-Time Analytics. for E-Commerce
Elastic Application Platform for Market Data Real-Time Analytics Can you deliver real-time pricing, on high-speed market data, for real-time critical for E-Commerce decisions? Market Data Analytics applications
More informationGeoGrid Project and Experiences with Hadoop
GeoGrid Project and Experiences with Hadoop Gong Zhang and Ling Liu Distributed Data Intensive Systems Lab (DiSL) Center for Experimental Computer Systems Research (CERCS) Georgia Institute of Technology
More informationSemantic and Organisational Interoperability Issues in Public Sector in Norway
Semicolon Semantic and Organisational Interoperability Issues in Public Sector in Norway Terje Grimstad, project leader, Semicolon 1 Introduction Full electronic interoperability between public and private
More informationIaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures
IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF Introduction
More informationAgenda. You are not in the business to manage records
Global Records and Information Management Risk: Proactive and Practical Approaches to Effective Records Management September 16, 2014 Maura Dunn, MLS, CRM Lee Karas, MBA Agenda Drivers for your Records
More informationDIGITAL PRESERVATION AT THE U.S. GOVERNMENT PRINTING OFFICE: WHITE PAPER. Version 2.0. 9 July 2008 UNITED STATES GOVERNMENT PRINTING OFFICE
DIGITAL PRESERVATION AT THE U.S. GOVERNMENT PRINTING OFFICE: WHITE PAPER Version 2.0 9 July 2008 Record of Changes Version Description Of Change Revision Date Author Number 1.0 Baseline Document July 12,
More informationERA Challenges. Draft Discussion Document for ACERA: 10/7/30
ERA Challenges Draft Discussion Document for ACERA: 10/7/30 ACERA asked for information about how NARA defines ERA completion. We have a list of functions that we would like for ERA to perform that we
More informationElectronic Records Management, Preservation, and Best Practices in Indiana Government
Electronic Records Management, Preservation, and Best Practices in Indiana Government Indiana Commission on Public Records Today s Agenda Updates and Initiatives from ICPR Retention Requirements for Electronic
More informationWhy archiving erecords influences the creation of erecords. Martin Stürzlinger scopepartner Vienna, Austria
Why archiving erecords influences the creation of erecords Martin Stürzlinger scopepartner Vienna, Austria Electronic Records In a Productive System Created Used Changed Deleted In an Archival System No
More informationSemantic Exploration of Archived Product Lifecycle Metadata under Schema and Instance Evolution
Semantic Exploration of Archived Lifecycle Metadata under Schema and Instance Evolution Jörg Brunsmann Faculty of Mathematics and Computer Science, University of Hagen, D-58097 Hagen, Germany joerg.brunsmann@fernuni-hagen.de
More informationHow Does the Cloud Fit into Active Archiving. Active Archive Alliance Panel
How Does the Cloud Fit into Active Archiving Active Archive Alliance Panel Active Archive Alliance Founded 2010 The Active Archive Alliance is a multi-vendor association aimed at evolving new technologies
More informationwww.basho.com Technical Overview Simple, Scalable, Object Storage Software
www.basho.com Technical Overview Simple, Scalable, Object Storage Software Table of Contents Table of Contents... 1 Introduction & Overview... 1 Architecture... 2 How it Works... 2 APIs and Interfaces...
More informationKnowledge as a Service for Agriculture Domain
Knowledge as a Service for Agriculture Domain Asanee Kawtrakul Abstract Three key issues for providing knowledge services are how to improve the access of unstructured and scattered information for the
More informationThe Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets
The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and
More informationData Grid Landscape And Searching
Or What is SRB Matrix? Data Grid Automation Arun Jagatheesan et al., University of California, San Diego VLDB Workshop on Data Management in Grids Trondheim, Norway, 2-3 September 2005 SDSC Storage Resource
More informationHitachi Content Platform. Andrej Gursky, Solutions Consultant May 2015
Hitachi Content Platform Andrej Gursky, Solutions Consultant May 2015 What Is Object Storage? Aggregate, manage, protect and use content Just like we move photos from devices to a PC Hard to use on the
More informationArchive strategy for electronic records Peter Fæster Nielsen, Novo Nordisk 13. May 2014
Digitale arkiver - Vedligeholdelse, tilgængelighed og forskning 13. maj 2014 på Rigsarkivet Archive strategy for electronic records Peter Fæster Nielsen, Novo Nordisk 13. May 2014 Peter Fæster Nielsen,
More informationData Mining Governance for Service Oriented Architecture
Data Mining Governance for Service Oriented Architecture Ali Beklen Software Group IBM Turkey Istanbul, TURKEY alibek@tr.ibm.com Turgay Tugay Bilgin Dept. of Computer Engineering Maltepe University Istanbul,
More informationLong Term Knowledge Retention and Preservation
Long Term Knowledge Retention and Preservation Aziz Bouras University of Lyon, DISP Laboratory France abdelaziz.bouras@univ-lyon2.fr Recent years: How should digital 3D data and multimedia information
More informationTaming Big Data Storage with Crossroads Systems StrongBox
BRAD JOHNS CONSULTING L.L.C Taming Big Data Storage with Crossroads Systems StrongBox Sponsored by Crossroads Systems 2013 Brad Johns Consulting L.L.C Table of Contents Taming Big Data Storage with Crossroads
More informationCommon Operating-System Components
Common Operating-System Components Process Management Main Memory Management File Management I/O System Management Secondary Management Protection System Oct-03 1 Process Management A process is a program
More informationTransition Guidelines: Managing legacy data and information. November 2013 v.1.0
Transition Guidelines: Managing legacy data and information November 2013 v.1.0 Document Control Document history Date Version No. Description Author October 2013 November 2013 0.1 Draft Department of
More informationElectronic Recordkeeping
Electronic Recordkeeping 1 Agenda 1. Provide Overview of Electronic Records in Government. 2. Provide Definitions for Understanding ERK. 3. Describe Objectives of ERK. 4. Identify Critical Success Factors
More informationInvestment Bank Case Study: Leveraging MarkLogic for Records Retention and Investigation
Investment Bank Case Study: Leveraging MarkLogic for Records Retention and Investigation 2014 MarkLogic. All rights reserved. Reproduction of this white paper by any means is strictly prohibited. TABLE
More informationRecord Retention and Digital Asset Management Tim Shinkle Perpetual Logic, LLC
Record Retention and Digital Asset Management Tim Shinkle Perpetual Logic, LLC 1 Agenda Definitions Electronic Records Management EDMS and ERM ECM Objectives Benefits Legal and Regulatory Requirements
More informationDocument Management. Introduction. CAE DS Product data management, document data management systems and concurrent engineering
Document Management Introduction Document Management aims to manage organizational information expressed in form of electronic documents. Documents in this context can be of any format text, pictures or
More informationIntroduction to Cloud Computing
Introduction to Cloud Computing Cloud Computing I (intro) 15 319, spring 2010 2 nd Lecture, Jan 14 th Majd F. Sakr Lecture Motivation General overview on cloud computing What is cloud computing Services
More informationLong-term Archiving of Relational Databases with Chronos
First International Workshop on Database Preservation (PresDB'07) 23 March 2007, at the UK Digital Curation Centre and the Database Group in the School of Informatics, University of Edinburgh Long-term
More information