Digital Assets Repository 3.0. PASIG User Group Conference Noha Adly Bibliotheca Alexandrina



Similar documents
DAR: A Digital Assets Repository for Library Collections

Ex Libris Rosetta: A Digital Preservation System Product Description

DAR: A Digital Assets Repository for Library Collections An Extended Overview

Functional Requirements for Digital Asset Management Project version /30/2006

Technical. Overview. ~ a ~ irods version 4.x

Digital Preservation. OAIS Reference Model

- a Humanities Asset Management System. Georg Vogeler & Martina Semlak

A Digital Library Feasibility Study

DIGITAL ARCHIVES & PRESERVATION SYSTEMS

Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context

ORACLE HYPERION DATA RELATIONSHIP MANAGEMENT

In ediscovery and Litigation Support Repositories MPeterson, June 2009

SHared Access Research Ecosystem (SHARE)

Extending Microsoft SharePoint Environments with EMC Documentum ApplicationXtender Document Management

Database Preservation Toolkit: a flexible tool to normalize and give access to databases

ENTERPRISE DOCUMENTS & RECORD MANAGEMENT

JOURNAL OF OBJECT TECHNOLOGY

Adding Robust Digital Asset Management to Oracle s Storage Archive Manager (SAM)

Queensland recordkeeping metadata standard and guideline

Long Term Knowledge Retention and Preservation

M Designing and Implementing OLAP Solutions Using Microsoft SQL Server Day Course

DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories

The Open Source CMS. Open Source Java & XML

Flattening Enterprise Knowledge

Digitization Workflow Management System for Massive Digitization Projects

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007

Collaborative Open Market to Place Objects at your Service

Nexus Professional Whitepaper. Repository Management: Stages of Adoption

TopBraid Insight for Life Sciences

VMware Mirage Web Manager Guide

Principal MDM Components and Capabilities

Ontario Ombudsman. Goals

GeoNetwork, The Open Source Solution for the interoperable management of geospatial metadata

Tools for Researchers

Communiqué 4. Standardized Global Content Management. Designed for World s Leading Enterprises. Industry Leading Products & Platform

GetLOD - Linked Open Data and Spatial Data Infrastructures

Introduction to TIBCO MDM

Community Edition. Master Data Management 3.X. Administrator Guide

MBooks: Google Books Online at the University of Michigan Library

ANSYS EKM Overview. What is EKM?

Columbia University Digital Library Architecture. Robert Cartolano, Director Library Information Technology Office October, 2009

CONDIS. IT Service Management and CMDB

Mercy Baggot Street Canopy Intranet

WHITE PAPER. Integrating Adobe Premiere Pro with emam for a Collaborative Workflow

Autonomy Consolidated Archive

How To Manage Your Digital Assets On A Computer Or Tablet Device

Sage Intelligence Financial Reporting for Sage ERP X3 Version 6.5 Installation Guide

Using EMC Documentum with Adobe LiveCycle ES

Sun Open Archive Framework and Fedora Repository Solutions

GV STRATUS The Next Step in Collaborative Workflows. Régis André Product Manager, STRATUS September 2011

General concepts: DDI

The Ontological Approach for SIEM Data Repository

ILM et Archivage Les solutions IBM

Mediasite A Video Content Management & Distribution Platform. Technical planner: TP-10

MarkLogic Enterprise Data Layer

European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project

Maximise your Microsoft investment to provide Legal Matter Management

OpenNebula Open Souce Solution for DC Virtualization. C12G Labs. Online Webinar

Brown County Information Technology Aberdeen, SD. Request for Proposals For Document Management Solution. Proposals Deadline: Submit proposals to:

EMC PERSPECTIVE EMC SourceOne Management

Rotorcraft Health Management System (RHMS)

Authoring Within a Content Management System. The Content Management Story

EMC PERSPECTIVE. Understanding Content Management and Digital Asset Management Functionality

2009 ikeep Ltd, Morgenstrasse 129, CH-3018 Bern, Switzerland (

ISLANDORA STAFF USER GUIDE. Version 1.3

Vilas Wuwongse, Thiti Vacharasintopchai, Neelawat Intaraksa Asian Institute of Technology

Building An Institutional Repository With DSpace

Diagram 1: Islands of storage across a digital broadcast workflow

Example of Implementing Folder Synchronization with ProphetX

A collaborative platform for knowledge management

ADAM Agency solutions

ENTERPRISE CONTENT MANAGEMENT. Trusted by Government Easy to Use Vast Scalability Flexible Deployment Automate Business Processes

<Insert Picture Here> Solution Direction for Long-Term Archive

Business Proposition. Digital Asset Management. Media Intelligent

#MMTM15 #INFOARCHIVE #EMCWORLD 1

The Rutgers Workflow Management System. Workflow Management System Defined. The New Jersey Digital Highway

WebCenter Release notes

Building an open source based, open standards, infrastructure for the large scale provisioning of reusable open content

Meister Going Beyond Maven

Building Views and Charts in Requests Introduction to Answers views and charts Creating and editing charts Performing common view tasks

WHY DIGITAL ASSET MANAGEMENT? WHY ISLANDORA?

Content Management for Content Enrichment: Architectural Issues and Strategies

Test Data Management Concepts

IFS-8000 V2.0 INFORMATION FUSION SYSTEM

SQL Server Training Course Content

Secure file sharing and collaborative working solution

Content Management Implementation Guide 5.3 SP1

Taking Control of Library Metadata and Websites using the extensible Catalog

Long-term archiving and preservation planning

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success

Web Publisher Administration Guide

A Selection of Questions from the. Stewardship of Digital Assets Workshop Questionnaire

Transcription:

Digital Assets Repository 3.0 PASIG User Group Conference Noha Adly Bibliotheca Alexandrina

DAR 3.0 DAR manages the full lifecycle of a digital asset: its creation, ingestion, metadata management, storage, dissemination, publishing and archival An eco-system of components for an integrated institutional repository.

DAR 3.0 Modular design with integrated components Consolidation of assets Flexible content model for different types of digital objects based on current standards Integration with different sources of metadata, e.g ILS, repositories, databases, Repository-bound applications Preservation

Conceptual Overview Digital Assets Factory (DAF) Flexible management for the digitization workflow Unified means of ingestion into the system Support both physical and born digital materials Digital Assets Metadata (DAM) manages the metadata even in an incomplete state. Digital Assets Publishing (DAP) components allow applications to synchronize objects and their metadata stored in their databases/indexes with the repository Digital Assets Keeper (DAK) manages access to the object files, versions and caching.

Conceptual Overview Collections/Sets: DAR manages one instance of the object Objects are consolidated into sets/collections An object can belong to different sets Objects are shared among applications Applications synch with repository getting latest updates of their objects Applications maintain different derivatives of same object Relies on RDF to define sets and relations between objects

Conceptual Overview Discovery layer Core files are kept online on spinning drives Simple derivatives for display Users can browse and search using simple viewers Provides full text search across the whole collection, based on the access rights granted to the user. Ingestion plugins Flexible Integration with different sources of metadata Allow ingestion and synchronization with external sources

Digital Assets Factory (DAF) Full control over the digitization process workflow Configurable and flexible management tool for any digitization workflow Flexible workflow definition including Definition of sequence of phases Pre-phase and post-phase checks Redirects Special workflows are defined for different object types

Digital Assets Factory (DAF) Automated integrity checks at each step of the workflow. Automated ingestion into the repository and archiving. Integrates with external sources of metadata thru plugins Integrates with enterprise tools and automated software used for digitization Compliant with OAIS Available for download at http://wiki.bibalex.org/dafwiki

Metadata Management METS and MODS standards for recording metadata Fedora as a metadata registry Content Models (Hybrid) Photo (atomistic) / Album (aggregate) Book (compound ) / Bibliographic (aggregate)

Triple Store and Handles Triple Store RDF relations between objects are stored in Triple Store Currently using Mulgara Scalability Issues Alternatives: 4Store? Integration with Fedora Handles Each object has a unique identifier UUID UUID is used to generate Handle list of external identifiers is maintained

METS Store A METS skeleton is created for each object even if metadata is incomplete When metadata complete, send to Fedora and disseminate Accommodate digitizing objects before metadata is ready METS store can be used to reconstruct Fedora

Metadata Synchronization External sources Synchronization is based on XML templates Templates map the output of ILS or DB into MODS Templates can be easily created for different sources Metadata Tool No source of info to extract metadata Relies on human data entry (normal users) Generates human friendly forms thru configurable XML templates Offers type validation, controlled vocabulary, authority lists Metadata is synchronized with METS store Allows full text search (Solr) across items in sets/collections Represent s objects in a hierarchy depicting sets /collections Supports simple workflow with designated roles e.g. editors, reviewers, etc.

Copyright and Access Module Access control policy for specific sets or objects Can define rights to certain operations (e.g. view, print, download etc) based on the application requesting access Can define exceptions to override rules (e.g. prevent a certain object from being displayed) Coordinate access to objects based on the number of licenses

Authentication and Authorization Single Sign On module Set management and ACLs LDAP integration and local users

Digital Assets Keeper Keep a working copy of the object online Maintain a unique copy of the object with persistent identifier Handle entries and external identifiers A storage abstraction layer isolate repository from storage implementation Manages different versions of items Manages caching and derivates Load balancing among nodes

Online Archive (OnA) Complete hardware and software solution for archival Provides reliable and scalable storage based on commodity hardware with spinning hard drives uses in house developed software for data management Any AIP ingested is mirrored at least once Heavily relies on Checksums to ensure the integrity of the data

Digital Assets Publishing (DAP) Different Viewers and applications are built using the Restful API Applications are highly integrated with repository; not separate silos: Repository-bound DAR manages one instance of each object Applications have access to slice of the data (Sets of Objects) based on their access rights Applications synch with DAR: queries API for new or updated metadata and files Applications maintain different derivatives independently

Discovery Layer Stores simple derivatives for all objects Users can browse and search all assets stored within using simple viewers. Provides full text search across the metadata and textual content, based on the access rights granted to the user. Full text search is built on Solr with support for 5 languages: Arabic, English, French, Spanish and Italian

Current Status More than 430,000 objects including Books Photos Manuscripts Maps Documents Specialized viewers been built to display items stored within the repository, such as books and photos. More viewers are still under e.g. tiled image viewer and manuscript viewer. Print on demand (POD) integration layer makes part of DAR available through the POD system. Several interfaces can also be built on top of this API to integrate DAR with other systems.

DAR Books Application built on top of DAR using Restful API displays books stored in the repository (185,000) Faceted Search, including content Morphological full text search (5 languages) Search results highlighting Embeddable book viewer, can be added to any webpage. Whenever a book is added to or updated in DAR, it is automatically retrieved by DAR books.

DAR Books Annotations Tools Sticky Notes Highlight and underline, colors More to come Open Annotations, Annotea, etc Web 2.0 Social Features: Rating and comments Create your own bookshelves Sharing and embedding Adding to other social sites: Facebook, Twitter,

Text Highlighting

Text Underlining

Adding Sticky Notes

Future Work Enhance the Storage Layer: exploring irods, pair trees etc Extending the Copyright and Access module Explore the potential of triple stores Beyond defining sets and collections Scalability Migrating existing applications into repository-bound

Thank You