Digitization Workflow of the. Bavarian State Library. Gabriele Messmer. Bavarian State Library. Munich, Germany



Similar documents
Mass Digitisation and Long-term Preservation Processes and Production at Munich Digitisation Centre

Mass Digitization of Manuscripts and Rare Books: Challenges and Experiences at Bavarian State Library

Long-term preservation activities of the Bavarian State Library

The Czech Digital Library and Tools for the Management of Complex Digitization Processes

How To Build A Map Library On A Computer Or Computer (For A Museum)

Mass digitisation workflow management

DAR: A Digital Assets Repository for Library Collections

OCLC CONTENTdm. Geri Ingram Community Manager. Overview. Spring 2015 CONTENTdm User Conference Goucher College Baltimore MD May 27, 2015

DIGITAL OBJECT an item or resource in digital format. May be the result of digitization or may be born digital.

3 C i t y C e n t e r D r i v e S u i t e S t. L o u i s, MO w w w. k n o w l e d g e l a k e. c o m P a g e 3

Laserfiche Content Management Software

DFG form /15 page 1 of 8. for the Purchase of Licences funded by the DFG

Long-term archiving and preservation planning

1.00 ATHENS LOG IN BROWSE JOURNALS 4

DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories

WRDC Gets the ITQAN Enterprise Content Management Treatment

Business Software Defined DMS DOCUMENT MANAGEMENT SYSTEM ZETA SOFTWARE

Version5 +AutoCapture & Workflow

Taking Control of Library Metadata and Websites using the extensible Catalog

Visendo Fax Server Integration With SharePoint Server

Whitepaper Document Solutions

Get the Best Digital Images Possible. What s it all about anyway?

User Guide of edox Archiver, the Electronic Document Handling Gateway of

EXPLORE THE BRITISH LIBRARY: WHAT NON-PRINT ITEMS CAN I VIEW?

The Australian War Memorial s Digital Asset Management System

Taking full advantage of the medium does also mean that publications can be updated and the changes being visible to all online readers immediately.

Solutions Pocket Book for Touchstone

Service Plan Fiscal Year 2016

Introducing Version 7 SOFTWARE MODULES

NeoDocs Document Management Software

Therefore. People. Process. Information Product Brochure

Laserfiche Product Suite

Islandora: An Open Source Institutional Repository Solution. Consortium of MnPALS Libraries Annual Meeting April 2014

Overview of NDNP Technical Specifications

Emerging Career Trends for Information Professionals: A Snapshot of Job Titles in Summer 2013

Image Galleries: How to Post and Display Images in Digital Commons

DIGITAL ARCHIVES & PRESERVATION SYSTEMS

Customer Tips. Xerox Network Scanning TWAIN Configuration for the WorkCentre 7328/7335/7345. for the user. Purpose. Background

ADOS 6.1. ADOS version 6.1 ADOS 6.1. Effective information archive and retrieval. Security and compliance. Return on investment.

Melvin P. Thatcher FamilySearch Utah, USA

DMS Document Management System

ENTERPRISE DOCUMENTS & RECORD MANAGEMENT

E-Content Service Group Virtual Meeting. Digital Preservation: How to Get Started

The German Name Authority File (PND) in the Union Catalogue: principles experiences and costs. Gabriele Meßmer Bayerische Staatsbibliothek

Oxford Reference Answers with Authority. LOUIS Conference Oct 2012

Using Open Source Software to Manage Policies and Clinical Guidelines. Library & Knowledge Service Derby Teaching Hospitals NHS Foundation Trust

ProLibis Solutions for Libraries in the Digital Age. Automation of Core Library Business Processes

Document scanning. Done right. TM

FIVE-MINUTES-TO-CONTRACT The DESKO over-all concept for digital contract management and ID verification.

ImageNow User. Getting Started Guide. ImageNow Version: 6.7. x

Laserfiche. and SharePoint Integration. Your potential, realized. Learn More Inside. Include imaged documents in collaborative processes.

Perfect PDF & Print 9

SECURITY WHERE THE HISTORY LIVES

Desktop Computing in Skillport Finding Approved Folders and Printing Certificates of Completion

Technical concepts of kopal. Tobias Steinke, Deutsche Nationalbibliothek June 11, 2007, Berlin

Research Data Management Guide

A. The Treeno Data Center maintains audited advanced security systems equal to the most sophisticated systems of large corporations.

EverSuite. Enterprise Content Management Solutions. Capture. Manage. Store. Preserve. Publish

d3 Document Management Solution

Guide by. A Guide to the Talkdesk and MAGENTO. Integration. Advantages / How to use / Activate and setup the integration / more...

The FAO Open Archive: Enhancing Access to FAO Publications Using International Standards and Exchange Protocols

Building next generation consortium services. Part 3: The National Metadata Repository, Discovery Service Finna, and the New Library System

Digital Assets Repository 3.0. PASIG User Group Conference Noha Adly Bibliotheca Alexandrina

I want to run my business with state of the art technology.

IT Service Desk Manual Ver Document Prepared By: IT Department. Page 1 of 12

Treeno File Monitor. Installation and Configuration Guide

oit Manage Digital Images with Picasa March 2010

How To Manage Documents On A Cloud On A Pc Or Mac Or Mac (For Pc Or Ipa)

The Rutgers Workflow Management System. Workflow Management System Defined. The New Jersey Digital Highway

The Laserfiche Rio Advantage. Automate, Optimize and Transform Business Processes. Unlimited document repositories and servers

Table of Contents 2. Table of Contents

You can access it anywhere - on your desktop, online, or on your ipad. Benefits include:-

OCR and PDF Compression

ISLANDORA STAFF USER GUIDE. Version 1.3

Capture INTEGRATED SCANNING WORKFLOW CAPTURE PROCESS DISTRIBUTE

INTEGRATED SCANNING WORKFLOW

WebCenter Forms Recognition Learn Sets to the Rescue! August 14, 2014

Assignment 1 Briefing Paper on the Pratt Archives Digitization Projects

Implementing. Springer ebooks. Wouter van der Velde

Shared service components infrastructure for enriching electronic publications with online reading and full-text search

About the Digitization Programme & History of ITU Portal

Document Archiving White Paper. Secure. Accessible. Reliable

e CABINET AND DOCULEX Document Capture and Electronic File Conversion

For TruckMate SOLUTION OVERVIEW

springerlink.com Introduction to SpringerLink springerlink.com

A Digital Library Feasibility Study

THE HELMHOLTZ INVENIO REPOSITORY PROJECT :

Transcription:

Digitization Workflow of the Bavarian State Library Gabriele Messmer Bavarian State Library Munich, Germany

Digitization process at a glance 2

ERaTO ERaTO a tool to create fill in and print an order form that informs the patron about the estimated price 3

Order Form for Digitization on Demand (DoD) 4

ERaTO 5

ERaTO 6

ZEND ZEND = Zentrale Erfassungs- und NachweisDatenbank [digital asset management system] Mapping of the entire production processes to a modular system Different service providers (scanning, text capture) can supply unlimited data to ZEND Workflow control Every object of the BSB, which will be digitized, follows the ZEND workflow Time and cost reduction through extensive automation 7

ZEND at a glance o o o Digitisation on Demand Project-oriented Digitisation Conservation Purposes 1 Zahlb ar an $ Digitisation 2 2 Order Digitised Object URN- Resolving (XEpicur) DNB 5 ZEND Definitive file name 3 Network of Bavarian Libraries Catalog (OPAC) Portals (BLO, ZvDD, Chronicon,...) 4 Exchange of metadate URN 5 6 7 Administration of all metadata (bibliographic, technical, administrative) Automatic image processing Production System 8 OCR 3 OAI 4 Web publication Inhouse-only publication Search engines Online Publication Archival Storage 8

ZEND modules Modules of the ZEND Workflow Tool ZEND Administration ZEND Enduser Interface ERATo Order Tool XML-Editor Deep Indexing Viewer Search/Browse Title / Full Text Order Management Collection Management RSS Feeds PDF-Download DaVeDi Workflow Database Administration Acces & Administration Subject Portals DoD Services Order & Billing ZEND Workflow Modules Image Conversion Catalog Enrichment Resolving Link OCR Processing Archival Storage Interfaces OAI, Z39.50 Metsificator METS Export Data Harvester Data Management via TSM-Client URN Generator Resolving Link Monitoring Access & Error Log Repository Maintenance Retrieval from Archive ZEND Data Database Object Informaton Metadata Repository Structural Information Images Full Text Archival Storage (TSM) 9

Scanning process step by step 1. Transport of the original objects to the Scanning Center 2. Conservational check 3. Preparation for the scanning process 4. Image capture 5. Quality control 6. Indexing and OCR 7. Storage and digital long-term preservation 8. Publication on the web / Access 10

ZEND login screen 11

Digitization order in ZEND 12

Digitization order in ZEND 13

Import of the bibliographic metadata into ZEND via Z39.50 from the local catalogue system 14

Data import via Z39.50 into ZEND 15

and at the same time generation of the definitive file name and allocation of an URN 16

Escursus: File name and URN Creation of the definitive ZEND identification number Example: bsb000019118_00001.tif and at the same time creation of the Unified Resource Name (URN) Example : urn:nbn:de:bvb:12-bsb00019118-6 Assignment is done locally in BSB, the resolving service is hosted by the German National Library Resolving link: http://nbn-resolving.de/urn/resolver.pl?urn=urn:nbn:de:bvb:12- bsb00019118-6 17

ZEND Process slip with barcode 18

Focus of in-house digitization Manuscripts Music manuscripts and printed music Old books (6th to 16th centuries) Rare and expensive books Digitization parameters Production parameters 24 bit High resolution (300-600 ppi in relationship to the originals size) ICC profiles TIFF uncompressed 19

Scanning speed Scanning speed depends of Desired reproduction quality Preservation requirements Age and state of the original ( difficult fixing) Format/binding of the original 20

Conservational requirements at BSB Institute of Book and Manuscript Conservation (IBR) The scanning devices have to follow the book requirements and not vice versa! General requirements: ligthing of the object and room climate Specific requirement: use the scanning device most appropriate to the original work 21

Scanning devices of Munich Digitization Center 20 scanners (state of technology 2006-2010) among them 4 automated scanning devices 1 thermographic scanner for watermarks Hasselblad digital camera 22

Munich Scanning Center 23

Camera Work Table from Graz/Austria 24

Scanning by robots the VD16 Project Challenge: to scan old books with scan-robots Project cofinanced by the German Research Foundation ScanRobot: EU ICT Grand Prize Winner at the Cebit 2007 Start: December 2007 Equipment 2011: 4 scanrobots with each 800 pages/h (old books) 25

Scanning process Choice of most appropriate scanner Installing of a scan-job on the scanner PC Image capture Finalizing the scan job Moving the digitized images to a file called Abholfach 26

Quality control and approval Quality control Entering structural data Approval Return of the original work to the ordering library department or to the stocks 27

Automatic processing after scanning is finalized OCR processing of the images (optional) Creation of an index file for the web presentation of the images Automatic production of image formats for the web presentation (JPG, PDF etc.) Creation of the browsing structure for the object (ToC-Editor) Storage of all the data in the Leibniz Supercomputing Centre for long-term preservation 28

ZEND: XML-ToC(Table of Contents) editor Quality control and correction (image size & orientation, etc.) Flagging the structural elements of the document (i.e. frontispiece, chapter titles, images etc.) - allows navigation inside the digital object Subsequent "activation" and instant availability on the WWW. 29

ZEND: ToC editor 30

DILL Gabriele - Parma Messmer: - Gabriele Digitization Messmer Workflow - Bayerische Staatsbibliothek 2011 31

Finished version for Web presentation 32

Immediate availability inside the local catalogue (OPAC) 33

Archival storage of the data 34

Digitization workflow archiving report 35

Challenge OCR output Cited from: http://www.abbyy.de/recognition_server/brochure_en/ 36

Challenge OCR output 37

Challenge OCR output 38

Fulltext search in books digitized by Google 39

Contact messmer@bsb-muenchen.de 40