Austrian Books Online

Size: px
Start display at page:

Download "Austrian Books Online"

Transcription

1 Austrian Books Online Google Books based mass digitisation Stefan Majewski OPF Hackathon Austrian National Library, Vienna

2 Overview The project How the data is acquired, from carrying the book to storing the files. The delights and perils of mass digitisation Some challenges How to work with the data? Data organisation

3 Austrian Books Online The Project

4 Key Facts Scope: 600, Mio Pages Progress: 180,000 -> 5,500/3weeks Workforce: 20+ FTE -> 60+ P Areas Logistics Metadata Conservation Download & QA Online Presentation Storage PM

5 Material legal deposit >> wide variety of material from: 16th century 19th 2nd half of century _

6 Public Access Google Books Digital Library Austrian National Library

7 13 Libraries in Europe 5 National Libraries Italy Austria The Netherlands Czech Republic Great Britain

8 >20 Mio. books > 50% non-english ~ 75% from libraries ~ 2 Mio. books from European libraries > 3 Mio. books public domain

9 digitisation of the entire historical book holdings of the Austrian National Library 16th to 19th century

10 70+ staff members 20+ exclusively for project book logistics metadata adaptation cataloguing conservation / restoration quality control software implementation project management

11 48,8 person years

12 Austrian Books Online Jahrhunderte 2% 10% 16. Jh. 43% 31% 14% 17. Jh. 18. Jh. 19. Jh. no year

13 Austrian Books Online Sprachen 3% 8% 13% 31% 14% eng ita fre lat ger 31% others

14 70% 60% Austrian Books Online 50% 40% 30% eng ita fre lat ger 20% 10% 0% 16. Jh. 17. Jh. 18. Jh. 19. Jh.

15

16 Ende 2013 ~ Bände digitalisiert

17 ÖNB Buch-Viewer

18 52+ Millionen Seiten 1+ Milliarde unterschiedliche Terme

19

20

21 Information

22 Weitere Bände

23 Austrian Books Online Delights and Perils

24

25

26 ... und doch, verschiedene Qualitäten

27 OCR: Deutsch

28 OCR: Latein

29 OCR: Ungarisch OCR: Ungarisch

30

31 Beispiel Fraktur (schlechte Qualität): Dis ist das buch der wyszheit der alten wysen von geschlecht der welt.; Bidpai, Person der Antike oder des Mittelalters; Straßburg: Grüninger; 1501 Hainrich; 1618

32

33 Austrian Books Online

34 Austrian Books Online Working with the Data

35 Buchlogistik Digitization Daten-Download ADOCO (Austrian Books Online Download & Control) Storage QA Access

36 Workflow in ADOCO Download package via HTTP Decrypt with gnupg Unzip tarball Md5 sum Store to pairtree Unified Access Pairtree (Symlinks) Update metadata

37 Volume Average per Volume (~Book): 101 MB 101 MB * = 60 TB

38 Image courtesy of The University of Pennsylvania and Michel T. Huber. big data

39 Datenspeicherung & Access Datenspeicherung: inhouse Daten redundant gespeichert Access-Kopien on-the-fly generiert

40 Download und Speicherung ADOCO ABO NAS-Speicher Pair Tree-Algorithmus ca. 60 TB JPEG2000 HOCR METS TXT

41 Pair Tree: ABO NAS +Z ^2/ bz/ 15/ 69/ 41/ 20/ 3/abo/ ONB_+Z xml html jp txt

42 Datenorganisation METS (Metadata Encoding & Transmission Standard) MARC/XML / MODS PREMIS GBS specific metadata Images (JPEG2000) OCR Daten Coordinated OCR plain TXT

43 " uod ſingular. contigit, ut _ - iungantur THARINGO RVM ~ multorum dio coñiunctum eſt. quae hinc Orta eſt, laetitia, RVM GENTIvM prouínciis, ac NI finibus, continetur,º ſed' et in ultimas usque terras terrarum, Data arrangement METS: ONB_+Z xml TEI text/xml ONB_+Z tei Manifest: checksum.md5 Images: JPEG jp2 coordocr: hocr (xhtml) 001.html OCR: text/plain UTF txt

44 METS Reference: Namespaces: xmlns:mets=" xmlns:xlink=" xmlns:gbs=" xmlns:premis="info:lc/xmlns/premis-v2" xmlns:marc="

45 METS Structure METS:mets METS:metsHdr METS:dmdSec METS:amdSec METS:fileSec METS:structMap

46 METS:metsHdr

47 METS:dmdSec

48 METS:amdSec

49 METS:fileSec

50 METS:structMap

51 METS:amdSec METS:techMD production notes (badpages, missing Pages, tightboundpages) method of image production calibration target Definition of gbs:pagetag

52 METS:amdSec METS:digiprovMD production notes (badpages, missing Pages, tightboundpages) method of image production calibration target Definition of gbs:pagetag

53 METS:amdSec METS:sourceMD Source library information METS:digiprovMD PREMIS:premis representation scanning date processing date analyzed date rubbish

54 hocr LhwPcjtAUFwBlzE8EWnKAxlgVf0/

55 Using the data, locally

56 Using the data, cluster Paths: /user/onbfue/input/abo/paths/mets/abo_mets_file_paths.txt /user/onbfue/input/abo/paths/text/abo_text_file_paths.txt /user/onbfue/input/abo/paths/html/abo_html_file_paths.txt Data: /user/onbfue/input/abo/data/html/seqfiles (page level) /user/onbfue/input/abo/data/text/seqfiles (book level)

Integrating the Fedora based DOMS repository with Hadoop

Integrating the Fedora based DOMS repository with Hadoop Integrating the Fedora based DOMS repository with Hadoop Asger Askov Blekinge State and University Library, Denmark SCAPE Information Day State and University Library, Denmark, June 25 th 2014 Our Repositories

More information

Overview Motivation MapReduce/Hadoop in a nutshell Experimental cluster hardware example Application areas at the Austrian National Library

Overview Motivation MapReduce/Hadoop in a nutshell Experimental cluster hardware example Application areas at the Austrian National Library Overview Motivation MapReduce/Hadoop in a nutshell Experimental cluster hardware example Application areas at the Austrian National Library Web Archiving Austrian Books Online SCAPE at the Austrian National

More information

Archives Ready To the AIPs Transmission. PREMIS Implementation Fair. Reminding the ipres2010 Presentation

Archives Ready To the AIPs Transmission. PREMIS Implementation Fair. Reminding the ipres2010 Presentation FONDAZIONE RINASCIMENTO DIGITALE Foundation promoted by Ente Cassa di Risparmio of Florence 7th International Conference on Preservation of Digital Objects (ipres2010) September 19-24, 2010, Vienna, Austria

More information

Introduction. What are online publications?

Introduction. What are online publications? http://conference.ifla.org/ifla77 Date submitted: June 28, 2011 Managing Legal Deposit for Online Publications in Germany Renate Gömpel & Dr. Lars G. Svensson Deutsche Nationalbibliothek Frankfurt, Germany

More information

The Australian War Memorial s Digital Asset Management System

The Australian War Memorial s Digital Asset Management System The Australian War Memorial s Digital Asset Management System Abstract The Memorial is currently developing an Enterprise Content Management System (ECM) of which a Digital Asset Management System (DAMS)

More information

Strategy and Cooperation on Long-term Preservation in the Czech Republic

Strategy and Cooperation on Long-term Preservation in the Czech Republic Strategy and Cooperation on Long-term Preservation in the Czech Republic Bohdana Stoklasova National Library of the Czech Republic bohdana.stoklasova@nkp.cz Content Short introduction of the National Library

More information

Accessing the Deep Web: A Survey

Accessing the Deep Web: A Survey VL Text Analytics Accessing the Deep Web: A Survey Marc Bux, Tobias Mühl Accessing the Deep Web: A Survey, 2007 by Bin He, Mitesh Patel, Zhen Zhang, Kevin Chen Chuan Chang Computer Science Department University

More information

How To Digitise Newspapers On A Computer At Nla.Com

How To Digitise Newspapers On A Computer At Nla.Com Australian Newspapers Digitisation Program Development of the Newspapers Content Management System Rose Holley ANDP Manager ANPlan/ANDP Workshop, 28 November 2008 1 Requirements Manage, store and organise

More information

A Digital Library Feasibility Study

A Digital Library Feasibility Study A Digital Library Feasibility Study C. Henshaw, D. Thompson, M. Savage-Jones Wellcome Library London, UK LIBER Annual Conference Aarhus, Denmark June 2010 Introduction 1. Who we are 2. Vision and strategy

More information

Libraries and Disaster Recovery

Libraries and Disaster Recovery Libraries and Disaster Recovery A Framework for Regional Co-operation in Digital Preservation and Recovery Presentation to CDNLAO Meeting, Tokyo By N Varaprasad, NLB Singapore World Disasters & Impact

More information

Technical concepts of kopal. Tobias Steinke, Deutsche Nationalbibliothek June 11, 2007, Berlin

Technical concepts of kopal. Tobias Steinke, Deutsche Nationalbibliothek June 11, 2007, Berlin Technical concepts of kopal Tobias Steinke, Deutsche Nationalbibliothek June 11, 2007, Berlin 1 Overview Project kopal Ideas Organisation Results Technical concepts DIAS kolibri Models of reusability 2

More information

Specifying the content and formal specifications of document formats for QES

Specifying the content and formal specifications of document formats for QES NATIONAL SECURITY AUTHORITY Version 1.0 Specifying the content and formal specifications of document formats for QES 24 July 2007 No.: 3198/2007/IBEP-013 NSA Page 1/14 This English version of the Slovak

More information

Project PESSIS 2 Title: Social Dialogue in the Social Services Sector in Europe

Project PESSIS 2 Title: Social Dialogue in the Social Services Sector in Europe Project PESSIS 2 Title: Social Dialogue in the Social Services Sector in Europe Location Brussels Date 23 September 2014 Presenter/contact details: Jane Lethbridge, Director, Public Services International

More information

International comparisons of road safety using Singular Value Decomposition

International comparisons of road safety using Singular Value Decomposition International comparisons of road safety using Singular Value Decomposition Siem Oppe D-2001-9 International comparisons of road safety using Singular Value Decomposition D-2001-9 Siem Oppe Leidschendam,

More information

Digitization a precondition to generate new research questions and information products in the digital humanities

Digitization a precondition to generate new research questions and information products in the digital humanities Digitization a precondition to generate new research questions and information products in the digital humanities Dr. Susanne Dobratz e-publishing & digital media Consulting sdobratz@dobratz-consulting.de

More information

MBooks: Google Books Online at the University of Michigan Library

MBooks: Google Books Online at the University of Michigan Library MBooks: Google Books Online at the University of Michigan Library Phil Farber, Chris Powell, Cory Snavely University of Michigan Library Information Technology Architecture overview Four basic pieces:

More information

Digitisation of cultural material Digital Libraries and Copyright Madrid, 12 April 2010

Digitisation of cultural material Digital Libraries and Copyright Madrid, 12 April 2010 Seminar Digitisation of cultural material Digital Libraries and Copyright Madrid, 12 April 2010 www.arrow-net.eu Co-funded by the Community programme econtentplus Origins of the project Inclusion of copyrighted

More information

Long-term archiving and preservation planning

Long-term archiving and preservation planning Long-term archiving and preservation planning Workflow in digital preservation Hilde van Wijngaarden Head, Digital Preservation Department National Library of the Netherlands The Challenge: Long-term Preservation

More information

WESTERNACHER OUTLOOK E-MAIL-MANAGER OPERATING MANUAL

WESTERNACHER OUTLOOK E-MAIL-MANAGER OPERATING MANUAL TABLE OF CONTENTS 1 Summary 3 2 Software requirements 3 3 Installing the Outlook E-Mail Manager Client 3 3.1 Requirements 3 3.1.1 Installation for trial customers for cloud-based testing 3 3.1.2 Installing

More information

METADATA GENERATION FOR CULTURAL HERITAGE

METADATA GENERATION FOR CULTURAL HERITAGE METADATA GENERATION FOR CULTURAL HERITAGE Creative Histories The Josefsplatz Experience Brigitte Krenn, Gregor Sieber, Hans Petschar {brigitte.krenn, gregor.sieber}@ofai.at hans.petschar@onb.ac.at Talk

More information

The Czech Digital Library and Tools for the Management of Complex Digitization Processes

The Czech Digital Library and Tools for the Management of Complex Digitization Processes The Czech Digital Library and Tools for the Management of Complex Digitization Processes Martin LHOTÁK Library of the Academy of Sciences of the Czech Republic lhotak@knav.cz INFORUM 2012: 18th Conference

More information

Fishing for Cyclists. Peter Eich mail@peter-eich.de Radweg Service & Bikemap

Fishing for Cyclists. Peter Eich mail@peter-eich.de Radweg Service & Bikemap Fishing for Cyclists Peter Eich mail@peter-eich.de Radweg Service & Bikemap 450 self guided fully organised bike tours in Europe Peter Eich mail@peter-eich.de Radweg Service & Bikemap 450 self guided

More information

Understanding KVK, the technical base of artlibraries.net

Understanding KVK, the technical base of artlibraries.net Understanding KVK, the technical base of artlibraries.net Uwe Dierolf Library of the Karlsruhe Institute of Technology (KIT) KIT-BIBLIOTHEK KIT Universität des Landes Baden-Württemberg und nationales Forschungszentrum

More information

OCR for historical printings a tutorial

OCR for historical printings a tutorial CIS Dr. Uwe Springmann a tutorial Kolloquium Korpuslinguistik HU Berlin, 23.04.2014 OCR @ CIS: Centrum für Informations- und Sprachverarbeitung OCR group (led by Prof. Dr. Klaus Schulz) has existed for

More information

Scala Storage Scale-Out Clustered Storage White Paper

Scala Storage Scale-Out Clustered Storage White Paper White Paper Scala Storage Scale-Out Clustered Storage White Paper Chapter 1 Introduction... 3 Capacity - Explosive Growth of Unstructured Data... 3 Performance - Cluster Computing... 3 Chapter 2 Current

More information

A Selection of Questions from the. Stewardship of Digital Assets Workshop Questionnaire

A Selection of Questions from the. Stewardship of Digital Assets Workshop Questionnaire A Selection of Questions from the Stewardship of Digital Assets Workshop Questionnaire SECTION A: Institution Information What year did your institution begin creating digital resources? What year did

More information

Open Data Open Government

Open Data Open Government Open Data Open Government Perspective of the Austrian federal level Prague public sector open data meeting Parliament of the Czech Republic, Chamber of Deputies 28th February 2012 Austrian Federal Chancellery

More information

KIM. www.kim-forum.org

KIM. www.kim-forum.org KIM www.kim-forum.org KIM Competence Center for Interoperable Metadata German Translation of DCMES 1.1 Dublin Core Conference 2007, Singapur Christine Frodl (German National Library) Stefanie Ruehle (State

More information

Digitization Workflow of the. Bavarian State Library. Gabriele Messmer. Bavarian State Library. Munich, Germany

Digitization Workflow of the. Bavarian State Library. Gabriele Messmer. Bavarian State Library. Munich, Germany Digitization Workflow of the Bavarian State Library Gabriele Messmer Bavarian State Library Munich, Germany Digitization process at a glance 2 ERaTO ERaTO a tool to create fill in and print an order form

More information

Department of Geological Survey and Mines (DGSM) Republic of Uganda

Department of Geological Survey and Mines (DGSM) Republic of Uganda Department of Geological Survey and Mines (DGSM) Republic of Uganda Dtp. of Geological Survey and Mines Sustainable Management of Mineral Resources Project Establishment of a Modern Documentation Centre

More information

How To Build A Map Library On A Computer Or Computer (For A Museum)

How To Build A Map Library On A Computer Or Computer (For A Museum) Russ Hunt OCLC Tools Managing & Preserving Digitised Map Libraries Keywords: Digital collection management; digital collection access; digital preservation; long term preservation. Summary This paper explains

More information

Bebras Contest An International Contest on Informatics and Computer Fluency for all Secondary School Pupils

Bebras Contest An International Contest on Informatics and Computer Fluency for all Secondary School Pupils Bebras Contest An International Contest on Informatics and Computer Fluency for all Secondary School Pupils Gerald Futschek Vienna University of Technology Austrian Computer Society OCG 1 Aim of Information

More information

Long-term preservation activities of the Bavarian State Library

Long-term preservation activities of the Bavarian State Library Long-term preservation activities of the Bavarian State Library Latest challenges and developments aêk=qüçã~ë=tçäñjhäçëíéêã~åå=== aáöáí~ä=iáäê~êó=aéé~êíãéåí g~åì~êó OSI=OMNM The Bavarian State Library

More information

Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 1]

Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 1] Content 1. Empirical linguistics 2. Text corpora and corpus linguistics 3. Concordances 4. Application I: The German progressive 5. Part-of-speech tagging 6. Fequency analysis 7. Application II: Compounds

More information

Mathematical Risk Analysis

Mathematical Risk Analysis Springer Series in Operations Research and Financial Engineering Mathematical Risk Analysis Dependence, Risk Bounds, Optimal Allocations and Portfolios Bearbeitet von Ludger Rüschendorf 1. Auflage 2013.

More information

NEWS IN A BOX PLUS POINTS. All you need for small newsrooms, production suites and disaster recovery solutions in one box

NEWS IN A BOX PLUS POINTS. All you need for small newsrooms, production suites and disaster recovery solutions in one box NEWS IN A BOX NEWS IN A BOX Xedio I/0 Storage All you need for small newsrooms, production suites and disaster recovery solutions in one box XS NewsFlash is EVS in-a-box concept that has the power to take

More information

Luc Declerck AUL, Technology Services Declan Fleming Director, Information Technology Department

Luc Declerck AUL, Technology Services Declan Fleming Director, Information Technology Department Luc Declerck AUL, Technology Services Declan Fleming Director, Information Technology Department What is cyberinfrastructure? Outline Examples of cyberinfrastructure t Why is this relevant to Libraries?

More information

Call: 08715 900800. Disaster Recovery/Business Continuity (DR/BC) Services From VirtuousIT

Call: 08715 900800. Disaster Recovery/Business Continuity (DR/BC) Services From VirtuousIT Disaster Recovery/Business Continuity (DR/BC) Services From VirtuousIT The VirtuousIT DR/BC solution is designed around RecoveryShield from Thinking SAFE. The service includes a local backup appliance

More information

Company Presentation. Vienna -@ Forum IT Romania - June 2014

Company Presentation. Vienna -@ Forum IT Romania - June 2014 Company Presentation Vienna -@ Forum IT Romania - June 2014 Reliable IT nearshore partner Our view We are the nearshore home for our clients software development and QA teams. Fortech Client Nearshore

More information

Overview of NDNP Technical Specifications

Overview of NDNP Technical Specifications Overview of NDNP Technical Specifications and Philosophy Digitization from preservation microfilm print negatives (2n) provides the most cost-efficient approach for large-scale digitization Distributed

More information

Cisco Physical Access Manager

Cisco Physical Access Manager Data Sheet Cisco Physical Access Manager 1.4.1 Cisco Physical Access Manager is the management application for the Cisco Physical Access Control solution. Cisco Physical Access Manager (Figure 1) is used

More information

WHY DIGITAL ASSET MANAGEMENT? WHY ISLANDORA?

WHY DIGITAL ASSET MANAGEMENT? WHY ISLANDORA? WHY DIGITAL ASSET MANAGEMENT? WHY ISLANDORA? Digital asset management gives you full access to and control of to the true value hidden within your data: Stories. Digital asset management allows you to

More information

Bridging the Gap Between Real World Repositories and Scalable Preservation Environments

Bridging the Gap Between Real World Repositories and Scalable Preservation Environments Bridging the Gap Between Real World Repositories and Scalable Preservation Environments Bolette Ammitzbøll Jurik State and University Library Victor Albecks Vej 1 DK-8000 Aarhus C, Denmark baj@statsbiblioteket.dk

More information

B SVF - Bavaria Long Term Preservation

B SVF - Bavaria Long Term Preservation Klaus Kempf Long Term Preservation: Needs and Activities at the Bavarian State Library (BSB) Agenda BSB s Institutional Profile Munich Digitization Center (MDZ) Current Responsibilities, Milestones, Activities

More information

Offshore outsourcing of business services Threat or Opportunity

Offshore outsourcing of business services Threat or Opportunity Siemens Business Services Offshore outsourcing of business services Threat or Opportunity Presentation by Elie Cohen Chief Executive Officer Siemens Business Services France Agenda for the next 20 minutes

More information

PRESERVATION NEEDS ASSESSMENT PRESERVATION 101

PRESERVATION NEEDS ASSESSMENT PRESERVATION 101 Digital Assets If this section is not applicable to the collection(s) being surveyed, please note that here and move to the next section. Digital collections may include born-digital material and digital

More information

Scientific Library Services and Information Systems (LIS): DFG Practical Guidelines on Digitisation

Scientific Library Services and Information Systems (LIS): DFG Practical Guidelines on Digitisation Deutsche Forschungsgemeinschaft Scientific Library Services and Information Systems (LIS): DFG Practical Guidelines on Digitisation Status: April 2009 acdmê~åíáå~ädìáçéäáåéëçåaáöáíáë~íáçå ÑçêéêçÖê~ããÉëÑìåÇáåÖpÅáÉåíáÑáÅiáÄê~êópÉêîáÅÉë~åÇfåÑçêã~íáçåpóëíÉãëK

More information

Vorarlberger Landes- und Hypothekenbank Aktiengesellschaft

Vorarlberger Landes- und Hypothekenbank Aktiengesellschaft First Supplement dated 21 October 2015 to the Prospectus dated 10 August 2015 This document constitutes a supplement (the "First Supplement") for the purposes of Article 13 of the Luxembourg Law on Prospectuses

More information

PDF/A for scanned documents

PDF/A for scanned documents Webinar PDF/A for scanned documents Paper becomes digital, LuraTech, Armin Ortmann, LuraTech, CTO 2009 PDF/A Competence Center, Existing Solutions for Scanned Documents black/white: TIFF G4 Color: JPEG.

More information

Integration of Hotel Property Management Systems (HPMS) with Global Internet Reservation Systems

Integration of Hotel Property Management Systems (HPMS) with Global Internet Reservation Systems Integration of Hotel Property Management Systems (HPMS) with Global Internet Reservation Systems If company want to be competitive on global market nowadays, it have to be persistent on Internet. If we

More information

Cloud Sync White Paper. Based on DSM 6.0

Cloud Sync White Paper. Based on DSM 6.0 Cloud Sync White Paper Based on DSM 6.0 1 Table of Contents Introduction 3 Product Features 4 Synchronization 5 Architecture File System Monitor (Local change notification) Event/List Monitor (Remote change

More information

Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context

Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context Paul Bevan DAMS Implementation Manager paul.bevan@llgc.org.uk Structure! Background and overview! OAIS Model! Why

More information

Plants Plants Plants Plants Plants

Plants Plants Plants Plants Plants Students Presentation: Nina Wachendorf 1 Structure Company Structure Company Coordination Company Distribution EWC Structure EWC Distribution EWC Coordination EWC Parts Company Structure Corporation Country

More information

Dolby Digital Plus in HbbTV

Dolby Digital Plus in HbbTV Dolby Digital Plus in HbbTV November 2013 arnd.paulsen@dolby.com Broadcast Systems Manager HbbTV Overview HbbTV v1.0 and v1.5 Open platform standard to deliver content over broadcast and broadband for

More information

E-Signatures and E-Procurement

E-Signatures and E-Procurement E-Signatures and E-Procurement Dr. Annette Rosenkötter Rechtsanwältin Dr. Anja Hoffmann Rechtsanwältin FPS Rechtsanwälte und Notare Brüssel, 15.06.2011 Dieser Bericht ist nur für den Empfänger bestimmt.

More information

Discovery of Electronically Stored Information ECBA conference Tallinn October 2012

Discovery of Electronically Stored Information ECBA conference Tallinn October 2012 Discovery of Electronically Stored Information ECBA conference Tallinn October 2012 Jan Balatka, Deloitte Czech Republic, Analytic & Forensic Technology unit Agenda Introduction ediscovery investigation

More information

Privilege Escalation via Antivirus Software

Privilege Escalation via Antivirus Software Privilege Escalation via Antivirus Software A security vulnerability in the software component McAfee Security Agent, which is part of the antivirus software McAfee VirusScan Enterprise, can be leveraged

More information

Module 6 Other OCR engines: ABBYY, Tesseract

Module 6 Other OCR engines: ABBYY, Tesseract Uwe Springmann Module 6 Other OCR engines: ABBYY, Tesseract 2015-09-14 1 / 20 Module 6 Other OCR engines: ABBYY, Tesseract Uwe Springmann Centrum für Informations- und Sprachverarbeitung (CIS) Ludwig-Maximilians-Universität

More information

Tools for text digitisation and transcription

Tools for text digitisation and transcription Tools for text digitisation and transcription Tools for text digitisation and transcription Tomasz Parkoła Poznan Supercomputing and Networking Center CERL annual seminar, 28.10.2014, Oslo, Norway Agenda

More information

Winter and Summer Schools 2015-16

Winter and Summer Schools 2015-16 Winter and Summer Schools 2015-16 Hier finden Sie eine Auswahl von Universitäten, die 2016 Summer Schools anbieten, sortiert nach Fachbereich. Weitere Informationen hierzu finden Sie auf der jeweiligen

More information

THE BRITISH LIBRARY. Unlocking The Value. The British Library s Collection Metadata Strategy 2015-2018. Page 1 of 8

THE BRITISH LIBRARY. Unlocking The Value. The British Library s Collection Metadata Strategy 2015-2018. Page 1 of 8 THE BRITISH LIBRARY Unlocking The Value The British Library s Collection Metadata Strategy 2015-2018 Page 1 of 8 Summary Our vision is that by 2020 the Library s collection metadata assets will be comprehensive,

More information

Echtzeit-Analyse von Social Media Daten mit Jedox und GPU-beschleunigten OLAP Datenbanken

Echtzeit-Analyse von Social Media Daten mit Jedox und GPU-beschleunigten OLAP Datenbanken Echtzeit-Analyse von Social Media Daten mit Jedox und GPU-beschleunigten OLAP Datenbanken Peter Strohm, Offenburg, 05.03.2015 Jedox: In-Memory OLAP Database 2002 Gegründet in Freiburg Heute 100+ Mitarbeiter

More information

Quantum BACKUP. RECOVERY. ARCHIVE. IT S WHAT WE DO.

Quantum BACKUP. RECOVERY. ARCHIVE. IT S WHAT WE DO. Quantum BACKUP. RECOVERY. ARCHIVE. IT S WHAT WE DO. TM Next Generation Archival--StorNext 9 2010 Quantum Corporation. Company Confidential. Forward-looking information is based upon multiple assumptions

More information

Encrypting and signing e-mail

Encrypting and signing e-mail Encrypting and signing e-mail V1.0 Developed by Gunnar Kreitz at CSC, KTH. V2.0 Developed by Pehr Söderman at ICT, KTH (Pehrs@kth.se) V3.0 Includes experiences from the 2009 course V3.1 Adaptation for

More information

Islandora: An Open Source Institutional Repository Solution. Consortium of MnPALS Libraries Annual Meeting April 2014

Islandora: An Open Source Institutional Repository Solution. Consortium of MnPALS Libraries Annual Meeting April 2014 Islandora: An Open Source Institutional Repository Solution Consortium of MnPALS Libraries Annual Meeting April 2014 Outline Introduction to Islandora (Linda) Islandora functionality and demo (Alex) SMSU

More information

FROM COLLABORATIVE DATA EDITING TO LIBRARY CATALOGUES TOWARDS A SHARABLE DATA STRATEGY

FROM COLLABORATIVE DATA EDITING TO LIBRARY CATALOGUES TOWARDS A SHARABLE DATA STRATEGY FROM COLLABORATIVE DATA EDITING TO LIBRARY CATALOGUES TOWARDS A SHARABLE DATA STRATEGY FOR THE GEOBIB PROJECT Frank Binder frank.binder@zmi.uni-giessen.de Leipzig ehumanities Seminar, Jan 8th 2014 Funded

More information

How To Get A Memory Memory Device From A Flash Flash To A Memory Card (Iomemory) For A Microsoft Flash Memory Card From A Microsable Memory Card For A Flash (Ios) For An Iomemories Memory

How To Get A Memory Memory Device From A Flash Flash To A Memory Card (Iomemory) For A Microsoft Flash Memory Card From A Microsable Memory Card For A Flash (Ios) For An Iomemories Memory Storage Memory Platform iomemory VSL iosphere ioturbine directcache iomemory Produkt Vorstellung Jens Mertes Wolfgang Bresgen Solution Architect - CE Sales Manager +49 171 225 8 225 +49 160 97332249 jmertes@fusionio.com

More information

Entrepreneurship education in Germany 1

Entrepreneurship education in Germany 1 Entrepreneurship education in Germany 1 1 OVERVIEW The German educational system is decentralised. Both the functional design and the responsibility for education lie primarily with the federal states

More information

Closed-Loop Engineering Integrated Product Development at a Vehicle Manufacturer

Closed-Loop Engineering Integrated Product Development at a Vehicle Manufacturer Closed-Loop Engineering Integrated Product Development at a Vehicle Manufacturer Dr. Stephan Kohlhoff Geschäftbereich Automotive SAP Deutschland AG & Co KG Agenda Motivation Closed-Loop Engineering Vehicle

More information

Der Aufbau von digitalen Forschungsinfrastrukturen für die Geistes- und Kulturwissenschaften in Österreich

Der Aufbau von digitalen Forschungsinfrastrukturen für die Geistes- und Kulturwissenschaften in Österreich Der Aufbau von digitalen Forschungsinfrastrukturen für die Geistes- und Kulturwissenschaften in Österreich Karlheinz Mörth Institute for Corpus Linguistics and Text Technology Austrian Centre for Digital

More information

Knowledge Base Copyright Law: An innovative Resource for Open Access Archives

Knowledge Base Copyright Law: An innovative Resource for Open Access Archives Knowledge Base Copyright Law: An innovative Resource for Open Access Archives International Conference Open Access to Digital Archives and the Open Knowledge Society, Vienna, 21-22 Oct. 2005 Dr. Michael

More information

Questionnaire on Digital Preservation in Local Authority Archive Services

Questionnaire on Digital Preservation in Local Authority Archive Services Questionnaire on Digital Preservation in Local Authority Archive Services A - Digital Preservation Planning 1. Would you describe your Archive Service as: Actively seeking digital material Reacting to

More information

Preservation Handbook

Preservation Handbook Preservation Handbook Plain text Author Version 2 Date 17.08.05 Change History Martin Wynne and Stuart Yeates Written by MW 2004. Revised by SY May 2005. Revised by MW August 2005. Page 1 of 7 File: presplaintext_d2.doc

More information

EfficientArchive V3.2.3e. albin.brandl@brandl-systemhaus.de

EfficientArchive V3.2.3e. albin.brandl@brandl-systemhaus.de V3.2.3e 1 What s EfficientArchive? EfficientArchive is a backend software provides solution for the data center, the unstructured data (file systems) ensures, archived, and at the same time a quick "Disaster

More information

Austrian Post Investor Day Mail Division. Walter Hitziger, Member of the Management Board

Austrian Post Investor Day Mail Division. Walter Hitziger, Member of the Management Board Austrian Post Investor Day Mail Division Walter Hitziger, Member of the Management Board Mail Division an overview of key indicators Mail Parcel & Logistics Branch Network Group Branch Network 8.5% Parcel

More information

Wharf T&T Cloud Backup Service User & Installation Guide

Wharf T&T Cloud Backup Service User & Installation Guide Wharf T&T Cloud Backup Service User & Installation Guide Version 1.6 Feb 2013 Table of contents BEFORE YOU INSTALL 3 Page Section 1. Installation of Client Software 5 Section 2. Account Activation 8 Section

More information

Digital Preservation Strategy, 2012-2015

Digital Preservation Strategy, 2012-2015 Digital Preservation Strategy, 2012-2015 Preface This digital preservation strategy sets out what the National Library of Wales (NLW) intends to do to preserve digital materials over the next three years.

More information

Less paper less costly way to manage documents! Document and Process Management System

Less paper less costly way to manage documents! Document and Process Management System Less paper less costly way to manage documents! and Process System What is AVILYS (eng. Hive) Processes and documents in each organisation are very closely related. Managing them is not an easy task at

More information

http://cloud.dailymotion.com July 2014

http://cloud.dailymotion.com July 2014 July 2014 Dailymotion Cloud Positioning Two video platforms based on one infrastructure Dailymotion.com DELIVER, SHARE AND MONETIZE YOUR VIDEO CONTENT Online sharing videos platform Dailymotion Cloud CONCRETIZE

More information

Policy Based Encryption Z. Administrator Guide

Policy Based Encryption Z. Administrator Guide Policy Based Encryption Z Administrator Guide Policy Based Encryption Z Administrator Guide Documentation version: 1.2 Legal Notice Legal Notice Copyright 2012 Symantec Corporation. All rights reserved.

More information

How To Manage File Access On Data Ontap On A Pc Or Mac Or Mac (For A Mac) On A Network (For Mac) With A Network Or Ipad (For An Ipad) On An Ipa (For Pc Or

How To Manage File Access On Data Ontap On A Pc Or Mac Or Mac (For A Mac) On A Network (For Mac) With A Network Or Ipad (For An Ipad) On An Ipa (For Pc Or Clustered Data ONTAP 8.3 File Access Management Guide for NFS NetApp, Inc. 495 East Java Drive Sunnyvale, CA 94089 U.S. Telephone: +1 (408) 822-6000 Fax: +1 (408) 822-4501 Support telephone: +1 (888) 463-8277

More information

Renate Gömpel. Germany on Track for International Standards: RDA

Renate Gömpel. Germany on Track for International Standards: RDA Renate Gömpel Germany on Track for International Standards: RDA 1 Deutsche Nationalbibliothek German National Library 2 EURIG-JSC Seminar on RDA, Copenhagen, August 8, 2010 Cooperative library standardization

More information

Facing future users - the challenge of transforming a traditional online database into a Web service

Facing future users - the challenge of transforming a traditional online database into a Web service Purdue University Purdue e-pubs Proceedings of the IATUL Conferences 1999 IATUL Proceedings Facing future users - the challenge of transforming a traditional online database into a Web service Eva Tolonen

More information

BUILDING BLOCKS FOR THE NEW KB E-DEPOT

BUILDING BLOCKS FOR THE NEW KB E-DEPOT FULL LATE-BREAKING PAPER BUILDING BLOCKS FOR THE NEW KB E-DEPOT Hilde van Wijngaarden Judith Rog Peter Marijnen Koninklijke Bibliotheek Prins Willem-Alexanderhof 5 2595 BE The Hague The Netherlands ABSTRACT1

More information

An Experimental Workflow Development Platform for Historical Document Digitisation and Analysis

An Experimental Workflow Development Platform for Historical Document Digitisation and Analysis An Experimental Workflow Development Platform for Historical Document Digitisation and Analysis Clemens Neudecker, Mustafa Dogan, Sven Schlarb (IMPACT) Paolo Missier, Shoaib Sufi, Alan Williams, Katy Wolstencroft

More information

How to translate VisualPlace

How to translate VisualPlace Translation tips 1 How to translate VisualPlace The international language support in VisualPlace is based on the Rosette library. There are three sections in this guide. It starts with instructions for

More information

A Service for Data-Intensive Computations on Virtual Clusters

A Service for Data-Intensive Computations on Virtual Clusters A Service for Data-Intensive Computations on Virtual Clusters Executing Preservation Strategies at Scale Rainer Schmidt, Christian Sadilek, and Ross King rainer.schmidt@arcs.ac.at Planets Project Permanent

More information

Supplement No. 2 dated 25 September 2013 to the Base Prospectus for Equity linked Notes and Certificates dated 27 June 2013

Supplement No. 2 dated 25 September 2013 to the Base Prospectus for Equity linked Notes and Certificates dated 27 June 2013 Supplement No. 2 dated 25 September to the Base Prospectus for Equity linked Notes and Certificates dated 27 June MORGAN STANLEY & CO. INTERNATIONAL PLC (incorporated with limited liability in England

More information

Automatic updates for Websense data endpoints

Automatic updates for Websense data endpoints Automatic updates for Websense data endpoints Topic 41102 / Updated: 25-Feb-2014 Applies To: Websense Data Security v7.6, v7.7.x, and v7.8 Endpoint auto-update is a feature that lets a network server push

More information

The Institutional Repository at West Virginia University Libraries: Resources for Effective Promotion

The Institutional Repository at West Virginia University Libraries: Resources for Effective Promotion The Institutional Repository at West Virginia University Libraries: Resources for Effective Promotion John Hagen Manager, Electronic Institutional Document Repository Programs West Virginia University

More information

Enabling a data management system to support the good laboratory practice Masterthesis Status Report Miriam Ney (13.01.

Enabling a data management system to support the good laboratory practice Masterthesis Status Report Miriam Ney (13.01. Enabling a data management system to support the good laboratory practice Masterthesis Status Report Miriam Ney (13.01.2011) Folie 1 Statusreport Masterthesis > Miriam Ney > 13.01.2011 Overview Description

More information

HDFS: Hadoop Distributed File System

HDFS: Hadoop Distributed File System Istanbul Şehir University Big Data Camp 14 HDFS: Hadoop Distributed File System Aslan Bakirov Kevser Nur Çoğalmış Agenda Distributed File System HDFS Concepts HDFS Interfaces HDFS Full Picture Read Operation

More information

Mass Digitization of Manuscripts and Rare Books: Challenges and Experiences at Bavarian State Library

Mass Digitization of Manuscripts and Rare Books: Challenges and Experiences at Bavarian State Library Mass Digitization of Manuscripts and Rare Books: Challenges and Experiences at Bavarian State Library Dr. Markus Brantl 1. The Bavarian State Library (BSB) 1. Institute for Book and Manuscript ConServation

More information

Wer steuert die europäische Forschungspolitik?

Wer steuert die europäische Forschungspolitik? Hochschulrektorenkonferenz Strasbourg 2010 Wer steuert die europäische Forschungspolitik? Dieter Imboden Präsident des Forschungsrates des SNF und von EUROHORCs Es gibt in der europäischen Forschungslandschaft

More information