Crowdsourcing manuscript transcription: the Transcribe Bentham project




Martin Moyle, Justin Tonra, Valerie Wallace
UCL (University College London)
m.moyle@ucl.ac.uk
LIBER 2010, Aarhus, 29 June to 1 July 2010

Overview
  About Transcribe Bentham
  The transcription interface
  Encouraging participation
  Expected outcomes
  Next steps

Transcribe Bentham
  A 1-year project (from April 2010) harnessing the power of crowdsourcing to facilitate the transcription of 12,500 Jeremy Bentham manuscripts.
  Crowdsourcing: Taking tasks traditionally performed by an employee or contractor, and outsourcing them to a group of people or community, through an "open call" to a large group of people (a crowd) asking for contributions. [Wikipedia]

Project origins
  60,000 manuscripts of the philosopher and jurist Jeremy Bentham (1748-1832) held in UCL Library
    Fully catalogued (http://www.benthampapers.ucl.ac.uk)
  UCL Bentham Project
    Producing a complete scholarly edition of Bentham
    Began 1959; 26 volumes now published, from a projected 68
  20,000 Bentham manuscripts previously transcribed
    To varying degrees of quality; no standard markup
  The majority of the manuscripts are untranscribed and unstudied

Project aims (1)
  Digitise 12,500 previously unread Bentham manuscripts
  Create a public transcription interface, with appropriate training tools, enabling crowdsourced TEI-encoded transcription
  Promote the project to specific target communities of volunteer transcribers
  Retrospectively convert existing transcripts to TEI

Project aims (2)
  Develop a web-based Ideas Bank, based on the transcripts
  Carry out log analysis and a user study on public interaction with the project
  Roll out a generic TEI transcription tool, for use by other transcription projects and services
  Long-term digital curation of digitised MSS and TEI transcripts in the UCL Library Services repository

Project partners
  UCL Bentham Project
  UCL Centre for Digital Humanities
  UCL Library Services
  University of London Computing Centre
  Arts and Humanities Research Council
  Jeremy Bentham present, not voting...

Project components overview...

[Diagram: project components. Labels shown: Sources; Manuscripts; Images; Metadata; Folio catalogue; Legacy transcripts; Retro-conversion to TEI; Digital repository; Project website; Web pages; Blog; Ideas bank; Transcription wiki; Registration; Transcription tool; Training materials; Discussion forum; Quality assurance; Project editors; TEI transcripts; Collected Works]

Interface design: some challenges
  Transcription is hard!
    Legibility; additions, deletions, marginal notes...
  TEI markup is complex for beginners
  Quality assurance is expensive, but to demand high quality from volunteers would be unrealistic
  Wiki environment may alienate some participants

Technical challenges: steps taken
  Help and guidance in different formats (web pages, video tutorials), and aimed at beginners
  Users shielded from the underlying complexity
  Accurate transcription, even with no markup, is welcomed
  Users can begin to add markup as confidence grows
  Site is being user-tested and soft-launched
  Digitisation focusing on earlier, more legible MSS

The Transcription Desk (beta)

Transcription window

Transcription window: magnifying viewer, toolbar...

TEI Toolbar
  Hiding complexity
  Buttons: Line Break, Paragraph, Addition, Deletion, Unclear Reading, Illegible Text, Note, Underline, Unusual Spelling, Foreign Language, Ampersand, Em Dash, User Comment
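To illustrate how a toolbar of this kind can hide TEI complexity, here is a minimal Python sketch in which each button either wraps the selected text in an element or inserts a fixed piece of markup. The slides do not say which markup each button produces; the mapping below is an assumption based on standard TEI P5 elements, and the names `WRAPPERS`, `INSERTS` and `apply_button` are hypothetical. The "User Comment" button is left out because its output cannot be inferred from the slides.

```python
# Hypothetical mapping from toolbar buttons to TEI markup.
# ASSUMPTION: element choices follow standard TEI P5 usage; the
# slides do not document the markup the real toolbar emits.
WRAPPERS = {
    "Paragraph": ("<p>", "</p>"),
    "Addition": ("<add>", "</add>"),
    "Deletion": ("<del>", "</del>"),
    "Unclear Reading": ("<unclear>", "</unclear>"),
    "Note": ("<note>", "</note>"),
    "Underline": ('<hi rend="underline">', "</hi>"),
    "Unusual Spelling": ("<sic>", "</sic>"),
    "Foreign Language": ("<foreign>", "</foreign>"),
}

# Buttons that insert fixed markup instead of wrapping a selection.
INSERTS = {
    "Line Break": "<lb/>",
    "Illegible Text": "<gap/>",
    "Ampersand": "&amp;",
    "Em Dash": "&#x2014;",
}


def apply_button(text: str, start: int, end: int, button: str) -> str:
    """Apply a toolbar button to the selection text[start:end].

    Wrapping buttons surround the selection with tags; inserting
    buttons replace the selection (or fill an empty cursor position)
    with fixed markup.
    """
    if button in WRAPPERS:
        open_tag, close_tag = WRAPPERS[button]
        return text[:start] + open_tag + text[start:end] + close_tag + text[end:]
    if button in INSERTS:
        return text[:start] + INSERTS[button] + text[end:]
    raise KeyError(f"Unknown toolbar button: {button}")


if __name__ == "__main__":
    # Mark the word "greatest" (characters 4 to 12) as deleted.
    print(apply_button("the greatest happiness", 4, 12, "Deletion"))
```

With this design a volunteer only selects text and clicks a button, and never has to type an angle bracket.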

Completed transcript... TEI code rendered as HTML
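As a rough illustration of rendering TEI code as HTML, the Python sketch below walks a small TEI fragment and swaps each element for an HTML counterpart. It is not the project's rendering code: the element-to-HTML mapping, the CSS class names and the function names `render` and `tei_to_html` are all assumptions, and the fragment is assumed to be un-namespaced (real TEI documents normally declare the TEI namespace).

```python
import xml.etree.ElementTree as ET
from html import escape

# ASSUMPTION: illustrative mapping from TEI elements to HTML.
HTML_TAGS = {
    "p": ("<p>", "</p>"),
    "add": ("<ins>", "</ins>"),
    "del": ("<del>", "</del>"),
    "unclear": ('<span class="unclear">', "</span>"),
    "hi": ("<u>", "</u>"),
    "note": ('<span class="note">', "</span>"),
}

# Empty TEI elements rendered as fixed HTML.
EMPTY_TAGS = {
    "lb": "<br/>",
    "gap": '<span class="gap">[illegible]</span>',
}


def render(node: ET.Element) -> str:
    """Recursively render one TEI element, plus the text that follows it."""
    if node.tag in EMPTY_TAGS:
        html = EMPTY_TAGS[node.tag]
    else:
        # Unknown elements are rendered as their content, without tags.
        open_tag, close_tag = HTML_TAGS.get(node.tag, ("", ""))
        inner = escape(node.text or "") + "".join(render(c) for c in node)
        html = open_tag + inner + close_tag
    return html + escape(node.tail or "")


def tei_to_html(tei_fragment: str) -> str:
    """Parse a well-formed, un-namespaced TEI fragment and return HTML."""
    return render(ET.fromstring(tei_fragment))


if __name__ == "__main__":
    sample = "<p>the <del>greatest</del> <add>general</add> happiness<lb/>of all</p>"
    print(tei_to_html(sample))
```

Keeping the mapping in plain dictionaries means the display can be changed without touching the stored TEI transcripts.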

Help pages...

Profiles for registered users

Long-term access/preservation via Library repository

Sourcing crowds
  Three target audiences
    Schools
      Teachers nationally, especially 16-18 year-old level
      Local schools, building on UCL's outreach links
    Academics
      Educators in palaeography, research methods etc
      Scholars in economic and social history, digital humanities, etc
    Amateur historians, enthusiasts, and general public
  Different communications strategies in place for each group

Encouraging participation
  Targeting each group involves a combination of activities
    Workshops, classes and presentations; paid-for advertisements in relevant print publications (e.g. History Today); approaches to disciplinary and professional bodies (e.g. IHR); press releases...
  Careful planning required
    Publication lead times; academic cycle; short project!
  Web 2.0 activity...

Outcomes and impact
  Stimulation of public engagement with scholarly archives and manuscript transcription
  Opening up Bentham's thought to new audiences
    Policy makers, media, public
  Creation of an open access, digitally-preserved resource for scholars
  Availability of a re-usable, user-tested transcription tool for future projects and services
  How do users interact with digital resources?
    Quantitative and qualitative data to help best practice

Progress / next steps
  Digitisation began April 2010
  Transcription Desk (beta) in user testing
    Soft launch, ~20 testers, July 2010
  Official launch August 2010
  Publicity campaigns begin August 2010
  Final report and user study May 2011

Thank you
  http://www.ucl.ac.uk/transcribe-bentham
  m.moyle@ucl.ac.uk