Automated Lecture Transcription


Automated Lecture Transcription
Brandon Muramatsu, mura@mit.edu
MIT, Office of Educational Innovation and Technology
(Really, I'm just moonlighting as an OCWC Staffer)

Citation: Muramatsu, B. (2009). Automated Lecture Transcription. Presented at the OpenCourseWare Global Meeting. Monterrey, Mexico. April 22, 2009.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License (creativecommons.org/licenses/by-nc-sa/3.0/us/)

Motivation
- MIT OCW 8.01: Professor Lewin puts his life on the line in Lecture 11 by demonstrating his faith in the Conservation of Mechanical Energy.
- More & more academic videos on the Web
  - Universities recording lectures
  - Cultural organizations interviewing experts

Motivation
- Challenges
  - Volume
  - Search
  - Accessibility

Research: Spoken Lecture Project
James Glass, glass@mit.edu
- Speech recognition & automated transcription of lectures
- Why lectures?
  - Conversational, spontaneous, starts/stops
  - Different from broadcast news, other types of speech recognition
  - Specialized vocabularies

Research: Spoken Lecture Project
James Glass, glass@mit.edu
- Processor, browser, workflow
  - web.sls.csail.mit.edu/lectures/
- Prototyped with lecture & seminar video
  - MIT OCW (~300 hours, lectures)
  - MIT World (~80 hours, seminar speakers)
- Supported with iCampus MIT/Microsoft Alliance funding

What problems are we trying to solve? For Learners? For Content Producers?
- Finding (primary)
  - Content in videos (text metadata)
  - Specific phrase in video (via transcript)
  - Specific concept in video
- Facilitating (secondary)
  - Accessibility (closed captioning)
  - Translations
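As an illustration of "specific phrase in video (via transcript)", here is a minimal sketch of phrase lookup over a time-aligned transcript. It is not the Spoken Lecture Browser's implementation: the `Segment` type, the `find_phrase` function, and the sample transcript text are all hypothetical, and the only assumption is that the recognizer outputs text segments with start and end times.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One time-aligned piece of a transcript (hypothetical structure)."""
    start: float  # seconds from the start of the video
    end: float    # seconds from the start of the video
    text: str


def find_phrase(segments, phrase):
    """Return the start times of all segments whose text contains the phrase.

    Matching is case-insensitive and ignores differences in whitespace.
    """
    needle = " ".join(phrase.lower().split())
    hits = []
    for seg in segments:
        haystack = " ".join(seg.text.lower().split())
        if needle in haystack:
            hits.append(seg.start)
    return hits


# Invented sample data, for illustration only.
segments = [
    Segment(0.0, 6.5, "Today we look at the conservation of mechanical energy"),
    Segment(6.5, 12.0, "I will release this pendulum from my chin"),
    Segment(12.0, 18.0, "Mechanical energy is conserved so it cannot swing higher"),
]

print(find_phrase(segments, "mechanical energy"))  # [0.0, 12.0]
```

A video player could seek to each returned start time, which is what makes a transcript useful for search inside a video rather than only across videos.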

Transition: Towards a Lecture Transcription Service
- Develop a prototype production service
  - MIT, University of Queensland
  - Engage external partners (hosted service?, community?)
- Requirements gathering
  - Internal MIT customers (OCW, AMPS)
  - External (OpenCast, UC Berkeley, Others)

MIT Projects/Customers
- OpenCourseWare (Production support)
  - Existing videos & audio, new video
  - Lecture notes, slides, etc. for domain model
  - Multiple videos/audio by same lecturer for speaker model
  - Diverse topics/disciplines
  - Improve search and retrieval (more granularity)
  - English transcripts can facilitate translation
- MIT 150th Celebration (AMPS)
  - Highly produced, individual speakers
  - Full transcripts available
  - Facilitate search
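One way to picture how lecture notes and slides could feed a domain model is to list the terms in the supporting text that a general-purpose vocabulary lacks. The sketch below does only that and is an assumption about the general idea, not the project's actual method: the `candidate_terms` function, the `base_vocabulary` set, and the sample notes are hypothetical.

```python
import re
from collections import Counter


def candidate_terms(notes_text, base_vocabulary, min_count=2):
    """List words from the notes that are missing from a base vocabulary.

    Returns (word, count) pairs, most frequent first, for words that occur
    at least min_count times.
    """
    words = re.findall(r"[a-z]+(?:'[a-z]+)?", notes_text.lower())
    counts = Counter(w for w in words if w not in base_vocabulary)
    return [(w, c) for w, c in counts.most_common() if c >= min_count]


# Invented sample data, for illustration only.
notes = ("Each eigenvalue of the matrix has an eigenvector. "
         "The eigenvalue scales the eigenvector.")
base_vocabulary = {"each", "of", "the", "matrix", "has", "an", "scales"}

print(candidate_terms(notes, base_vocabulary))
# [('eigenvalue', 2), ('eigenvector', 2)]
```

Terms found this way are the kind of specialized vocabulary that a recognizer trained on general speech is likely to miss.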

External Customers/Interest
- University of Queensland
  - Lecture podcasting
  - 25 years of interviews with world-class scientists, Australian Broadcasting Company
- UC Berkeley
  - Lecture podcasting, 500+ hours of new content per term
  - Improve search and retrieval
- OpenCast Project (www.opencast.org)
  - Extend generic podcast production workflow
- Harvard University Extension
  - 100th Anniversary

Lecture Transcription Workflow

Demo
- Spoken Lecture Browser
  - web.sls.csail.mit.edu/lectures
  - Requires Real Player 10
- Alternate UI, Google Audio Indexing
  - labs.google.com/gaudi
  - U.S. political coverage (2008 elections, CSPAN)

Spoken Lecture Browser
web.sls.csail.mit.edu/lectures

A Lecture Transcription Service?
- Under consideration
- Limitations (anticipated, may change)
  - Lecture-style content (technology optimized)
  - Approximately 80% accuracy
  - Probably NOT a full accessibility solution
  - Other languages? (not sure)
- Browser open-sourced (expected)
- Processing hosted/limited to MIT (current thinking)
  - So jobs will be submitted via an MIT-run service
- Audio extract, domain models and transcripts available (donated) for further research
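The slides do not say how the "approximately 80% accuracy" figure was measured. A common convention in speech recognition is word accuracy, computed as 1 minus the word error rate (WER), where WER is the word-level edit distance between a reference transcript and the recognizer output, divided by the number of reference words. The sketch below assumes that convention; the `word_error_rate` function and the sample sentences are hypothetical.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words.

    Substitutions, deletions, and insertions each count as one error.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must not be empty")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


# Invented sample data: one substitution and one deletion in ten words.
reference = "energy is conserved so the ball returns to its height"
hypothesis = "energy is conserve so the ball returns to height"

wer = word_error_rate(reference, hypothesis)
print(f"WER: {wer:.0%}, word accuracy: {1 - wer:.0%}")
# WER: 20%, word accuracy: 80%
```

At this error level roughly one word in five is wrong, which is consistent with the slide's caution that the output is probably not a full accessibility solution while still being usable for search.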

Thanks!
Brandon Muramatsu, mura@mit.edu
MIT, Office of Educational Innovation and Technology