Hands-on tutorial: Using Praat for analysing a speech corpus. Mietta Lennes Palmse, Estonia

Hands-on tutorial: Using Praat for analysing a speech corpus
Mietta Lennes
12.-13.8.2005, Palmse, Estonia
Department of Speech Sciences, University of Helsinki

Objectives
Lecture: Understanding what speech annotation means
  efficient annotation
  theoretical pitfalls
Exercises: Learning to use Praat for annotating speech
  basic techniques and analysis displays
  incremental annotation
Exercises: Using simple Praat scripts to analyse a small annotated speech corpus
  understanding basic acoustic analyses
  running and editing scripts

Annotation
Annotation generally means describing, classifying and organizing (speech) material by systematically adding symbolic labels to its parts.
The analyses you will be able to perform are restricted by the accuracy and types of annotations you have for your corpus.
To date, no automatic speech segmentation or recognition tool exists for any language that can perform as well as a human annotator.

Transcripts are not annotations as such. Annotations and transcripts are not data.

Multiple annotation layers
[Figure: examples of various kinds of annotation layers]

Prerequisites for annotating and analysing a speech corpus
Signal files in a format readable by the annotation tool (Praat: WAV, AIFF, AIFC, NeXT/Sun, NIST; 16- or 8-bit).
Sufficiently high signal-to-noise ratio.
Different speakers should preferably be separated into different audio files (crosstalk is difficult to annotate).
High acoustic quality is required for complex acoustic analyses (e.g., formant modeling).
If studying speech and interaction, there should be a common timeline for all audio/video/other signal files.
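The first prerequisite can be checked before annotation starts. The following is a minimal sketch that uses Python's standard `wave` module rather than Praat itself; the function names `describe_wav` and `make_test_wav` are hypothetical, and the rule that only 8- or 16-bit files pass is an assumption taken from the list above.

```python
import io
import wave


def describe_wav(source):
    """Return basic format facts for a WAV file (path or file-like object)."""
    with wave.open(source, "rb") as wav:
        bits = wav.getsampwidth() * 8  # the sample width is reported in bytes
        return {
            "channels": wav.getnchannels(),
            "sample_rate_hz": wav.getframerate(),
            "bits_per_sample": bits,
            "duration_s": wav.getnframes() / wav.getframerate(),
            # Assumption: only 8- and 16-bit files are accepted, as listed above.
            "ok_for_annotation": bits in (8, 16),
        }


def make_test_wav(seconds=1.0, rate=16000):
    """Build a silent 16-bit mono WAV in memory, for demonstration only."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes = 16 bits
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(seconds * rate))
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    print(describe_wav(make_test_wav()))
```

In practice `describe_wav` would be called with the path of each corpus file. The sketch reads WAV only, so the other formats in the list would need a different reader.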

Planning an annotation project
Annotation is boring and time-consuming -> you should make sure it is worth all the work!
Annotation should help to run analyses automatically and to reduce the need for manually browsing through your corpus.
Explore and practise with a small sample of your material first, then complete your annotations.
What are you aiming to study?

Remember...
Speech communication is much more than an acoustic form of writing.
Writing things down in a specific notation and carefully classifying them does not make these things or the categories any more real.
All units that you plan to annotate tend to be fuzzy when you try to find them in real speech: the temporal boundaries are unclear, the different categories are sometimes difficult to separate, etc.

Annotation and the Human Factor...

Defining your annotation structure
List your units: what kind of labels are allowed?
What kind of properties do your units have? Which values are allowed for the properties?
How many layers (tiers) of annotation do you need?
You should understand how the use of these units, labels and tiers can help you to automatically analyse your material in a consistent way.
Do not waste time labeling things that can be measured automatically (e.g., typing pause durations as labels into a TextGrid)!
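To illustrate the last point, pause durations follow directly from the interval boundaries, so they never need to be typed in as labels. The sketch below is in Python rather than Praat script. The `Interval` structure, the function name `pause_durations`, and the convention that an empty label marks a pause are all assumptions made for the example.

```python
from typing import List, NamedTuple


class Interval(NamedTuple):
    start: float  # seconds
    end: float    # seconds
    label: str    # assumption: an empty label marks a pause


def pause_durations(tier: List[Interval], min_duration: float = 0.0) -> List[float]:
    """Measure pauses from interval boundaries instead of labelling them by hand."""
    return [
        round(iv.end - iv.start, 3)
        for iv in tier
        if iv.label.strip() == "" and (iv.end - iv.start) >= min_duration
    ]


# A hypothetical word tier with two unlabelled (pause) intervals.
words = [
    Interval(0.00, 0.35, ""),
    Interval(0.35, 0.80, "hello"),
    Interval(0.80, 1.05, ""),
    Interval(1.05, 1.60, "world"),
]
print(pause_durations(words))
```

The `min_duration` parameter is one possible way to leave out very short silent stretches, such as stop closures, if they happen to be unlabelled.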

Multiple annotation layers: Word units in search focus

Multiple annotation layers: Phone units in search focus
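A search across layers can be sketched as a time-span lookup: find a word on the word tier, then collect the phone intervals that fall inside it. This is only an illustration in Python. The tuple layout, the function names, the tolerance `eps`, and the example labels are all hypothetical.

```python
from typing import List, Tuple

Interval = Tuple[float, float, str]  # (start_s, end_s, label)


def find_words(word_tier: List[Interval], target: str) -> List[Interval]:
    """Return every interval on the word tier whose label equals the target."""
    return [iv for iv in word_tier if iv[2] == target]


def phones_in_word(word: Interval, phone_tier: List[Interval]) -> List[str]:
    """Return labels of phone intervals lying inside the word's time span."""
    w_start, w_end, _ = word
    eps = 1e-6  # tolerate tiny boundary mismatches between tiers
    return [
        label
        for start, end, label in phone_tier
        if start >= w_start - eps and end <= w_end + eps
    ]


# Hypothetical two-tier annotation of the utterance "talo on".
word_tier = [(0.00, 0.45, "talo"), (0.45, 0.70, "on")]
phone_tier = [
    (0.00, 0.08, "t"), (0.08, 0.20, "a"), (0.20, 0.30, "l"), (0.30, 0.45, "o"),
    (0.45, 0.58, "o"), (0.58, 0.70, "n"),
]

for word in find_words(word_tier, "on"):
    print(word[2], phones_in_word(word, phone_tier))
```

The lookup relies on the tiers sharing a timeline and on phone boundaries coinciding with word boundaries. Phones that straddle a word boundary are skipped in this sketch.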

Metadata
It is important to gather sufficiently detailed metadata about the speech material (speakers and their background, recording conditions, etc.).
Metadata can also be used when analysing the corpus! E.g., the speakers' sex and age are factors that tend to affect their linguistic behaviour.
(If a speech database system is not available, you can encode information about the speakers, e.g., into the filenames.)
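If metadata is encoded into filenames, a fixed naming scheme lets an analysis script recover it. The scheme below (`<speaker>_<sex>_<age>_<session>.wav`) is purely hypothetical and does not come from the tutorial; the sketch only shows the idea, in Python.

```python
import re
from typing import Dict, Optional

# Hypothetical naming scheme: <speakerID>_<sex>_<age>_<session>.wav
FILENAME_PATTERN = re.compile(
    r"^(?P<speaker>[A-Za-z0-9]+)_(?P<sex>[fm])_(?P<age>\d{1,3})_(?P<session>[A-Za-z0-9]+)\.wav$"
)


def parse_metadata(filename: str) -> Optional[Dict[str, object]]:
    """Extract speaker metadata from a filename, or None if it does not fit the scheme."""
    match = FILENAME_PATTERN.match(filename)
    if match is None:
        return None
    fields: Dict[str, object] = dict(match.groupdict())
    fields["age"] = int(match.group("age"))  # store the age as a number for later statistics
    return fields


print(parse_metadata("sp01_f_34_dialogue2.wav"))
print(parse_metadata("notes.txt"))
```

Files that do not match the scheme return `None`, so misnamed files can be caught before they enter the analysis.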

Why choose Praat for analysing your corpus?
Widely used, well known, well maintained
Easily installed on multiple platforms
Scriptable
All Praat scripts and files can be made fully portable from one system to another.
With Praat, you can use your corpus almost anywhere!

Why not to use Praat
Video annotation must be done with another tool.
Praat does not include a proper database system as such, so searching a speech corpus with Praat must be implemented through Praat scripts (which can become painfully slow).
Recommended: If your corpus is large, use Praat (scripts) to dump your annotations and acoustic analysis results to a suitable format and do the searching and statistics somewhere else.
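One possible shape for such a dump is a flat table with one row per labelled interval. The tutorial recommends doing the export with Praat scripts; the sketch below instead reads an already saved TextGrid in Praat's long text format with Python, which is an assumption made for illustration. The function names are hypothetical and the regular-expression parser is deliberately minimal: it handles interval tiers only, skips point tiers, and expects text that has already been decoded to a string.

```python
import csv
import io
import re

# One interval in the long TextGrid text format: index line, xmin, xmax, text.
INTERVAL_RE = re.compile(
    r'intervals \[\d+\]:\s*xmin = ([\d.eE+-]+)\s*xmax = ([\d.eE+-]+)'
    r'\s*text = "((?:[^"]|"")*)"'
)
TIER_NAME_RE = re.compile(r'name = "((?:[^"]|"")*)"')


def textgrid_to_rows(text):
    """Return (tier, start, end, label) tuples for every interval in the TextGrid."""
    rows = []
    # Each tier starts with a header such as "item [1]:"; the file header precedes it.
    for chunk in re.split(r"item \[\d+\]:", text)[1:]:
        name_match = TIER_NAME_RE.search(chunk)
        tier = name_match.group(1) if name_match else ""
        for m in INTERVAL_RE.finditer(chunk):
            label = m.group(3).replace('""', '"')  # a doubled quote stands for one quote
            rows.append((tier, float(m.group(1)), float(m.group(2)), label))
    return rows


def rows_to_csv(rows):
    """Serialise the rows as CSV text with a header line."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["tier", "start", "end", "label"])
    writer.writerows(rows)
    return out.getvalue()


SAMPLE = '''File type = "ooTextFile"
Object class = "TextGrid"

xmin = 0
xmax = 1.5
tiers? <exists>
size = 1
item []:
    item [1]:
        class = "IntervalTier"
        name = "words"
        xmin = 0
        xmax = 1.5
        intervals: size = 2
        intervals [1]:
            xmin = 0
            xmax = 0.5
            text = ""
        intervals [2]:
            xmin = 0.5
            xmax = 1.5
            text = "hello"
'''

print(rows_to_csv(textgrid_to_rows(SAMPLE)))
```

The resulting table can be loaded into a spreadsheet, a statistics package or a database, where searching a large corpus is likely to be faster than looping over TextGrids in a script.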

Links
Praat: http://www.praat.org
Praat scripts: http://www.helsinki.fi/~lennes/praat-scripts/
Linguistic annotation (tools and formats): http://www.ldc.upenn.edu/annotation/
Annotation guide (in Finnish; a public draft version): http://www.helsinki.fi/~lennes/nimikointiopas.html
An RDF/XML Schema for formally defining your annotation structure, e.g., in your own applications: http://www.csc.fi/kielipankki/projektit/sapuhe/