CIVIL Corpus: Voice Quality for Forensic Speaker Comparison



Similar documents
A. Batliner, S. Burger, B. Johne, A. Kieling. L.M.-Universitat Munchen. F.{A.{Universitat Erlangen{Nurnberg

Speech Signal Processing: An Overview

APPLYING MFCC-BASED AUTOMATIC SPEAKER RECOGNITION TO GSM AND FORENSIC DATA

Establishing the Uniqueness of the Human Voice for Security Applications

Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA

Pronunciation in English

Formant Bandwidth and Resilience of Speech to Noise

Perceived Speech Quality Prediction for Voice over IP-based Networks


Emotion Detection from Speech

Intonation difficulties in non-native languages.

ARTICLE. Sound in surveillance Adding audio to your IP video solution

The effect of mismatched recording conditions on human and automatic speaker recognition in forensic applications

Understanding Impaired Speech. Kobi Calev, Morris Alper January 2016 Voiceitt

Curriculum Vitae. Alison M. Trude December Website: Department of Psychology, University of Chicago

Phonetic and phonological properties of the final pitch accent in Catalan declaratives

Program curriculum for graduate studies in Speech and Music Communication

SPEAKER RECOGNITION OF DISGUISED VOICES: A PROGRAM FOR RESEARCH

Your personalised, online solution to meeting ICAO English language proficiency requirements

Tutorial about the VQR (Voice Quality Restoration) technology

Innkeeper PBX. Desktop Digital Hybrid. User Guide. JK Audio

Why major in linguistics (and what does a linguist do)?

SASSC: A Standard Arabic Single Speaker Corpus

Efficient diphone database creation for MBROLA, a multilingual speech synthesiser

Reading Assistant: Technology for Guided Oral Reading

THE VOICE OF LOVE. Trisha Belanger, Caroline Menezes, Claire Barboa, Mofida Helo, Kimia Shirazifard

AI Audio 2 (SoundMAX High Definition Audio utility)

Speech-Language Pathology Curriculum Foundation Course Linkages

What Is Linguistics? December 1992 Center for Applied Linguistics

Q. I know about connecting things to my TV and DVD Players. Show me a quick way to get started?

The Pronunciation of the Aspirated Consonants P, T, and K in English by Native Speakers of Spanish and French

Glossary of key terms and guide to methods of language analysis AS and A-level English Language (7701 and 7702)

Study Plan for Master of Arts in Applied Linguistics

Turkish Radiology Dictation System

COURSES IN ENGLISH AND OTHER LANGUAGES AT THE UNIVERSITY OF HUELVA (update: 24 th July 2014)

PERCENTAGE ARTICULATION LOSS OF CONSONANTS IN THE ELEMENTARY SCHOOL CLASSROOMS

Formant frequencies of British English vowels produced by native speakers of Farsi

A CHINESE SPEECH DATA WAREHOUSE

Thirukkural - A Text-to-Speech Synthesis System

SF16-FMD. 16-Bit 3-D Sound Board with FM Radio. User Manual

innkeeper PBX Desktop Digital Hybrid User Guide JK Audio

innkeeper PBX Desktop Digital Hybrid User Guide JK Audio Warranty

Dialects. Dialects: Regional Varieties of English. Language Varieties. What is a dialect?

L2 EXPERIENCE MODULATES LEARNERS USE OF CUES IN THE PERCEPTION OF L3 TONES

Luis Bonilla, Ph.D. Curriculum Vitae. 124 Sunnyside Park Rd. Syracuse, NY

GRADUATE CURRICULUM ON VOICE AND VOICE DISORDERS ASHA Special Interest Division 3, Voice and Voice Disorders

Technical Training Seminar on Troubleshooting the Triple Play Services for CCTA Member Companies August 24, 25, 26, 2010 San Juan, Puerto Rico

Master programmes. Documents communications in Russia and abroad Philosophy Philosophy of I. Kant and modern Neokantianism

M.A. Hispanic Linguistics. University of Arizona: Tucson, AZ.

SPEECH OR LANGUAGE IMPAIRMENT EARLY CHILDHOOD SPECIAL EDUCATION

HomePlug adapters for transferring music and speech over your household power circuit

Cisco IPICS Push-to-Talk Management Center

Digitizing Sound Files

Revolabs Audio S olutions Solutions Hear Every Word!

innkeeper PBX Desktop Digital Hybrid User Guide JK Audio Warranty

Between voicing and aspiration

Spanish. In the College of Arts and Letters

Bachelors of Science Program in Communication Disorders and Sciences:

The Effect of Long-Term Use of Drugs on Speaker s Fundamental Frequency

Guidelines for ToBI Labelling (version 3.0, March 1997) by Mary E. Beckman and Gayle Ayers Elam

Audio processing and ALC in the FT-897D

CHARTES D'ANGLAIS SOMMAIRE. CHARTE NIVEAU A1 Pages 2-4. CHARTE NIVEAU A2 Pages 5-7. CHARTE NIVEAU B1 Pages CHARTE NIVEAU B2 Pages 11-14

Voice Authentication for ATM Security

1. Introduction to Spoken Dialogue Systems

Social Signal Processing Understanding Nonverbal Behavior in Human- Human Interactions

How To Recognize Voice Over Ip On Pc Or Mac Or Ip On A Pc Or Ip (Ip) On A Microsoft Computer Or Ip Computer On A Mac Or Mac (Ip Or Ip) On An Ip Computer Or Mac Computer On An Mp3

Teaching English as a Foreign Language (TEFL) Certificate Programs

Vannesa Mueller CURRICULUM VITAE

Active Monitoring of Voice over IP Services with Malden

Text-To-Speech Technologies for Mobile Telephony Services

VoiceTone T1 USER S MANUAL

Ph.D in Speech-Language Pathology

User guide. Stereo Bluetooth Headset SBH70

Universal Host. Desktop Digital Hybrid. User Guide. JK Audio

You don t hear me but your phone s voice interface does. José LOPES ESTEVES & Chaouki KASMI

High Definition Wideband

9,'(2 #6(3$5$725 8VHU V#0DQXDO

Overview of Commissioner s List of Reading Instruments

Speech therapy in treatment of Vocal cord Nodules

Prosodic Characteristics of Emotional Speech: Measurements of Fundamental Frequency Movements

WinPitch LTL II, a Multimodal Pronunciation Software

Socio-Cultural Knowledge in Conversational Inference

Annotation Pro Software Speech signal visualisation, part 1

Internet Radio access. Free radio from over 50,000 stations

Curriculum Vita. Gregg M. Stutchman Forensic Analyst & Expert Witness in Audio, Video & Image Forensics & Forensic Photography

EUROPEAN COMPUTER DRIVING LICENCE. Multimedia Audio Editing. Syllabus

User guide. Stereo Bluetooth Headset SBH50

Proceedings of Meetings on Acoustics

ELECTROGLOTTOGRAPHIC ASSESSMENT OF THE RESULTS OF MEDICAL TREATMENT OF PATIENTS WITH VOCAL CORDS POLYPS 1. INTRODUCTION

SPEECH Biswajeet Sarangi, B.Sc.(Audiology & speech Language pathology)

LINGUISTICS (LIN) Spring Degrees Awarded M.A. in Linguistics; M.A. in Teaching English to Speakers of Other Languages; Ph.D.

Transcription:

CIVIL Corpus: Voice Quality for Forensic Speaker Comparison Eugenia San Segundo Helena Alves Marianela Fernández Trinidad Phonetics Lab. CSIC CILC2013 Alicante 15 Marzo 2013

CIVIL Project Cualidad Individual de la Voz en la Identificación de Locutores 2010 Phonetics Lab CSIC Laryngeal settings modification FORENSIC PHONETICS

Types of Voice Transformation (non electronic) 1) Phonation disguise: whisper (Orchard y Yarmey 1995 & Yarmey et al. 2001, Evans & Foulkes 2009) falsetto (Endres, Bambach & Floss 1971, Wagner & Köster 1999, Künzel 2000, Alves et al. 2012 ) creak/creaky ( Hirson & Duckworth 1993, Moosmüller 2001 Künzel 2000, Alves et al. 2012) 2) Prosody disguise: pitch, intonation, speech rate (Dellwo, Ramyead & Dancovicova 2009 & Dellwo, Kolly & Leemann 2012)

Types of Voice Transformation (non electronic) 3) Supraglottal disguise: Through objects (Molina de Figueiredo & Souza Britto 2000; Horga, 2002) Techniques that interfere within the habitual speech transmission (Rose & Simmons 1996, Llamas et ál. 2008, Gil & San Segundo 2013) 4) Phonological system disguise: foreign accent, dialectal or pathological features (Zhang & Tan 2008, Tate 1979, Markham 1999, Storey 1996, Moosmüller 2006, Simpson & Neuhauser 2009, 2010)

Disguise as a Challenge in Forensic Phonetics Most criminals do not combine all these disguising techniques (Masthoff 1996). The most frequently used is the voluntary modification of the phonation types. This kind of disguise is specially difficult to maintain for a long stretch of time (Künzel 2000).

CIVIL: hypotheses Changes in phonation = harmful for speaker recognition Idiosyncratic phonetic features (biometric traces): Remain despite the disguise attempts Some laryngeal characteristics cannot be disguised

Types of Phonation Phonation = vocal folds vibration From: http://www.phys.unsw.edu.au/jw/voice.html

Types of Phonation Different states of the vocal folds produce different types of phonation Falsetto -adducted +tense elongated Modal adducted tense ------------- Creak/y +adducted -tense shortened

Corpus CIVIL 31 female speakers and 27 male speakers Standard European Spanish 20-35 years old mean 25.6 years old Two recording sessions mean 29.8 days Why? Forensically realistic Non-contemporaneous speech samples ( - ) Within-speaker variation (+) Between-speaker variation

Corpus CIVIL Three tasks: Voice Signal: 3-4 minutes of conversation Microphone 33 carrier sentences Telephone 2 texts EGG Three Types of Phonation: Modal Falsetto Creak/y

Electroglottograph Measures the time variation of the degree of contact between the vibrating vocal folds Pérez Sanz, C. Ajustes laríngeos y estilos de habla en radio y televisión (Ph.D.)

Recording Equipment & Settings Equipment Recording booth of the CCHS Phonetics Lab Condenser microphone E6i Omnidirecctonal Earset Audio Interface UA-25EX by Roland PC with the software Adobe Audition 1.0 for Windows Telephones CISCO IP Phone as emitter & Samsung Galaxy as receiver Electroglottograph Glottal Enterprises EG2-PCX2 Settings: Sample Rate: 44100 Resolution: 16-bits Channels for voice: Stereo (L-microphone & R-telephone) Channels for EGG: Stereo (L-microphone & R-EGG)

Results so far Alves et al. (2012) Disguised voices: a perceptual experiment, 3rd European Conference of the International Association of Forensic Linguistics, Oporto. Listeners recognition of disguised voices is above chance (p < 0.001 ***) Speakers are worse recognized when using creak than when using falsetto. No performance differences between experts and naïve listeners in disguised voice recognition FEMALE VOICES!!

Future directions MALE VOICES? -Hypothesis: Worse recognition results when using falsetto - Expectations not met: - Creak less expected for female voice prototype - Falsetto less expected for male voices prototype Fernández-Trinidad, Infante & Alves (2013) Falsetto as a disguise method in male voices, AESLA, La Laguna

CIVIL Corpus: Voice Quality for Forensic Speaker Comparison Upon completion (total = 100 speakers): an open-access research resource!

CIVIL Corpus: Voice Quality for Forensic Speaker Comparison Thank you for your attention! eugenia.sansegundo@cchs.csic.es helena.alves@cchs.csic.es marianela.fernandez@cchs.csic.es CILC Alicante 15 Marzo 2013