Can V/C-ratio alone be sufficient for discrimination of V:C/VC: in Swedish? A perception test with manipulated durations.

Similar documents
SWING: A tool for modelling intonational varieties of Swedish Beskow, Jonas; Bruce, Gösta; Enflo, Laura; Granström, Björn; Schötz, Susanne

Aspects of North Swedish intonational phonology. Bruce, Gösta

Quarterly Progress and Status Report. Preaspiration in Southern Swedish dialects

Pronunciation in English

L2 EXPERIENCE MODULATES LEARNERS USE OF CUES IN THE PERCEPTION OF L3 TONES

Quarterly Progress and Status Report. Word stress and duration in Finnish

Intonation difficulties in non-native languages.

THE PRIORITY OF TEMPORAL ASPECTS IN L2-SWEDISH PROSODY: STUDIES IN PERCEPTION AND PRODUCTION. Bosse Thorén

Functional Auditory Performance Indicators (FAPI)

Workshop Perceptual Effects of Filtering and Masking Introduction to Filtering and Masking

Thirukkural - A Text-to-Speech Synthesis System

The Effects of Speaking Rate on Mandarin Tones. C2008 Yun H. Stockton

Learning Today Smart Tutor Supports English Language Learners

Modelling compound intonation in Dala and Gotland Swedish Schötz, Susanne; Bruce, Gösta; Granström, Björn

MODELING THE CHANGING POPULARITY OF NAMES by Bengt Sigurd, Mats Eeg-Olofsson and Jørgen Ouren

Evaluation of a Segmental Durations Model for TTS

CHARTES D'ANGLAIS SOMMAIRE. CHARTE NIVEAU A1 Pages 2-4. CHARTE NIVEAU A2 Pages 5-7. CHARTE NIVEAU B1 Pages CHARTE NIVEAU B2 Pages 11-14

The Pronunciation of the Aspirated Consonants P, T, and K in English by Native Speakers of Spanish and French

A perceptual study of difficulties met by native speakers of English in the production of the durational patterns of Finnish disyllabic word

PERCENTAGE ARTICULATION LOSS OF CONSONANTS IN THE ELEMENTARY SCHOOL CLASSROOMS

Reading Competencies

Journal of Phonetics Swedish Quantity: Central Standard Swedish and Fenno-Swedish

Common Phonological processes - There are several kinds of familiar processes that are found in many many languages.

DRA2 Word Analysis. correlated to. Virginia Learning Standards Grade 1

Some Basic Concepts Marla Yoshida

The sound patterns of language

Pronunciation Difficulties of Japanese Speakers of English: Predictions Based on a Contrastive Analysis Steven W. Carruthers

Carla Simões, Speech Analysis and Transcription Software

Things to remember when transcribing speech

Comparative Analysis on the Armenian and Korean Languages

Mother Tongue Influence on Spoken English

Office Phone/ / lix@cwu.edu Office Hours: MW 3:50-4:50, TR 12:00-12:30

Strategies to improve reading fluency skills. Nancy B. Swigert, M.A., CCC/SLP The Reading Center Lexington, KY

An Arabic Text-To-Speech System Based on Artificial Neural Networks

INTEGRATING THE COMMON CORE STANDARDS INTO INTERACTIVE, ONLINE EARLY LITERACY PROGRAMS

Strand: Reading Literature Topics Standard I can statements Vocabulary Key Ideas and Details

THE VOICE OF LOVE. Trisha Belanger, Caroline Menezes, Claire Barboa, Mofida Helo, Kimia Shirazifard

(in Speak Out! 23: 19-25)

Historical Linguistics. Diachronic Analysis. Two Approaches to the Study of Language. Kinds of Language Change. What is Historical Linguistics?

Efficient diphone database creation for MBROLA, a multilingual speech synthesiser

Developmental Verbal Dyspraxia Nuffield Approach

M.A. Hispanic Linguistics. University of Arizona: Tucson, AZ.

Teaching and Learning Mandarin Tones. 19 th May 2012 Rob Neal

. Niparko, J. K. (2006). Speech Recognition at 1-Year Follow-Up in the Childhood

English for communication in the workplace

WinPitch LTL II, a Multimodal Pronunciation Software

CAMBRIDGE FIRST CERTIFICATE Listening and Speaking NEW EDITION. Sue O Connell with Louise Hashemi

The effects of non-native English speaking EFL teachers accents on their willingness to teach pronunciation

The Impact of Using Technology in Teaching English as a Second Language

Praat Tutorial. Pauline Welby and Kiwako Ito The Ohio State University. January 13, 2002

iboard Phonics Curriculum Guidance

Phonics and Word Work

Prelinguistic vocal behaviors. Stage 1 (birth-1 month) Stage 2 (2-3 months) Stage 4 (7-9 months) Stage 3 (4-6 months)

Annotation Pro Software Speech signal visualisation, part 1

Swedish for Immigrants

Lecture 12: An Overview of Speech Recognition

Thai Pronunciation and Phonetic Symbols Prawet Jantharat Ed.D.

Master of Arts in Linguistics Syllabus

The Role of Listening in Language Acquisition; the Challenges & Strategies in Teaching Listening

COMPUTER TECHNOLOGY IN TEACHING READING

Portions have been extracted from this report to protect the identity of the student. RIT/NTID AURAL REHABILITATION REPORT Academic Year

Prosodic focus marking in Bai

Unit 1 Title: Word Work Grade Level: 1 st Grade Timeframe: 6 Weeks

Selecting Research Based Instructional Programs

Unit 2 Title: Word Work Grade Level: 1 st Grade Timeframe: 6 Weeks

What is the Sapir-Whorf hypothesis?

TEXT TO SPEECH SYSTEM FOR KONKANI ( GOAN ) LANGUAGE

What Does Research Tell Us About Teaching Reading to English Language Learners?

Perceptual Acquisition of Norwegian Close Rounded Vowels by Mandarin Chinese Learners of Norwegian

Debbie Hepplewhite s suggestions for effective and supportive phonics provision and practice

Phonetic and phonological properties of the final pitch accent in Catalan declaratives

On the distinction between 'stress-timed' and 'syllable-timed' languages Peter Roach

Running head: PROCESSING REDUCED WORD FORMS. Processing reduced word forms: the suffix restoration effect. Rachèl J.J.K. Kemps, Mirjam Ernestus

Culture and Language. What We Say Influences What We Think, What We Feel and What We Believe

Program curriculum for graduate studies in Speech and Music Communication

MATTHEW K. GORDON (University of California, Los Angeles) THE NEUTRAL VOWELS OF FINNISH: HOW NEUTRAL ARE THEY? *

Uitspraakevaluatie & training met behulp van spraaktechnologie. Pronunciation assessment & training by means of speech technology.

Massachusetts Tests for Educator Licensure

Myanmar Continuous Speech Recognition System Based on DTW and HMM

OCPS Curriculum, Instruction, Assessment Alignment

APPLYING MFCC-BASED AUTOMATIC SPEAKER RECOGNITION TO GSM AND FORENSIC DATA

Teaching pronunciation Glasgow June 15th

SYLLABLE STRESS IN URDU

Issues in Second Language Listening Comprehension and the Pedagogical Implications

The National Reading Panel: Five Components of Reading Instruction Frequently Asked Questions

LABORATORY ACTIVITIES FOR LARGE AND ONLINE PHONETICS CLASSES

How Can Teachers Teach Listening?

Phonetic Perception and Pronunciation Difficulties of Russian Language (From a Canadian Perspective) Alyssa Marren

The Perception and Production of Portuguese Mid- Vowels by Native Speakers of American English

ISSN X. Lund University. Centre for Languages and Literature. General Linguistics Phonetics. Working Papers

Indiana Department of Education

Effect of Audio Instructional Package on Basic Pupils Performance in English Pronunciation Skills in Ilorin, Kwara State, Nigeria

Wednesday 4 th November Y1/2 Parent Workshop Phonics & Reading

Scientifically Based Reading Programs: What are they and how do I know?

Teaching Word Identification and Spelling Word Identification by Rebecca Felton, PhD.

What makes a language attractive its sound, national identity or familiarity?

An Evaluation of Bone-conducted Ultrasonic Hearing-aid regarding Transmission of Japanese Prosodic Phonemes

Proceedings of Meetings on Acoustics

The influence of L1 on the acquisition of Swedish quantity by native speakers of Spanish, English and Estonian

COMPREHENSIBILITY AND PROSODY RATINGS FOR PRONUNCIATION SOFTWARE DEVELOPMENT

Transcription:

Can alone be sufficient for discrimination of V:C/VC: in Swedish? A perception test with manipulated durations. Bosse Thorén Department of Linguistics, Stockholm University 1. Introduction Out of the need to simplify linguistic description for pedagogic purposes, a basic prosodic model for Swedish has developed over the last two decades. Since Swedish is a language with both distinctive word stress, e.g. planet (the plane) pla net (planet) and distinctive length i.e. quantity, e.g. glas (glass) glass (ice cream), it has become obvious that these properties are important when learning (and teaching) Swedish as a second language. This model is developed by practicing teachers and not the result of systematic scientific work. It has been influenced by Bruce s (1977) model, where the quantity distinction which occurs only in stressed syllables is the common denominator of all degrees of word prominence in Swedish. Elert (19) shows that the quantity distinction has temporal correlates, and also, that the quantity is realized as a complementary durational relation between vowel and subsequent consonant in stressed syllables; /V C/ - /VC /. The pedagogical model also owes much to Bannert s (e.g. 19, 199) many studies that have improved our understanding of what phonetic properties are crucial for speaking intelligible Swedish, and what properties are less important. His findings suggest that correct word stress is vital for making oneself understood in spoken Swedish. Furthermore, Fant & Kruckenberg (199), suggest that duration is the most reliable phonetic correlate to stress in Swedish, so consequently the phonologic properties of stress and quantity, and the phonetic correlate of duration seem to be inseparable in Swedish. The pedagogic model focuses on stress and length, not distinguishing phonological properties and temporal duration. If the learners of Swedish as a second language implement the length in the phonologically long sounds as increased duration, it is assumed that a native Swedish listener is helped in perception of stress at word- and sentence level and phonological quantity. Each bi- or polysyllabic word in the native Swedish speakers lexicon is assumed to be marked for which syllable gets the stress if the word is prominent in the utterance, and the potentially stressed syllable is marked for lengthening of either vowel or consonant, thus allowing for syllables of the type /V C/ and /VC / respectively. Although many tutorial works about Swedish prosody include descriptions and instructions concerning tonal features, the abovedescribed hierarchy of stress and length is common for works that differ in other

aspects. This basic model often manifests itself in textbooks in Swedish as a second language in the following way: Potatis är gott. (Potatoes are tasty) In each prominent word, the long sound in the stressed syllable is underlined. By pointing out the phonologically long sound in the stressed syllable and thus promoting the quantity distinction by increasing or reducing the V/Cratio this marking also points out the stressed syllables in the prominent words, as well as the prominent words in a sentence,. Thorén (1) tested the effect of artificially increased duration in phonologically long segments, as a s of improving Swedish with a Polish accent, and found that native Swedish listeners tended to judge increased duration as improved Swedish pronunciation, although some of the listeners had difficulties in discriminating manipulated and intact stimuli. The phonologic distinction /V C/ /VC / is also accompanied by a spectral difference in the vowel, which is greater for some vowels and smaller for others (e.g. Elert 197). Duration and spectrum also interacts differently in different regional varieties of Swedish, and e.g. Reuter (1971) finds that Finland-Swedish (Helsinki variety) has smaller spectral differences between long and short vowel allophone than central standard Swedish, which is noticeable for the vowel phonemes /a/ and / /, both of which differ greatly between long and short allophone in most regional varieties of Sweden-Swedish. Reuter (19) found furthermore that Finland-Swedish long/short vowel allophones differ more in duration than is the case for central standard Swedish. The pedagogical basic prosody model, or at least how it is commonly implemented suggests that temporal correlates to the Swedish quantity are more important cues than spectral. More important should here be interpreted as more important as perceptual cues and hence more important to teach/learn. A study by Behne et al. (1997) suggests that duration is a stronger cue to discrimination for /i/, /o/ and /a/, but that the interaction between duration and formant spectrum for /a/ is complicated. The main purpose of the present study is to test whether the temporal correlates of the Swedish /V C/ - /VC /-distinction is a stronger cue to Swedish quantity than the spectral correlates, and to compare a method using complementary duration to studies using manipulation of vowel duration only, e.g. Behne et al. (1997) and Hadding-Koch & Abramson (19). Another aim is to scrutinize the suggestion of Hadding-Koch & Abramson (19), that Swedish has a subset of eight vowel phonemes that combine with a phoneme of length to yield long counterparts and another subset of two vowel phonemes, / / and /P/, distinguished by quality.

. Method.1 Stimuli 1 Swedish words, providing minimal word pairs, each pair having identical phoneme sequences and differing only in quantity, were read by two native Swedish (Stockholm variety) speakers, one male and one female. The words were: mat-matt [ma t-mat ], vit-vitt [vi túú-vit ], hut-hutt [h t-hpt ], våt-vått [vo tvçt ], tät-tätt [te t-tet ] and söt-sött [so t-søt ]. This common way of using different IPA-characters for the long-short allophones of the Swedish vowels reflects the spectral difference that is noticeable for most of the vowel phonemes. The words were read in alphabetic order from a list and recorded in a studio at the Phonetics department of the Institution of Linguistics at the University of Stockholm. Based on duration values from the readings of the two speakers, a duration of ms was chosen as total duration of the VC-sequence for all stimuli in the experiment. The recording of the female speaker was resynthesized and manipulated in PRAAT (Boersma & Weenink 1). PRAAT manipulations by the PSOLA-method allows for reducing and increasing durations evenly in a chosen section. Each of the 1 words were given s shown in table 1. Segmentation was made by s of spectrogram, oscillogram and listening. When the words had voiceless initial consonant, the vowel duration was defined as the voiced part of the word, and when the words (mat, matt, våt, vått) had voiced initial consonant, the boundary between consonant and vowel was not problematic. The final t-segment was defined as the beginning of voiceless part after the vowel an including the burst. Table 1. Vowel durations, consonant durations and s for all test words. V/C ratio in the Consonant duration Diagonal 3 3 Vowel Duration 3 1, 3 1,,7, 1,3,,1

. Presentation 7 versions of each word made a total of stimuli, that were presented in random order to native adult Swedish listeners. Presentation was made with head phones from computer or CD-player. The listeners heard each stimuli once, and responded for each stimulus whether they perceived it as a /V C/-word or as a /VC /-word, by marking one of two words, e.g. mat or matt. There was no uncertain alternative. After the discrimination test, the listeners judged the naturalness of the pronunciation for each stimulus on a scale from 1 totally unnatural pronunciation to totally natural pronunciation. 3. Result 3.1 Original s Table shows ranges and values of the for the original, intact recordings of the test words. In these recordings there seems to be a vacuum of s between. and., which could be interpreted as a security distance for the discrimination. The total number of six recordings (one male and one female speaker reading at intended slow, medium and fast rate) show in fact very few observations between.3 and.7. Table. Range and values for s in original recordings. s for V C words s for VC words Mat matt.7.97..3 Vit vitt.1.9.3. Hut hutt..3.19.3 Våt vått.3 1..1.37 Tät tätt. 1...3 Söt sött.7 1..3.37 Total and.1 1. m=.79.1.3 m=.7 3. Discrimination The result of the discrimination test is captured in figure 1, and shows that all included vowel phonemes are influenced by the change in. The different phonemes however, show varying sensitivity to the temporal change. The vowel that seems most apt to be perceived as the non-original quantity type is /E/, and most resistant to temporal change is / /. Most of the shifting over from original to non-original quantity type takes place between s.3 and

.7, which corresponds rather well to the vacuum observed in the original recordings of native speech. We can note that short [I] and short [ç] need V/Cratios greater than the typical ones, in order to be perceived as long, and that long [A ] and long [ ] need s smaller than the typical ones, in order to be perceived as short. Number of V:Cresponses Number of V:Cresponses 1 1 "mat" - "matt".1..3..7 1. 1. "hut" - "hutt".1..3..7 1. 1. Number of V:Cresponses 1 Number of V:Cresponses 1 "vit" - "vitt".1..3..7 1. 1. "våt" - "vått".1..3..7 1. 1. Number of V:Cresponses 1 "tät" - "tätt".1..3..7 1. 1. Number of V:Cresponses 1 "söt" - "sött".1..3..7 1. 1. Original V:C Original VC: Figure 1. Number of listeners perceiving stimuli as /V C/-word as a function of. A solid line with filled squares always represents an original V C-word and a solid line with open squares always represents an original VC -word. 3.3 Naturalness Judgements of naturalness are presented in figure and 3. As could be expected, naturalness judgements decrease as the differs more from the typical value for the quantity type in question. This decrease is more obvious for original /VC /-words. The vowel phonemes /E/ and /O/, traditionally known as having minimal spectral difference between long and short allophone are judged as highly natural both when perceived as /V C/ [te t], [so t] and when perceived as /VC / [tet ], [sot ], with the high naturalness judgements more evenly distributed

when original word is /V C/-type. This tendency is true for phonemes /i/, /o/, /E/ and /O/, indicating that their original /VC /-spectrum is more marked than the /V C/-spectrum. Original hutt [hpt ] is judged as natural only when having s.1 to., i.e. s similar to original version read by the Swedish speakers. "mat",1,,3,,7 1 1, "vit",1,,3,,7 1 1, "hut",1,,3,,7 1 1, "våt",1,,3,,7 1 1, "tät" Naturalness judgements for original "söt",1,,3,,7 1 1,,1,,3,,7 1 1, Perceived as VC: Perceived as V:C Figure. Mean values of naturalness judgements for original /V C/ words at different V/Cratios and perceived as either /V C/ or /VC / This is consistent with the traditional view, (e.g. Hadding-Koch & Abramson 19, Elert 197) that / /, and to some extent /a/, have greater spectral difference between long and short allophone than other Swedish vowel phonemes, and that /E/ and /O/ have minimal spectral difference between long and short allophone, thus being more sensitive to mere temporal change.

"matt",1,,3,,7 1 1, "vitt",1,,3,,7 1 1, "vått",1,,3,,7 1 1, "hutt",1,,3,,7 1 1, "tätt",1,,3,,7 1 1, Naturalness judgements for original "sött",1,,3,,7 1 1, Perceived as V:C Perceived as VC: Figure 3. Mean values of naturalness judgements for original /VC / words at different V/Cratios and perceived as either /V C/ or /VC / 3. Duration versus spectrum So far we have stated that native listeners can perceive a word of /V C/-type as a /VC /-word, and vice versa, with the aid of temporal change only, and also that this is easier for some vowel phonemes than for others. This material does not allow for a reversed test, namely to keep constant and gradually change the vowel spectrum. What we can do, however, is to take a typical for one quantity type, and then shift the spectrum to a value typical for the other quantity type (namely the original spectrum for either of the two quantity types). Can this change affect the listeners perception? This measure is compared to the result of constant spectrum and moving to a value typical for the non-original quantity type. The typical ratios for long vowels are best represented by ratios.7 and 1. in the original recordings, and the typical ratios for short vowels are best repre-

sented by ratios. and.3. These s have also received the highest rates for naturalness. Let us compare the number of /V C/-judgements (averaged over the two mentioned values) for each vowel phoneme at the typical values for long and short vowel respectively, and se how many listeners perceive stimuli as non-original quantity type, when spectral properties are shifted at constant. Figure shows how a change in temporal and spectral properties respectively affect the perception of the listeners. The average responses over all six vowel phonemes show that temporal change has caused more shifts in perception than spectral change. The / / phoneme is the only one that is more affected by spectral than temporal change, and /E/ /O/ are in these figures unaffected by spectral change, but have caused all listeners to shift their perception from original to non original quantity type as a result of temporal change. Number of responses for non original quantity type as a function of changed and spectrum respectively Number of Changed-quantity-type responses 1 /a/ /i/ /u/ /å/ /ä/ /ö/ Mean Temporal shift from V:C Temporal shift from VC: Spectral shift from V:C Spectral shift from VC: Vowel phonemes in Swedish orthography Figure. Number of listeners changing their judgements to non-original quantity type as a function of temporal and spectral manipulation.. Discussion The study indicates that all of the included vowel phonemes use both duration and spectrum as cues for the quantity distinction, but in different proportions for various vowel phonemes. Since the various phonemes differ to a large extent in their duration/spectrum dependence, and since all of the vowel phonemes are influenced by temporal shift, the results do not support the suggestion of Hadding- Koch & Abramson (19), that the / / phoneme would hold an exceptional position in the Swedish vowel inventory, but it puts / / at one end of a continuum

of duration/spectrum dependence and the /E/ phoneme at the other end, / / being more dependent on spectrum than on, and /E/ being more dependent on than on spectrum. The fact that Hadding-Koch & Abramson (19) used south Swedish speakers and listeners can also axplain some of the differences in result and conclusions between their studie and the present one. The six studied phonemes could be lined up according to their proportional dependence on or spectrum, using the diagrams in figure, with num ber of listeners affected by temporal change / number of listeners affected by spectral change: /E/ /, /O/ / >> /o/ 3/ >> /i/ /1, /a/ /1 >> / / 11/9 The results suggest that both temporal and spectral properties play an important role for Swedish V C/VC -discrimination, but taken over the vowel inventory in this study, the temporal organisation seems to be a stronger cue to the listeners perception than does the spectrum. We see also, as shown in table 1, that the speakers make similar temporal differences for the / /-phoneme, as for the other vowel phonemes in spite of the little distinctive use this phoneme seems to make of the duration. Although this is not consistent with Elert s (19) findings that the temporal difference between long and short / /-allophone are smaller than for the other vowel phonemes, it can indicate that not only V C/VC - discrimination must be maintained, but also physical lengthening in stressed syllables (cf. Behne et al. 1997). The increased duration in stressed syllables seems to have at least two roles to play; one to distinguish stressed syllables from unstressed ones, and one to distinguish the two quantity types. The conclusion for pedagogical purposes is that the temporal organisation is of superior importance because of its double function, and that the spectral differences are also important, but play a role only in quantity distinction, and not substantially for all vowel phonemes. Inspired by the present results and Reuters findings, I suggest that more similar experiments be carried out with various Swedish regional varieties, to extract what is general Swedish quantity, and what is regional or local Swedish quantity.. References Bannert R. (19) From prominent syllables to a skeleton of ing: a model of prosodically guided speech recognition. Working Papers, Department of Linguistics, Lund University 9, 1-3. Bannert, R. (199). Intelligibility in foreign accented Swedish: the effects of rhythmical and tonal features. Reports from the Department of Phonetics, Umeå University. PHONUM

Behne, D. Czigler P. & Sullivan K. (1997) Swedish Quantity and Quality: A traditional Issue Revisited. Phonum. Dept of Phonetics, Umeå University. 1-. Boersma P. & Weenink D. (1) Praat a system for doing phonetics by computer. http://www.fon.hum.uva.nl/praat/ Bruce, Gösta. (1977). Swedish word accents in sentence perspective. Gleerup, Lund. Elert, C-C. (19). Phonologic Studies of Quantity in Swedish. Uppsala: Almqvist & Wiksell. Elert, C-C. (197) Ljud och ord i svenskan. Almqvist & Wiksell, Stockholm. Fant G. & Kruckenberg A. (199). Notes on stress and word accent in Swedish, KTH, Speech Transmission Laboratory, Quarterly Progress and Status Report -3, 1-1. Hadding-Koch K. & Abramson A. (19). Duration versus spectrum in Swedish vowels: Some perceptual experiments. I Studia Linguistica 1. 9-7. Reuter M. (1971). Vokalerna i finlandssvenskan: En instrumentell analys och ett försök till systematisering enligt särdrag. I Studier i nordisk filologi, -9. Reuter M. (19/. Kvantitet I helsingforssvenskan. Opublicerad licentiatsavhandling. Helsingfors universitet. Thorén B. (1). Vem vinner på längden? Två experiment med manipulerad duration I betonade stavelser. D-uppsats i fonetik. Institutionen för filosofi och lingvistik, avdelningen för fonetik. Umeå universitet. Bosse Thorén Dept. of Linguistics University of Stockholm Fjällvråksvägen 1 Sundsvall bosse.thoren@skola.sundsvall.se