The SweDat Project and Swedia Database for Phonetic and Acoustic Research
|
|
|
- Marsha Crystal Carson
- 10 years ago
- Views:
Transcription
1 2009 Fifth IEEE International Conference on e-science The SweDat Project and Swedia Database for Phonetic and Acoustic Research Jonas Lindh and Anders Eriksson Department of Philosophy, Linguistics and Theory of Science University of Gothenburg Gothenburg, Sweden The project described here may be seen as a continuation of an earlier project, SweDia 2000, aimed at transforming the database collected in that project to a full-fledged e-science database. The database consists of recordings of Swedish dialects from 107 locations in Sweden and Swedish speaking parts of Finland. The goal of the present project is to make the material searchable in a flexible and simple way to make it available to a much wider sector of the research community than is the case at present. The database will be accessible over the Internet via user-friendly interfaces specifically designed for this type of data. Other more specialized research interfaces will also be designed to facilitate phonetic acoustic research and orientation of the database. Phonetics; Acoustics; Forensic Phonetics;Dialectology I. INTRODUCTION The database consists of recorded speech material from 107 Swedish dialects. The recordings were made as part of a research project carried out as a joint effort by the departments of linguistics at Umeå, Stockholm and Lund universities. The research project was funded by the Bank of Sweden Tercentenary Fund for the period The full title of the project was The Phonetics and Phonology of the Swedish Dialects around the Year In the following it will be referred to as SweDia 2000 or simply SweDia. To be able to perform empirically based linguistic research at present it is important to have access to large amounts of reliable data. The data should also be searchable in a user friendly way to facilitate answering research questions without any specialized knowledge of computation or data handling. On-line applications are suitable for some types of research, while other types are better used locally for reasons of security, speed etc. We are trying to provide tools and data for an as large user population as possible for the Swedia database as research questions span way beyond phonetic, phonological and acoustic questions; for example syntax [1] and biometric voice statistics [2]. In this paper we will describe some unique properties of the database, how the current project will continue the development of interfaces, functions and availability of the data. II. COLLECTING THE DATA Most of the recordings were made during the summer of The recording locations were evenly distributed over Sweden and the Swedish speaking parts of Finland, taking into account both geographical distribution and population density. For each location twelve speakers were recorded, representing two age groups young adults aged years of age and an older generation, years of age, approximately equivalent to the parent generation of the younger speakers. Both age groups consisted of three male and three female speakers. In figure 1 on the next page the geographical distribution of all the recording places can be observed. III. SOME UNIQUE PROPERTIES OF THE SWEDIA 2000 DATABASE There are other dialect databases with a comparable number of speakers and dialects but the present database has certain properties, which as far as we are aware, do not exist in otherwise comparable databases. Synchronic: All recordings were made within a narrow and very precisely defined time slice. That means that they represent the dialectal variation within the Swedish speaking community at a precisely defined moment in time. Consistent: The recorded material has three parts that contain precisely defined speech material meant to represent three fundamental, phonological properties of Swedish the quantity system, the accent system, and the phoneme inventory. This means that it is possible to analyze and compare speech material that is identical for all dialects. Complete: In addition the database contains approximately 30 minutes of spontaneous speech per speaker, corresponding to running words per dialect when transcribed. This part provides information about how the various phonological rules are implemented in everyday casual speech. But it may also be used for morphological and syntactic studies as mentioned in the introduction. These different, well-controlled components make the database a unique source of information for answering research questions of the type mentioned in Section 5 in a systematic, varied and detailed way /09 $ IEEE DOI /e-Science
2 Arjeplog Sorsele Piteå Överkalix Kalix Nederluleå Frostviken Vilhelmina Strömsund Fjällsjö Burträsk Vindeln Bjurholm Nysätra Åre Aspås Storsjö Berg Vemdalen Anundsjö Ragunda Kramfors Torp Indal Närpes Munsala Vörå Lillhärdal Särna Färila Delsbo Älvdalen Ovanåker Orsa Skog Ockelbo Dalby Leksand Brändö S, Finnskoga Årsunda Malung Grangärde Gräsö Saltvik Houtskär Borgå Nora Kyrkslätt Gräsmark Skillingmark Skinnskatteberg Skuttunge Gåsborn Villberga Dragsfjärd Snappertuna Köla Järnboås Haraker Kårsta St. Mellösa Bengtsfors Viby Länna Sorunda Skee Torsö V. Vingåker Frändefors Tjällmo Korsberga Orust Floby S:t Anna Rimforsa Östad Asby Fårö Kärna Järsnäs Ankarsrum Fole Öxabäck Stenberga Böda Sproge Frillesås Burseryd Bredsätra Årstad-Heberg Hamneda Väckelsång Våxtorp Broby Torsås Segerstad Bjuv Össjö Jämshög Hällevik Torhamn N. Rörum Bara Löderup Figure 1. The geographical spread of all places where recordings took place. 46
3 IV. SOME SCIENTIFIC CONSIDERATIONS There are several fundamental scientific questions that may be addressed in a fruitful way by an analysis based on data of the kind contained in the SweDia database. One such question which has played a central role in the SweDia project will be described in some detail. In traditional dialectology, geographical dispersion and isolation of groups of speakers as well as renewed contact as a result of migration are considered the driving forces behind linguistic variation and change. There is no doubt that those factors are important, but is that all there is? If that were the case, then variation and change could take any direction and the end result would appear more or less chaotic. We know, however, that that is not the case so there must exist linguistic laws which govern development and change, constrain variation and make certain choices more likely than others. A basic tenet in the SweDia project is the belief those laws and constraints can be discovered, described and tested. Interesting results have been obtained by approaching the description of regional distribution from an angle that does not assume any particular geographically based constraints at all. In three studies ([3], [4], [5]), cluster analysis has been used as a means of constructing dialect areas only based on individual acoustically grounded phonological properties. The resulting geographical areas are defined by dialects whose properties cluster together. In principle this could result in a very scattered picture with no obvious geographical coherence. But this does not turn out to be the case. On the contrary, dialects group in geographical areas that in many cases closely resemble those suggested in traditional dialectology. This would be a rather trivial finding if the clustering was based on the same considerations as the traditional analyses, but this is not the case at all. In both studies mentioned above, the cluster analyses are based on acoustic properties never considered in traditional classifications. This lends support for the assumption that dialectal variation is constrained by the compatibility of internal factors. Dialectal variation as a function of age and gender of the speaker may also be studied using the SweDia database. We know from previous studies that these factors play a role, but the size of the SweDia database and the fact that the material is identically structured for all speakers represented in the database makes it possible to address this question with a greater degree of precision than has previously been possible. Another important area of research is typologically oriented studies of sound systems and accent systems. Many examples will be found by looking at the SweDia publication list 1 but many more remain to be done. V. PROJECT GOALS In this section we will briefly describe and motivate the main goals of the current project named SweDat. A. Making the database searchable In order to make the speech data searchable, the audio files have been tagged (a kind of time aligned annotation). The tagging consists of text files containing various 1 relevant labels and information about where in the sound file the corresponding feature may be found. There are several analysis programs available which allow the audio files to be displayed (and listened to) with the tag files time aligned with the audio signal in a separate window. We have chosen a sound format and a tag file format that is compatible with the two most widely used (freeware) analysis programs (Praat and WaveSurfer) [6] [7]. B. Tagging the Parts of the Controlled Speech Material not yet Tagged It goes without saying that gender and age are crucially important factors in dialect research. As was detailed above, the database covers variation with respect to these factors. A problem that remains to be solved before these factors may be included in searches at all levels, however, is to complete the tagging of the entire material. The resources available for the project within which the data were collected did not permit tagging of the entire material. As a compromise it was decided that it would be more useful to have all types of material tagged for at least one category of speaker than parts of the material for all speakers. The category we chose for complete tagging was the older generation of male speakers. In addition, as much of the remaining material as could be treated using available economic resources was also tagged. Approximately 85 % of the work has been now completed. This kind of work can only to a limited extent be performed using automatic methods and is therefore very time consuming. However, the current project described here has the goal to complete as much as possible of the tagging and hopefully complete the database. C. Tagging the spontaneous speech material The spontaneous speech material has not previously been tagged at all and it was never the intention in the SweDia project to do so. The material was primarily meant to be used as an extra resource from which additional material could be retrieved. It would, of course, be a useful resource if the spontaneous speech material had also been tagged to make it searchable, but at the time of the original SweDia project, this would have required an inordinate amount of work and would have increased the costs way beyond available resources. In recent years, however, several methods have been developed for automatic or semi-automatic alignment of a text transcribed from a sound file and the corresponding sound file. We have tested one such program and found that under favourable conditions the alignment of text and sound can be quite good [7]. We have access to orthographic transcriptions of about 50 speakers representing 15 different dialects. These transcriptions were made by transcribers engaged in a small pilot study of dialect syntax done in Umeå and by people working for the Scandinavian Dialect Syntax project who have been given access to SweDia recordings. What we intend to do within the present SweDat project is to use an automatic aligner to tag those files and see how far we can get. Our own resources will not primarily be spent on transcription work but with developing automatic or semi-automatic alignment routines so that, hopefully, spontaneous speech material from more dialects may be incorporated as orthographic transcriptions become available from various other project groups and individual researchers. Would it not be possible to use the same 47
4 method to semi-automatically transcribe the controlled material one might ask? And the answer is that we are already doing that. But for this type of material it is essential for many types of analyses that the alignment between the tags and the corresponding sound is very precise. So although the semi-automatic methods save quite a bit of time, all the files must nevertheless be checked very carefully by human transcribers to make sure that the precision is at the required level. If one accepts that the tagging of the spontaneous material will primarily be used to search the material for occurrences of certain words or phrases, then the same degree of precision is not required. The alignment must nevertheless be checked manually. The results of this checking will also provide us with valuable information useful for fine tuning the automatic methods. VI. DEVELOPING EASY-TO-USE USER INTERFACES When the work on setting up the database began towards the end of 1999, the only suitable programs were developed for a UNIX environment. They were fine, worked well and were easy to use for people with a reasonable UNIX experience. They were expensive, however, and so were the machines. But at the time and for the purpose it was a very satisfactory solution. Several search and analysis tools were developed for use on UNIX platforms. Today the situation is radically different. Two of the programs most widely used for acoustic analysis are freeware programs. They cannot be used to run the tools that were developed, but they both accept the sound file formats and the time aligned tag files. So listening to the original recordings and using the time aligned tag-files is possible without any further work. At least one of the programs, Praat, provides an environment which would permit the development of all the tools needed to do the work that was previously done on UNIX machines. But the tool scripts and programs must be developed from scratch, using the same ideas but implemented in a different way. This is, however, only the first step. The programs developed for the UNIX environment were largely command line based, which is fine for those familiar with such an environment but an insurmountable obstacle for many of the users we now have in mind. In addition to porting the tools to the Praat environment, we must therefore also develop user friendly, intuitive, graphical interfaces which can be understood and used by any moderately computer literate user. Examples of tools already existing in the old system are search tools by which the user can search for all occurrences of a certain word uttered by a given speaker category, say older women, and a freely chosen set of recording locations. As a result of the search, the requested words are copied from the respective sound files and concatenated in a result file together with the corresponding labels from the tag files. The result can then be listened to or saved for further analyses. Cartography is also an important and much used method for exploring dialect data for regional distribution of various properties. There exist several studies using SweDia material which have applied this approach. Three have been mentioned here an ongoing Doctoral thesis in Groningen, Netherlands, a completed Doctoral dissertation from Umeå and a Master s thesis from Gothenburg. They used cluster analysis or multidimensional scaling to map the results of acoustic analyses on a geographical map. We intend to make some form of automatic mapping of various types of analysis results an integral part of the user interface we plan to develop. VII. ALREADY IN USE AND ONGOING DEVELOPMENT So far, some implementations and development has been done and some are still in a developing stage. A. Web Based Tools When it comes to web based applications we have developed cooperation with the Text Laboratory in Oslo, a unit at the Department of Linguistics and Scandinavian Studies, University of Oslo. They are mainly involved in research and development of new resources such as software and tools for user interfaces, searching, databases, text research, linguistic research and corpora. One of the tools under development is Glossa [9], a web based search tool for researchers in linguistics. The tool can be used for multi-lingual corpora and provides highly sophisticated ways to search annotated corpora and provide you with statistics. The annotated corpora can also include audio and video that can be shown integrated in the browser. We have provided them with parts of our database along with annotations in a suitable format. Some conversion tools between formats had to be developed within our project, however. The opportunity to be part of this joint effort is highly rewarding for our purposes. B. Locally Based Tools For acoustic and phonetic research purposes it is sometimes better to work with the database in a locally based environment. The database in its current format needs at least around 200 GB of disk space. The tools for acoustic research will be ported to the open source environment Praat. A very suitable plug-in function is provided for developing interfaces with a scripting environment. In this way the database can be provided with a suitable Swedia plug-in with functions for acoustically displaying figures for groups of data or just to search and extract desirable parts. C. Forensic Phonetic Biometric Research A research group at the department of Philosophy, Linguistics and Theory of Science, University of Gothenburg, is currently investigating the use of automatic methods in speaker identification. To increase reliability, using Bayesian statistical methods, it is important to have access to databases with data representative for the population in question. We intend to use the data in the SweDia database which contains data from about 1300 speakers varying in dialect, speaker age and speaker sex as part of such a population database. For those purposes yet another open source environment for biometric research and development, Mistral/Alize [10], is used integrated with our other tools. So far a pilot study of parameter statistics has been done [11] and several so called universal background models (UBMs) have been created [2] [12]. VIII. FUTURE DEVELOPMENT OF THE PRESENT PROJECT The aim is now to complete the database with described tools and research environment and make it 48
5 available for as many areas of research as possible. As described above several important initiatives and first steps have been taken. However, the ongoing developments are only first steps and a lot of work remains. The result of our efforts will hopefully make the database a very useful resource in our own discipline, but we also hope to contribute to other areas when it comes to handling databases, developing tools and encourage interesting interdisciplinary collaborations. ACKNOWLEDGMENT This research is supported by a research grant from the Swedish Research Council (grant # ). REFERENCES [1] [2] J. Lindh, Pick a Voice among Wolves, Goats and Lambs. Proceedings of IAFPA2009, Cambridge, UK, August [3] F. Schaeffler, Phonological Quantity in Swedish Dialects. (Doctoral Dissertation) Umeå University: PHONUM 10 Reports in phonetics, [4] Lundberg, Jan, Classifying Dialects Using Cluster Analysis. Master s Thesis in Computational Linguistics, Department of Linguistics, University of Gothenburg. [5] T. Leinonen, Classifying Swedish Dialects based on Vowel Pronunciation. Workshop of Production, Perception, Attitude. Leuven, Netherlands, [6] P. Boersma, Praat, a system for doing phonetics by computer. Glot International 5:9/10, pp , [7] K. Sjölander and J. Beskow, Wavesurfer An Open Source Speech Tool. In Proc. ICSLP, volume 4, pp , [8] K. Sjölander, An HMM-based system for automatic segmentation and alignment of speech. in Proc of Fonetik, Umeå University, PHONUM 9, pp , [9] L. Nygaard, Glossa The Corpus Explorer, Version [10] J-F. Bonastre, F. Wils. and S. Meigner ALIZE, a free toolkit for speaker recognition, in Proceedings of ICASSP, 2005, pp [11] J. Lindh, Preliminary Descriptive F0-statistics for Young Male Speakers, Papers from FONETIK 2006, Working Papers, 52, Department of Linguistics and Phonetics, Lund University, 89-92, [12] J. Lindh, A first step towards a text-independent speaker verification Praat plug-in using Mistral/Alize tools, The XXII nd Swedish Phonetics Conference, Department of Linguistics, Stockholm University,
SWING: A tool for modelling intonational varieties of Swedish Beskow, Jonas; Bruce, Gösta; Enflo, Laura; Granström, Björn; Schötz, Susanne
SWING: A tool for modelling intonational varieties of Swedish Beskow, Jonas; Bruce, Gösta; Enflo, Laura; Granström, Björn; Schötz, Susanne Published in: Proceedings of Fonetik 2008 Published: 2008-01-01
Quarterly Progress and Status Report. Preaspiration in Southern Swedish dialects
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Preaspiration in Southern Swedish dialects Tronnier, M. journal: Proceedings of Fonetik, TMH-QPSR volume: 44 number: 1 year: 2002
Turkish Radiology Dictation System
Turkish Radiology Dictation System Ebru Arısoy, Levent M. Arslan Boaziçi University, Electrical and Electronic Engineering Department, 34342, Bebek, stanbul, Turkey [email protected], [email protected]
Aspects of North Swedish intonational phonology. Bruce, Gösta
Aspects of North Swedish intonational phonology. Bruce, Gösta Published in: Proceedings from Fonetik 3 ; Phonum 9 Published: 3-01-01 Link to publication Citation for published version (APA): Bruce, G.
Study Plan for Master of Arts in Applied Linguistics
Study Plan for Master of Arts in Applied Linguistics Master of Arts in Applied Linguistics is awarded by the Faculty of Graduate Studies at Jordan University of Science and Technology (JUST) upon the fulfillment
Scandinavian Dialect Syntax Transnational collaboration, data collection, and resource development
Scandinavian Dialect Syntax Transnational collaboration, data collection, and resource development Janne Bondi Johannessen, Signe Laake, Kristin Hagen, Øystein Alexander Vangsnes, Tor Anders Åfarli, Arne
AUTOMATIC PHONEME SEGMENTATION WITH RELAXED TEXTUAL CONSTRAINTS
AUTOMATIC PHONEME SEGMENTATION WITH RELAXED TEXTUAL CONSTRAINTS PIERRE LANCHANTIN, ANDREW C. MORRIS, XAVIER RODET, CHRISTOPHE VEAUX Very high quality text-to-speech synthesis can be achieved by unit selection
Carla Simões, [email protected]. Speech Analysis and Transcription Software
Carla Simões, [email protected] Speech Analysis and Transcription Software 1 Overview Methods for Speech Acoustic Analysis Why Speech Acoustic Analysis? Annotation Segmentation Alignment Speech Analysis
Establishing the Uniqueness of the Human Voice for Security Applications
Proceedings of Student/Faculty Research Day, CSIS, Pace University, May 7th, 2004 Establishing the Uniqueness of the Human Voice for Security Applications Naresh P. Trilok, Sung-Hyuk Cha, and Charles C.
Robust Methods for Automatic Transcription and Alignment of Speech Signals
Robust Methods for Automatic Transcription and Alignment of Speech Signals Leif Grönqvist ([email protected]) Course in Speech Recognition January 2. 2004 Contents Contents 1 1 Introduction 2 2 Background
Master of Arts in Linguistics Syllabus
Master of Arts in Linguistics Syllabus Applicants shall hold a Bachelor s degree with Honours of this University or another qualification of equivalent standard from this University or from another university
APPLYING MFCC-BASED AUTOMATIC SPEAKER RECOGNITION TO GSM AND FORENSIC DATA
APPLYING MFCC-BASED AUTOMATIC SPEAKER RECOGNITION TO GSM AND FORENSIC DATA Tuija Niemi-Laitinen*, Juhani Saastamoinen**, Tomi Kinnunen**, Pasi Fränti** *Crime Laboratory, NBI, Finland **Dept. of Computer
EXMARaLDA and the FOLK tools two toolsets for transcribing and annotating spoken language
EXMARaLDA and the FOLK tools two toolsets for transcribing and annotating spoken language Thomas Schmidt Institut für Deutsche Sprache, Mannheim R 5, 6-13 D-68161 Mannheim [email protected]
Efficient diphone database creation for MBROLA, a multilingual speech synthesiser
Efficient diphone database creation for, a multilingual speech synthesiser Institute of Linguistics Adam Mickiewicz University Poznań OWD 2010 Wisła-Kopydło, Poland Why? useful for testing speech models
Speech Recognition Software Review
Contents 1 Abstract... 2 2 About Recognition Software... 3 3 How to Choose Recognition Software... 4 3.1 Standard Features of Recognition Software... 4 3.2 Definitions... 4 3.3 Models... 5 3.3.1 VoxForge...
ChildFreq: An Online Tool to Explore Word Frequencies in Child Language
LUCS Minor 16, 2010. ISSN 1104-1609. ChildFreq: An Online Tool to Explore Word Frequencies in Child Language Rasmus Bååth Lund University Cognitive Science Kungshuset, Lundagård, 222 22 Lund [email protected]
Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast
Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Hassan Sawaf Science Applications International Corporation (SAIC) 7990
Fast Labeling and Transcription with the Speechalyzer Toolkit
Fast Labeling and Transcription with the Speechalyzer Toolkit Felix Burkhardt Deutsche Telekom Laboratories, Berlin, Germany [email protected] Abstract We describe a software tool named Speechalyzer
CHARTES D'ANGLAIS SOMMAIRE. CHARTE NIVEAU A1 Pages 2-4. CHARTE NIVEAU A2 Pages 5-7. CHARTE NIVEAU B1 Pages 8-10. CHARTE NIVEAU B2 Pages 11-14
CHARTES D'ANGLAIS SOMMAIRE CHARTE NIVEAU A1 Pages 2-4 CHARTE NIVEAU A2 Pages 5-7 CHARTE NIVEAU B1 Pages 8-10 CHARTE NIVEAU B2 Pages 11-14 CHARTE NIVEAU C1 Pages 15-17 MAJ, le 11 juin 2014 A1 Skills-based
Program curriculum for graduate studies in Speech and Music Communication
Program curriculum for graduate studies in Speech and Music Communication School of Computer Science and Communication, KTH (Translated version, November 2009) Common guidelines for graduate-level studies
The effect of mismatched recording conditions on human and automatic speaker recognition in forensic applications
Forensic Science International 146S (2004) S95 S99 www.elsevier.com/locate/forsciint The effect of mismatched recording conditions on human and automatic speaker recognition in forensic applications A.
Tools & Resources for Visualising Conversational-Speech Interaction
Tools & Resources for Visualising Conversational-Speech Interaction Nick Campbell NiCT/ATR-SLC Keihanna Science City, Kyoto, Japan. [email protected] Preamble large corpus data examples new stuff conclusion
Why major in linguistics (and what does a linguist do)?
Why major in linguistics (and what does a linguist do)? Written by Monica Macaulay and Kristen Syrett What is linguistics? If you are considering a linguistics major, you probably already know at least
Automatic slide assignation for language model adaptation
Automatic slide assignation for language model adaptation Applications of Computational Linguistics Adrià Agustí Martínez Villaronga May 23, 2013 1 Introduction Online multimedia repositories are rapidly
Research Portfolio. Beáta B. Megyesi January 8, 2007
Research Portfolio Beáta B. Megyesi January 8, 2007 Research Activities Research activities focus on mainly four areas: Natural language processing During the last ten years, since I started my academic
TExES Texas Examinations of Educator Standards. Preparation Manual. 191 Generalist EC 6
TExES Texas Examinations of Educator Standards Preparation Manual 191 Generalist EC 6 Copyright 2011 by Texas Education Agency (TEA). All rights reserved. The Texas Education Agency logo and TEA are registered
An Arabic Text-To-Speech System Based on Artificial Neural Networks
Journal of Computer Science 5 (3): 207-213, 2009 ISSN 1549-3636 2009 Science Publications An Arabic Text-To-Speech System Based on Artificial Neural Networks Ghadeer Al-Said and Moussa Abdallah Department
Between voicing and aspiration
Workshop Maps and Grammar 17-18 September 2014 Introduction Dutch-German dialect continuum Voicing languages vs. aspiration languages Phonology meets phonetics Phonetically continuous, phonologically discrete
Text-To-Speech Technologies for Mobile Telephony Services
Text-To-Speech Technologies for Mobile Telephony Services Paulseph-John Farrugia Department of Computer Science and AI, University of Malta Abstract. Text-To-Speech (TTS) systems aim to transform arbitrary
The REPERE Corpus : a multimodal corpus for person recognition
The REPERE Corpus : a multimodal corpus for person recognition Aude Giraudel 1, Matthieu Carré 2, Valérie Mapelli 2 Juliette Kahn 3, Olivier Galibert 3, Ludovic Quintard 3 1 Direction générale de l armement,
Master of Arts Program in Linguistics for Communication Department of Linguistics Faculty of Liberal Arts Thammasat University
Master of Arts Program in Linguistics for Communication Department of Linguistics Faculty of Liberal Arts Thammasat University 1. Academic Program Master of Arts Program in Linguistics for Communication
PDF hosted at the Radboud Repository of the Radboud University Nijmegen
PDF hosted at the Radboud Repository of the Radboud University Nijmegen The following full text is a publisher's version. For additional information about this publication click this link. http://hdl.handle.net/2066/60933
Strategic plan for the development of research in Language Technology at the University of Gothenburg
Strategic plan for the development of research in Language Technology at the University of Gothenburg A long range plan with particular reference to the current research period 2009 12 Amended version,
Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg
Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg March 1, 2007 The catalogue is organized into sections of (1) obligatory modules ( Basismodule ) that
Quarterly Progress and Status Report. An ontogenetic study of infant speech perception
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report An ontogenetic study of infant speech perception Lacerda, F. and Aurelius, G. and Landberg,. and Roug-Hellichius, L-L. journal:
A CHINESE SPEECH DATA WAREHOUSE
A CHINESE SPEECH DATA WAREHOUSE LUK Wing-Pong, Robert and CHENG Chung-Keng Department of Computing, Hong Kong Polytechnic University Tel: 2766 5143, FAX: 2774 0842, E-mail: {csrluk,cskcheng}@comp.polyu.edu.hk
Your personalised, online solution to meeting ICAO English language proficiency requirements
Your personalised, online solution to meeting ICAO English language proficiency requirements 1 Acknowledged as leaders in their respective fields of Aviation English training and Speech Recognition technology,
209 THE STRUCTURE AND USE OF ENGLISH.
209 THE STRUCTURE AND USE OF ENGLISH. (3) A general survey of the history, structure, and use of the English language. Topics investigated include: the history of the English language; elements of the
Chapter 2 A Systems Approach to Leadership Overview
Chapter 2 A Systems Approach to Leadership Overview 2.1 Introduction Developing a high performance organisation is an appropriate and achievable goal in today s business environment. High performance organisations
Student diaries: using technology to produce alternative forms of feedback
Student diaries: using technology to produce alternative forms of feedback NUZ QUADRI University of Hertfordshire PETER BULLEN University of Hertfordshire AMANDA JEFFERIES University of Hertfordshire 214
1-04-10 Configuration Management: An Object-Based Method Barbara Dumas
1-04-10 Configuration Management: An Object-Based Method Barbara Dumas Payoff Configuration management (CM) helps an organization maintain an inventory of its software assets. In traditional CM systems,
Processing: current projects and research at the IXA Group
Natural Language Processing: current projects and research at the IXA Group IXA Research Group on NLP University of the Basque Country Xabier Artola Zubillaga Motivation A language that seeks to survive
Experiments with Signal-Driven Symbolic Prosody for Statistical Parametric Speech Synthesis
Experiments with Signal-Driven Symbolic Prosody for Statistical Parametric Speech Synthesis Fabio Tesser, Giacomo Sommavilla, Giulio Paci, Piero Cosi Institute of Cognitive Sciences and Technologies, National
Generating Training Data for Medical Dictations
Generating Training Data for Medical Dictations Sergey Pakhomov University of Minnesota, MN [email protected] Michael Schonwetter Linguistech Consortium, NJ [email protected] Joan Bachenko
CREDIT TRANSFER: GUIDELINES FOR STUDENT TRANSFER AND ARTICULATION AMONG MISSOURI COLLEGES AND UNIVERSITIES
CREDIT TRANSFER: GUIDELINES FOR STUDENT TRANSFER AND ARTICULATION AMONG MISSOURI COLLEGES AND UNIVERSITIES With Revisions as Proposed by the General Education Steering Committee [Extracts] A. RATIONALE
Contemporary Linguistics
Contemporary Linguistics An Introduction Editedby WILLIAM O'GRADY MICHAEL DOBROVOLSKY FRANCIS KATAMBA LONGMAN London and New York Table of contents Dedication Epigraph Series list Acknowledgements Preface
Emotion Detection from Speech
Emotion Detection from Speech 1. Introduction Although emotion detection from speech is a relatively new field of research, it has many potential applications. In human-computer or human-human interaction
Bachelors of Science Program in Communication Disorders and Sciences:
Bachelors of Science Program in Communication Disorders and Sciences: Mission: The SIUC CDS program is committed to multiple complimentary missions. We provide support for, and align with, the university,
Preface to the Second Edition
Preface to the Second Edition The second edition of this book contains both basic and more advanced material on non-life insurance mathematics. Parts I and II of the book cover the basic course of the
Proceedings of Meetings on Acoustics
Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Speech Communication Session 2aSC: Linking Perception and Production
Historical Linguistics. Diachronic Analysis. Two Approaches to the Study of Language. Kinds of Language Change. What is Historical Linguistics?
Historical Linguistics Diachronic Analysis What is Historical Linguistics? Historical linguistics is the study of how languages change over time and of their relationships with other languages. All languages
What Is Linguistics? December 1992 Center for Applied Linguistics
What Is Linguistics? December 1992 Center for Applied Linguistics Linguistics is the study of language. Knowledge of linguistics, however, is different from knowledge of a language. Just as a person is
A Short Introduction to Transcribing with ELAN. Ingrid Rosenfelder Linguistics Lab University of Pennsylvania
A Short Introduction to Transcribing with ELAN Ingrid Rosenfelder Linguistics Lab University of Pennsylvania January 2011 Contents 1 Source 2 2 Opening files for annotation 2 2.1 Starting a new transcription.....................
A GrAF-compliant Indonesian Speech Recognition Web Service on the Language Grid for Transcription Crowdsourcing
A GrAF-compliant Indonesian Speech Recognition Web Service on the Language Grid for Transcription Crowdsourcing LAW VI JEJU 2012 Bayu Distiawan Trisedya & Ruli Manurung Faculty of Computer Science Universitas
MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts
MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts Julio Villena-Román 1,3, Sara Lana-Serrano 2,3 1 Universidad Carlos III de Madrid 2 Universidad Politécnica de Madrid 3 DAEDALUS
Module Handbook for the Masters program in General Linguistics at the University of Bamberg
1 Otto-Friedrich-Universität Bamberg Humanities Faculty Department of General Linguistics http://www.uni-bamberg.de/aspra Module Handbook for the Masters program in General Linguistics at the University
The course is included in the CPD programme for teachers II.
Faculties of Humanities and Theology LLYU72, Swedish as a Second Language for Upper Secondary School Teachers, 60 credits Svenska som andraspråk för lärare i gymnasieskolan, 60 högskolepoäng First Cycle
COMMUNICATION COMMUNITIES CULTURES COMPARISONS CONNECTIONS. STANDARDS FOR FOREIGN LANGUAGE LEARNING Preparing for the 21st Century
COMMUNICATION COMMUNITIES CULTURES COMPARISONS CONNECTIONS STANDARDS FOR FOREIGN LANGUAGE LEARNING Preparing for the 21st Century Language and communication are at the heart of the human experience. The
Speech Analytics. Whitepaper
Speech Analytics Whitepaper This document is property of ASC telecom AG. All rights reserved. Distribution or copying of this document is forbidden without permission of ASC. 1 Introduction Hearing the
Annotation and Evaluation of Swedish Multiword Named Entities
Annotation and Evaluation of Swedish Multiword Named Entities DIMITRIOS KOKKINAKIS Department of Swedish, the Swedish Language Bank University of Gothenburg Sweden [email protected] Introduction
L2 EXPERIENCE MODULATES LEARNERS USE OF CUES IN THE PERCEPTION OF L3 TONES
L2 EXPERIENCE MODULATES LEARNERS USE OF CUES IN THE PERCEPTION OF L3 TONES Zhen Qin, Allard Jongman Department of Linguistics, University of Kansas, United States [email protected], [email protected]
THE VOICE OF LOVE. Trisha Belanger, Caroline Menezes, Claire Barboa, Mofida Helo, Kimia Shirazifard
THE VOICE OF LOVE Trisha Belanger, Caroline Menezes, Claire Barboa, Mofida Helo, Kimia Shirazifard University of Toledo, United States [email protected], [email protected], [email protected],
Analysing Qualitative Data
Analysing Qualitative Data Workshop Professor Debra Myhill Philosophical Assumptions It is important to think about the philosophical assumptions that underpin the interpretation of all data. Your ontological
The use of Praat in corpus research
The use of Praat in corpus research Paul Boersma Praat is a computer program for analysing, synthesizing and manipulating speech and other sounds, and for creating publication-quality graphics. It is open
SASSC: A Standard Arabic Single Speaker Corpus
SASSC: A Standard Arabic Single Speaker Corpus Ibrahim Almosallam, Atheer AlKhalifa, Mansour Alghamdi, Mohamed Alkanhal, Ashraf Alkhairy The Computer Research Institute King Abdulaziz City for Science
Annotation Pro Software Speech signal visualisation, part 1
Annotation Pro Software Speech signal visualisation, part 1 [email protected] katarzyna.klessa.pl Katarzyna Klessa ` Topics of the class 1. Introduction: annotation of speech recordings 2. Annotation Pro
Online experiments with the Percy software framework experiences and some early results
Online experiments with the Percy software framework experiences and some early results Christoph Draxler BAS Bavarian Archive of Speech Signals Institute of Phonetics and Speech Processing Ludwig-Maximilian
Visual History Archive in the Social Scientific Research Some remarks and experiences from the user's perspective
Visual History Archive in the Social Scientific Research Some remarks and experiences from the user's perspective Annual CLARIN Mee.ng October 22, 2013 Prague Jakub Mlynář Malach Centre for Visual History
Things to remember when transcribing speech
Notes and discussion Things to remember when transcribing speech David Crystal University of Reading Until the day comes when this journal is available in an audio or video format, we shall have to rely
Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words
, pp.290-295 http://dx.doi.org/10.14257/astl.2015.111.55 Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words Irfan
in Language, Culture, and Communication
22 April 2013 Study Plan M. A. Degree in Language, Culture, and Communication Linguistics Department 2012/2013 Faculty of Foreign Languages - Jordan University 1 STUDY PLAN M. A. DEGREE IN LANGUAGE, CULTURE
MASTER OF PHILOSOPHY IN ENGLISH AND APPLIED LINGUISTICS
University of Cambridge: Programme Specifications Every effort has been made to ensure the accuracy of the information in this programme specification. Programme specifications are produced and then reviewed
Annotation in Language Documentation
Annotation in Language Documentation Univ. Hamburg Workshop Annotation SEBASTIAN DRUDE 2015-10-29 Topics 1. Language Documentation 2. Data and Annotation (theory) 3. Types and interdependencies of Annotations
A Survey on Bilingual Teaching in Higher Education Institute in the Northeast of China
ISSN 1798-4769 Journal of Language Teaching and Research, Vol. 1, No. 4, pp. 353-357, July 2 2 ACADEMY PUBLISHER Manufactured in Finland. doi:.44/jltr.1.4.353-357 A Survey on Bilingual Teaching in Higher
1. Introduction to Spoken Dialogue Systems
SoSe 2006 Projekt Sprachdialogsysteme 1. Introduction to Spoken Dialogue Systems Walther v. Hahn, Cristina Vertan {vhahn,vertan}@informatik.uni-hamburg.de Content What are Spoken dialogue systems? Types
ARMORVOX IMPOSTORMAPS HOW TO BUILD AN EFFECTIVE VOICE BIOMETRIC SOLUTION IN THREE EASY STEPS
ARMORVOX IMPOSTORMAPS HOW TO BUILD AN EFFECTIVE VOICE BIOMETRIC SOLUTION IN THREE EASY STEPS ImpostorMaps is a methodology developed by Auraya and available from Auraya resellers worldwide to configure,
ENGLISH FILE Pre-intermediate
Karen Ludlow New ENGLISH FILE Pre-intermediate and the Common European Framework of Reference 2 INTRODUCTION What is this booklet for? The aim of this booklet is to give a clear and simple introduction
General Syllabus for Third Cycle Studies for the Degree of Doctor in Cognitive Science
GENERAL SYLLABUS 21 March 2013 Reg. no HT 2013/50 1 Department of Philosophy Cognitive Science General Syllabus for Third Cycle Studies for the Degree of Doctor in Cognitive Science This syllabus was approved
KNOWLEDGE-BASED IN MEDICAL DECISION SUPPORT SYSTEM BASED ON SUBJECTIVE INTELLIGENCE
JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol. 22/2013, ISSN 1642-6037 medical diagnosis, ontology, subjective intelligence, reasoning, fuzzy rules Hamido FUJITA 1 KNOWLEDGE-BASED IN MEDICAL DECISION
