CLST Annual Report 2010

Size: px
Start display at page:

Download "CLST Annual Report 2010"

Transcription

1 CLST Annual Report 2010 CLST is the Centre for Language and Speech Technology. CLST operates as a separate unit within the Faculty of Arts of the Radboud University Nijmegen. CLST is active in an impressive number of projects in various domains. In 2010 we participated in and acquired projects in the domains of: - ehumanities & datamining - Computer Assisted Language Learning - Communication in health care - Forensic speech processing Our projects are funded through grants from the EU, NWO (the Netherlands Organisation for Scientific Research), The Dutch language Union (STEVIN-programme), the Ministry of Economic affairs (IOP), STW (Technology Foundation; part of NWO), CLARIN-NL. A number of projects are carried out for companies (speech transcriptions, music detection). During 2010, twenty-eight researchers were actively involved in CLST projects. CLST can look back on a stable and fruitful year both with respect to its project port folio and to its financial situation, as the brief account below demonstrates. Board Board members in 2010 were: L. Boves (chairman), A. van Kemenade, D. Iskra, D. Kenyon-Jackson and N. Oostdijk. N. Schröder is advisor of the Board. Final responsibility lies with the Dean of the Faculty. CLST s executive director is Dr H. van den Heuvel. Projects Below we present an overview of the research topics and projects 1 in which CLST was involved in ehumanities & datamining: - BATS( Speaker tracking and topic detection in audio archives) - TM4IP (Text Mining for Intellectual Property) - LOHW (Living Oral History Workbench ASR-based access to 250 interviews with veterans & development of a web-based annotation tool) Improving automatic speech recognition: - Autonomata Too (recognition of names) - MIDAS (noise robust speech recognition) - SOMS (Second Opinion Music Recognition System as distinguished from speech) Computational modeling of language acquisition and processing: - ACMoLA (A Computational Model of Language Acquisition) Computer assisted second language learning and speech technology: - DISCO (ASR-based CALL for training oral proficiency In Dutch) - TST-AAP (a demosystem for pronunciation training of Dutch) 1 See for detailed information on all projects: 1

2 - FASOP (Corrective feedback and acquisition of syntax in oral proficiency) - My Pronunciation Coach (Development of a pronunciation trainer for English as second language) Communication in Health Care; - Optifox (Automatic optimization of the settings of a cochlear implant using automatic pronunciation assessment techniques) - FeetBack (combining body sensors and persuasive messages to incite patients to healthy behavior) Production & standardisation of language resources (corpora and tools): - SoNaR (a 500 M words text corpus of written Dutch) - AAM-LR (Demonstrator web application for annotation of audio recordings) - Adelheid (demonstrator web application for tagging historical Dutch texts) - INTER-VIEWs (curation of 250 interviews see LOHW and implementation of a search tool) - TQE (demonstrator web application for automatic quality evaluation of phonetic transcriptions in audio) CLST also participates in Training Networks funded by the EC in the Marie Curie Programme: - S2S (exploring how humans and computers understand speech using fine phonetic details) - SCALE (Speech Communication with Adaptive Learning) - Bbfor2 (Bayesian Biometrics for forensics) Five projects were completed in 2010: Autonomata Too, LOHW, SOMS, TM4IP, Veni-project Odette Scharenborg, Acquisition Our cooperation with the University s Language and Communication institute Radboud in to Languages 2 resulted in a successful application for a Valorisation Grant Phase 2 from STW for a developing a demonstrator web application focusing on pronunciation training for Dutch learners of English. This application uses speech technology to detect pronunciation errors. This should result in a spin-off company bringing the product (called My Pronunciation Coach) to the market by the end of The Sint Maartenskliniek acquired funding from ZonMW for a project together with CLST aiming at a digital poli webapplication in which language and speech technology will be developed to improve patients communication possibilities through chat-by-click, word prediction, speech synthesis, and speech recognition. Another new project in health care is OptiFox. This is an FP7 SME project from the EC. Aim of the project is automatic optimization of the settings of a cochlear implant using dedicated speech processing software in an application which supports the audiologist in a clinical setting. CLST provides algorithms and speech technology for automatic pronunciation assessment. Suzan Verberne acquired a Google Digital Humanities Award for a research project on extracting factoids from Dutch texts

3 Participation in new initiatives and awareness CLST members are involved in European initiatives with potential impact on the landscape of language and speech technology: - CLARIN, which is a large-scale pan-european collaborative effort to create, coordinate and make language resources and technology available and readily usable in the Humanities and Social Sciences. 3 - FLaReNet, which aims at developing a common vision of the area of language resources and language technologies, and fostering a European strategy for consolidating the sector, thus enhancing competitiveness at EU level and worldwide. 4 At national level CLST keeps track of new trends and developments as well. To that end, CLST participates at Board level in NOTaS. This is a foundation in which national companies and knowledge centers cooperate to stimulate the development and application of language and speech technology in the Dutch language. Personnel Three temporary staff member left CLST in 2010 and found employment elsewhere. Two other temporary staff members had an appointment in a Marie Curie network project and moved to a new position in their location of origin. Three new employees were added to the team: one PhD student, a junior researcher and one post-doc. At the end of 2010 CLST consisted of three tenure staff members, five temporary staff members, ten PhD students, and three members with a temporary part-time posting from the Department of Linguistics, summing up to 19.1 fte. The institute s daily management is in the hands of a director with the support of a secretary. The table below shows a comprehensive overview of the fte distribution in CLST s personnel. NWO-like projects Other external Overhead funding Tenure staff PhD students Other temporary staff Secondments Dept Linguistics Director & Secretary 0.4 Sum Annual performance interviews were held with all staff members. These interviews serve to achieve a match between personal ambitions, job contents and career opportunities of individual employees. CLST reserves part of its budget for training and education of its personnel. Most staff members followed one or more courses, most of which were offered by the Radboud University as part of its career policy

4 Customer Satisfaction For most of its projects CLST is obliged to write progress reports. These progress reports follow the templates offered by the funding bodies (STEVIN, EU, CLARIN-NL). Customer satisfaction in other projects (typically with companies) is monitored by or telephone. Customer responses are almost without exception very positive. Public Relations Every CLST research employee visited at least one international conference or workshop. Attendance of workshops is, as a rule, restricted to conferences to which the employee contributes an accepted paper. In all contributions the affiliation to CLST and the Radboud University is mentioned. Further PR activities in 2010 were: o Website updates and improvement of its position in rank orders of search engines o Folders and brochures (Dutch, English) o Publicity in newspapers o Periodicals for a general or professional audience such as DIXIT, Kennislink, Onze Taal, Levende Talen Henk van den Heuvel, Director CLST [email protected] 4

5 Publications Aimetti, G., Moore, R.K. & Bosch, L.F.M. ten (2010). Discovering an Optimal Set of Minimally Contrasting Acoustic Speech Units: A Point of Focus for Whole-Word Pattern Matching. In Proceedings of interspeech 2010 (pp ). Makuhari, Japan. Altosaar, T., Bosch, L.F.M. ten, Aimetti, G., Koniaris, Chr., Demuynck, K. & Heuvel, H. van den (2010). A Speech Corpus for Modeling Language Acquisition: CAREGIVER. In Proceedings of LREC 2010 (pp. CD). Malta. Beijer, L., Rietveld, T., Beers, M. van, Slangen, R., Heuvel, H. van den, Swart, B.J.M. de & Geurts, A.C.H. (2010). E-learning based Speech Therapy (EST): a web application for speech training. Telemedicine Journal and e-health, 16(2), Bergmann, C., Gubian, M. & Boves, L.W.J. (2010). Modelling the effect of speaker familiarity and noise on infant word recognition. In Proceedings of Interspeech 2010 (pp. CD). Makuhari, Japan. Bergmann, C., Paulus, M.A. & Fikkert, J.P.M. (2010). A closer look at pronoun comprehension: comparing different methods. In J. Costa, A. Castro, M. Lobo & F. Pratas (Eds.), Language Acquisition and Development: Proceedings of GALA 2009 (pp ). Cambridge: Cambridge Scholars Publishing. Bosch, L.F.M. ten & Boves, L.W.J. (2010). Language Acquisition and Cross-Modal Associations: Computational Simulation of the Result of Infant Studies. In Proceedings of Interspeech 2010 (pp ). Makuhari, Japan. Bruijn, M. de, Bosch, L.F.M. ten, Kuik, D.J., Langendijk, J.A., Leemans, C.R. & Verdonck-de Leeuw, I.M. (2010). Neural network analysis to assess hypernasality in patients treated for oral or oropharyngeal cancer. In Proceedings 28th World Congress of the International Association of Logopedics and Phoniatrics (pp ). Athene, Griekenland. Bruijn, M. de, Bosch, L.F.M. ten, Kuik, D.J., Langendijk, J.A., Leemans, C.R. & Verdonck-de Leeuw, I.M. (2010). Objective assessment of speech quality in patients treated for a tumour in the oral cavity or oropharynx. In Proceedings 28th World Congress of the International Association of Logopedics and Phoniatrics (pp ). Athene, Griekenland. Cheng, C., Xu, Y. & Gubian, M. (2010). Exploring the Mechanism of Tonal Contraction in Taiwan Mandarin. In Proceedings of Interspeech Makuhari, Japan. Cucchiarini, C., Doremalen, J.J.H.C. van & Strik, H. (2010). Fluency in non-native read and spontaneous speech. In Proceedings of DiSS-LPSS Joint Workshop 2010 (pp. CD). Tokyo, Japan. Cucchiarini, C. (2010). Language resources, speech technology and language learning: how to establish a virtuous circle. CLARIN-EU Newsletter. [Online]. Available from: [ ]. D'hondt, E.K.L. & Verberne, S. (2010). CLEF-IP 2010: Prior Art Retrieval using the different sections in patent documents. In Proceedings of the Conference on Multilingual and Multimodal Information Access Evaluation (CLEF 2010), CLEF-IP workshop. Padua, Italy. Available from: [ ]. D'hondt, E.K.L., Verberne, S., Oostdijk, N.H.J. & Boves, L.W.J. (2010). Re-ranking based on Syntactic Dependencies in Prior-Art Retrieval. In Proceedings of the Dutch-Belgium Information Retrieval workshop 2010 (DIR-2010) (pp ). Nijmegen. Doremalen, J.J.H.C. van, Cucchiarini, C. & Strik, H. (2010). Optimizing automatic speech recognition for low-proficient non-native speakers. EURASIP Journal on Audio, Speech and Music Processing. [Online]. Available from: [ ]. Doremalen, J.J.H.C. van, Cucchiarini, C. & Strik, H. (2010). Phoneme Errors in Read and Spontaneous Non-Native Speech: Relevance for CAPT System Development. In Proceedings of the SLaTE-2010 workshop (pp. CD). Tokyo, Japan. Doremalen, J.J.H.C. van, Cucchiarini, C. & Strik, H. (2010). Using Non-Native Error Patterns to Improve Pronunciation Verification. In Proceedings of Interspeech 2010 (pp. CD). Tokyo, Japan. Doremalen, J.J.H.C. van, Strik, H. & Cucchiarini, C. (2010, mei 26). Automatic Speech Recognition in CALL: The Essential Role of Adaptation. Katholieke Universiteit Leuven, Campus Kortrijk (KULAK), Kortrijk, ITEC. 5

6 Gemmeke, J.F., Hamme, H. Van, Cranen, B. & Boves, L.W.J. (2010). Compressive Sensing for Missing Data Imputation in Noise Robust Speech Recognition. IEEE Journal of Selected Topics in Signal Processing, 4(2), Gemmeke, J.F., Cranen, B. & Remes, U. (2010). Sparse imputation for large vocabulary noise robust {ASR}. Computer Speech & Language. [Online]. Available from: [ ]. Gemmeke, J.F. & Virtanen, T. (2010). Artificial and online acquired noise dictionaries for noise robust {ASR}. In Proceedings of Interspeech 2010 (pp ). Makuhari, Japan. Gemmeke, J.F. & Virtanen, T. (2010). Noise robust exemplar-based connected digit recognition. In Proceedings ICASSP 2010 (pp. DVD). Dallas, Texas, USA. Gemmeke, J.F., Remes, U. & Palomäki, K.J. (2010). Observation uncertainty measures for sparse imputation. In Proceedings of Interspeech 2010 (pp ). Makuhari, Japan. Gubian, M., Cangemi, F. & Boves, L.W.J. (2010). Automatic and Data Driven Pitch Contour Manipulation with Functional Data Analysis. In M. Hasegawa-Johnson (Ed.), Proceedings of Speech Prosody (pp :1-4). Chicago, IL, USA. Gubian, M., Bergmann, C. & Boves, L.W.J. (2010). Investigating word learning processes in an artificial agent. In Development and Learning (ICDL), Proceedings of the Ninth IEEE International Conference (pp ). Available from: [ ]. Halteren, H. van (2010). In bewerking. ACM Transactions on Speech and Language Processing (TSLP). Heijden, M van der, Hinne, M., Verberne, S., Hoenkamp, E.C.M., Weide, T. van der & Kraaij, W. (2010). When is a query a question? Reconstructing wh-requests from ad hoc-queries. In B. Croft (Ed.), Query Representation and Understanding : Workshop of the 33rd Annual International ACM in Information Retrieval on Research and Development SIGIR Conference (pp ). [S.l]: ACM. Heuvel, H. van den, Horik, R. van, Scagliola, S.I., Sanders, E.P. & Witkamp, P. (2010). The VeteranTapes: Research Corpus, Fragment Processing Tool, and Enhanced Publications for the e-humanities. In Proceedings of LREC (pp. CD). Malta. Heuvel, H. van den (2010). Interviews, geknipt voor onderzoek. DIXIT, 7(1), Huijbregts, M.A.H. & Leeuwen, D.A. van (2010). Towards automatic speaker retrieval for large multimedia archives. In Proceedings AIEMPro. Firenze: ACM. Jongenelen, M.M., Hoeken, H. & Hendriks, B.C. (2010). Explicit and implicit messages in telehealth. Information Design Journal, 18(3), Oostdijk, N.H.J., D'hondt, E.K.L., Halteren, H. van & Verberne, S. (2010). Genre and Domain in Patent Texts. In Proceedings of The 3rd International Workshop on Patent Information Retrieval at CIKM 2010 (pp ). Oostdijk, N.H.J., Verberne, S. & Koster, C.H.A. (2010). Constructing a broad coverage lexicon for text mining in the patent domain. In Proceedings of LREC 2010 (pp ). Malta: European Language Resources Association (ELRA). Penning de Vries, B.W.F., Cucchiarini, C., Strik, H. & Hout, R.W.N.M. van (2010). The Role of Corrective Feedback in Second Language Learning: New Research Possibilities by Combining CALL and Speech Technology'. In Proceedings of L2WS (pp. USB). Tokyo, Japan. Réveil, B., Martens, J-P. & Heuvel, H. van den (2010). Improving Proper Name Recognition by Adding Automatically Learned Pronunciation Variants to the Lexicon. In Proceedings of LREC 2010 (pp. CD). Malta. Reynaert, M., Oostdijk, N.H.J., Clercq, O. De, Heuvel, H. van den & Jong, F. de (2010). Balancing SoNaR: IPR versus Processing Issues in a 500-Million-Word Written Dutch Reference Corpus. In Proceedings of LREC 2010 (pp ). Malta: European Language Resources Association (ELRA). Ruiter, M., Rietveld, T., Cucchiarini, C., Krahmer, E. & Strik, H. (2010). Human Language Technology and communicative disabilities: Requirements and possibilities for the future. In Proceedings of LREC 2010 (pp ). Malta. Sanders, E.P. & Heuvel, H. van den (2010). Automatic Pronunciation Error Detection in Repetitor. In Proceedings L2WS 2010, Tokyo, Japan. Tokyo. Available from: [ ]. Scharenborg, O.E. & Boves, L.W.J. (2010). Computational modelling of spoken-word recognition processes: design choices and evaluation. Pragmatics and Cognition, 18(1),

7 Strik, H., Colpaert, J., Doremalen, J.J.H.C. van & Cucchiarini, C. (2010). Language resources and CALL applications: speech data and speech technology in the DISCO project. In Proceedings E-learning Workshop 2010 (pp. CD). Valletta, Malta. Strik, H., Loo, J. van de, Doremalen, J.J.H.C. van & Cucchiarini, C. (2010). Practicing Syntax in Spoken Interaction: Automatic Detection of Syntactic Errors in Non-Native Utterances. In Proceedings of the SLaTE 2010 Workshop (pp. CD). Tokyo, Japan. Sun, Y., Bosch, L.F.M. ten & Boves, L.W.J. (2010). Hybrid HMM/BLSTM-RNN for Robust Speech Recogntion. In Proceedings of Text, Speech and Dialogue (TSD) 2010 (pp ). Brno, Czech Republic. Sun, Y., Gemmeke, J.F., Cranen, B., Bosch, L.F.M. ten & Boves, L.W.J. (2010). Using a DBN to Integrate Sparse Classification and GMM-Based ASR. In Proceedings of Interspeech 2010 (pp ). Makuhari, Japan. Verberne, S., Halteren, H. van, Raaijmakers, S., Theijssen, D.L. & Boves, L.W.J. (2010). Learning to Rank for Why-Question Answering. Information Retrieval, online. [Online]. Available from: [ ]. Verberne, S., Boves, L.W.J., Oostdijk, N.H.J. & Coppen, P.A.J.M. (2010). What is not in the Bag of Words for Why-QA? Computational Linguistics, 36(2), Verberne, S., Hinne, M., Heijden, M van der, Hoenkamp, E., Kraaij, W. & Weide, T. van der (2010). How does the Library Searcher Behave? A Contrastive Study of Library Search against Ad-hoc Search. In Proceedings of the Conference on Multilingual and Multimodal Information Access Evaluation (CLEF 2010), logclef workshop (pp. 1-5). [S.l:: s.n]. Verberne, S., Vogel, Merijn & D'hondt, E.K.L. (2010). Patent classification experiments with the Linguistic Classification System LCS. In Proceedings of the Conference on Multilingual and Multimodal Information Access Evaluation (CLEF 2010), CLEF-IP workshop (pp. CD). Verberne, S., D'hondt, E.K.L., Oostdijk, N.H.J. & Koster, C.H.A. (2010). Quantifying the Challenges in Parsing Patent Claims. In Proceedings of the 1st International Workshop on Advances in Patent Information Retrieval at ECIR 2010 (pp ). [S.l]: [s.n]. Versteegh, M.H. (2010). Active Word Learning Under Uncertain Input Conditions. In Proceedings of Interspeech 2010 (pp ). Makuhari, Japan. Virtanen, T., Gemmeke, J.F. & Hurmalainen, A. (2010). State-based labelling for a sparse representation of speech and its application to robust speech recognition. In Proceedings of Interspeech 2010 (pp ). Makuhari, Japan. Zellers, M., Gubian, M. & Post, B. (2010). Redescribing Intonational Categories with Functional Data Analysis. In Proceedings of Interspeech 2010 (pp. CD). Makuhari, Japan. 7

How To Help The Netherlands Language And Speech Technology (Lst) Project

How To Help The Netherlands Language And Speech Technology (Lst) Project CLST Annual Report 2012 CLST is the Centre for Language and Speech Technology. CLST operates as a separate unit within the Faculty of Arts of the Radboud University Nijmegen. CLST was founded in January

More information

e-health Helmer Strik en vele anderen 22 November 2013

e-health Helmer Strik en vele anderen 22 November 2013 e-health Helmer Strik en vele anderen 22 November 2013 http://www.ru.nl/clst/projects/communication-health/ This presentation OSTT : 'Ontwikkelcentrum voor Spraak- en Taal-Technologie ComPoli : Communicatie

More information

Turkish Radiology Dictation System

Turkish Radiology Dictation System Turkish Radiology Dictation System Ebru Arısoy, Levent M. Arslan Boaziçi University, Electrical and Electronic Engineering Department, 34342, Bebek, stanbul, Turkey [email protected], [email protected]

More information

Introduction to the Digital Literacy Instructor

Introduction to the Digital Literacy Instructor Introduction to the Digital Literacy Instructor Helmer Strik Department of Linguistics Centre for Language and Speech Technology (CLST), The Netherlands Newcastle meeting millennium bridge Target group

More information

Technology for society. STW annual conference 2015

Technology for society. STW annual conference 2015 Technology for society STW annual conference 2015 Starting date: June 2015 Version: 11 June 2015 page 1 of 5 Content Content... 1 Open your Mind... 2 Conditions... 2 Who can apply?... 2 Funding... 2 General

More information

Master of Science in Artificial Intelligence

Master of Science in Artificial Intelligence Master of Science in Artificial Intelligence Options: Engineering and Computer Science (ECS) Speech and Language Technology (SLT) Big Data Analytics (BDA) Faculty of Engineering Science Faculty of Science

More information

MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts

MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts Julio Villena-Román 1,3, Sara Lana-Serrano 2,3 1 Universidad Carlos III de Madrid 2 Universidad Politécnica de Madrid 3 DAEDALUS

More information

Dutch-Flemish Research Programme for Dutch Language and Speech Technology. stevin programme. project results

Dutch-Flemish Research Programme for Dutch Language and Speech Technology. stevin programme. project results Dutch-Flemish Research Programme for Dutch Language and Speech Technology stevin programme project results Contents Design and editing: Erica Renckens (www.ericarenckens.nl) Design front cover: www.nieuw-eken.nl

More information

Master of Artificial Intelligence

Master of Artificial Intelligence Faculty of Engineering Faculty of Science Master of Artificial Intelligence Options: Engineering and Computer Science (ECS) Speech and Language Technology (SLT) Cognitive Science (CS) K.U.Leuven Masters.

More information

Transcription bottleneck of speech corpus exploitation

Transcription bottleneck of speech corpus exploitation Transcription bottleneck of speech corpus exploitation Caren Brinckmann Institut für Deutsche Sprache, Mannheim, Germany Lesser Used Languages and Computer Linguistics (LULCL) II Nov 13/14, 2008 Bozen

More information

Study Plan for Master of Arts in Applied Linguistics

Study Plan for Master of Arts in Applied Linguistics Study Plan for Master of Arts in Applied Linguistics Master of Arts in Applied Linguistics is awarded by the Faculty of Graduate Studies at Jordan University of Science and Technology (JUST) upon the fulfillment

More information

Metadata for Corpora PATCOR and Domotica-2

Metadata for Corpora PATCOR and Domotica-2 July 2013 Technical Report: KUL/ESAT/PSI/1303 Metadata for Corpora PATCOR and Domotica-2 Tessema N., Ons B., van de Loo J., Gemmeke J.F., De Pauw G., Daelemans W., Van hamme H. Katholieke Universiteit

More information

The Knowledge Sharing Infrastructure KSI. Steven Krauwer

The Knowledge Sharing Infrastructure KSI. Steven Krauwer The Knowledge Sharing Infrastructure KSI Steven Krauwer 1 Why a KSI? Building or using a complex installation requires specialized skills and expertise. CLARIN is no exception. CLARIN is populated with

More information

Query term suggestion in academic search

Query term suggestion in academic search Query term suggestion in academic search Suzan Verberne 1, Maya Sappelli 1,2, and Wessel Kraaij 2,1 1. Institute for Computing and Information Sciences, Radboud University Nijmegen 2. TNO, Delft Abstract.

More information

How To Help The European Single Market With Data And Information Technology

How To Help The European Single Market With Data And Information Technology Connecting Europe for New Horizon European activities in the area of Big Data Márta Nagy-Rothengass DG CONNECT, Head of Unit "Data Value Chain" META-Forum 2013, 19 September 2013, Berlin OUTLINE 1. Data

More information

Specialty Answering Service. All rights reserved.

Specialty Answering Service. All rights reserved. 0 Contents 1 Introduction... 2 1.1 Types of Dialog Systems... 2 2 Dialog Systems in Contact Centers... 4 2.1 Automated Call Centers... 4 3 History... 3 4 Designing Interactive Dialogs with Structured Data...

More information

Programme Specification (Postgraduate) Date amended: March 2012

Programme Specification (Postgraduate) Date amended: March 2012 Programme Specification (Postgraduate) Date amended: March 2012 1. Programme Title(s): MA in Applied Linguistics and TESOL 2. Awarding body or institution: University of Leicester 3. a) Mode of study Campus:

More information

Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast

Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Hassan Sawaf Science Applications International Corporation (SAIC) 7990

More information

D2.4: Two trained semantic decoders for the Appointment Scheduling task

D2.4: Two trained semantic decoders for the Appointment Scheduling task D2.4: Two trained semantic decoders for the Appointment Scheduling task James Henderson, François Mairesse, Lonneke van der Plas, Paola Merlo Distribution: Public CLASSiC Computational Learning in Adaptive

More information

Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg

Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg March 1, 2007 The catalogue is organized into sections of (1) obligatory modules ( Basismodule ) that

More information

Efficient diphone database creation for MBROLA, a multilingual speech synthesiser

Efficient diphone database creation for MBROLA, a multilingual speech synthesiser Efficient diphone database creation for, a multilingual speech synthesiser Institute of Linguistics Adam Mickiewicz University Poznań OWD 2010 Wisła-Kopydło, Poland Why? useful for testing speech models

More information

For students entering in 2004 Date of specification: September 2003

For students entering in 2004 Date of specification: September 2003 MSc Speech and Language Therapy Awarding Institution: The University of Reading Teaching Institution: The University of Reading Relevant subject benchmarking group: Speech and Language Therapy Faculty

More information

Education and Assessment Regulations Language and Communication Research Master s Programme Tilburg University 2007-2008 1

Education and Assessment Regulations Language and Communication Research Master s Programme Tilburg University 2007-2008 1 Education and Assessment Regulations Tilburg University 2007-2008 1 Section 1 General Provisions Article 1.1 Applicability of the regulations These regulations apply to the educational programme and the

More information

Survey Results: Requirements and Use Cases for Linguistic Linked Data

Survey Results: Requirements and Use Cases for Linguistic Linked Data Survey Results: Requirements and Use Cases for Linguistic Linked Data 1 Introduction This survey was conducted by the FP7 Project LIDER (http://www.lider-project.eu/) as input into the W3C Community Group

More information

Teaching Framework. Framework components

Teaching Framework. Framework components Teaching Framework Framework components CE/3007b/4Y09 UCLES 2014 Framework components Each category and sub-category of the framework is made up of components. The explanations below set out what is meant

More information

MA APPLIED LINGUISTICS AND TESOL

MA APPLIED LINGUISTICS AND TESOL MA APPLIED LINGUISTICS AND TESOL Programme Specification 2015 Primary Purpose: Course management, monitoring and quality assurance. Secondary Purpose: Detailed information for students, staff and employers.

More information

Master of Arts in Teaching English to Speakers of Other Languages (MA TESOL)

Master of Arts in Teaching English to Speakers of Other Languages (MA TESOL) Master of Arts in Teaching English to Speakers of Other Languages (MA TESOL) Overview Teaching English to non-native English speakers requires skills beyond just knowing the language. Teachers must have

More information

Term extraction for user profiling: evaluation by the user

Term extraction for user profiling: evaluation by the user Term extraction for user profiling: evaluation by the user Suzan Verberne 1, Maya Sappelli 1,2, Wessel Kraaij 1,2 1 Institute for Computing and Information Sciences, Radboud University Nijmegen 2 TNO,

More information

Telecommunication (120 ЕCTS)

Telecommunication (120 ЕCTS) Study program Faculty Cycle Software Engineering and Telecommunication (120 ЕCTS) Contemporary Sciences and Technologies Postgraduate ECTS 120 Offered in Tetovo Description of the program This master study

More information

L2 EXPERIENCE MODULATES LEARNERS USE OF CUES IN THE PERCEPTION OF L3 TONES

L2 EXPERIENCE MODULATES LEARNERS USE OF CUES IN THE PERCEPTION OF L3 TONES L2 EXPERIENCE MODULATES LEARNERS USE OF CUES IN THE PERCEPTION OF L3 TONES Zhen Qin, Allard Jongman Department of Linguistics, University of Kansas, United States [email protected], [email protected]

More information

Text-To-Speech Technologies for Mobile Telephony Services

Text-To-Speech Technologies for Mobile Telephony Services Text-To-Speech Technologies for Mobile Telephony Services Paulseph-John Farrugia Department of Computer Science and AI, University of Malta Abstract. Text-To-Speech (TTS) systems aim to transform arbitrary

More information

CONTROL, COMMUNICATION & SIGNAL PROCESSING (CCSP)

CONTROL, COMMUNICATION & SIGNAL PROCESSING (CCSP) CONTROL, COMMUNICATION & SIGNAL PROCESSING (CCSP) KEY RESEARCH AREAS Data compression for speech, audio, images, and video Digital and analog signal processing Image and video processing Computer vision

More information

Your boldest wishes concerning online corpora: OpenSoNaR and you

Your boldest wishes concerning online corpora: OpenSoNaR and you 1 Your boldest wishes concerning online corpora: OpenSoNaR and you Martin Reynaert TiCC, Tilburg University and CLST, Radboud Universiteit Nijmegen TiCC Colloquium, Tilburg University. October 16th, 2013

More information

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina Graduate Co-op Students Information Manual Department of Computer Science Faculty of Science University of Regina 2014 1 Table of Contents 1. Department Description..3 2. Program Requirements and Procedures

More information

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper CAST-2015 provides an opportunity for researchers, academicians, scientists and

More information

Master of Arts in Linguistics Syllabus

Master of Arts in Linguistics Syllabus Master of Arts in Linguistics Syllabus Applicants shall hold a Bachelor s degree with Honours of this University or another qualification of equivalent standard from this University or from another university

More information

Programme Specification (Postgraduate)

Programme Specification (Postgraduate) Programme Specification (Postgraduate) 1. Programme Title(s): MSc/PGDip*/PGCert* Data Analysis for Business Intelligence *Exit awards only 2. Awarding body or institution: University of Leicester 3. a)

More information

The Information Literacy (IL) and Information Technology (IT) Teaching and Learning Circle. Summary, Overview and Index

The Information Literacy (IL) and Information Technology (IT) Teaching and Learning Circle. Summary, Overview and Index The Information Literacy (IL) and Information Technology (IT) Teaching and Learning Circle Summary, Overview and Index The IL-IT Teaching and Learning Circle met during the summer and fall of 2002. The

More information

How To Create A Clarin Metadata Infrastructure

How To Create A Clarin Metadata Infrastructure Creating & Testing CLARIN Metadata Components Folkert de Vriend (1), Daan Broeder (2), Griet Depoorter (3), Laura van Eerten (3), Dieter van Uytvanck (2) 1) Meertens Institute Joan Muyskenweg 25, Amsterdam,

More information

Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System

Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System Oana NICOLAE Faculty of Mathematics and Computer Science, Department of Computer Science, University of Craiova, Romania [email protected]

More information

Robust Methods for Automatic Transcription and Alignment of Speech Signals

Robust Methods for Automatic Transcription and Alignment of Speech Signals Robust Methods for Automatic Transcription and Alignment of Speech Signals Leif Grönqvist ([email protected]) Course in Speech Recognition January 2. 2004 Contents Contents 1 1 Introduction 2 2 Background

More information

C E D A T 8 5. Innovating services and technologies for speech content management

C E D A T 8 5. Innovating services and technologies for speech content management C E D A T 8 5 Innovating services and technologies for speech content management Company profile 25 years experience in the market of transcription/reporting services; Cedat 85 Group: Cedat 85 srl Subtitle

More information

Why major in linguistics (and what does a linguist do)?

Why major in linguistics (and what does a linguist do)? Why major in linguistics (and what does a linguist do)? Written by Monica Macaulay and Kristen Syrett What is linguistics? If you are considering a linguistics major, you probably already know at least

More information

The University of Amsterdam s Question Answering System at QA@CLEF 2007

The University of Amsterdam s Question Answering System at QA@CLEF 2007 The University of Amsterdam s Question Answering System at QA@CLEF 2007 Valentin Jijkoun, Katja Hofmann, David Ahn, Mahboob Alam Khalid, Joris van Rantwijk, Maarten de Rijke, and Erik Tjong Kim Sang ISLA,

More information

Language Technologies in Europe: trends and future perspectives

Language Technologies in Europe: trends and future perspectives Language Technologies in Europe: trends and future perspectives European Commission Márta Nagy-Rothengass, DG CONNECT Data Value Chain Unit Berlin, 24 January 2013 META - Creation of language resources

More information

Combining textual and non-textual features for e-mail importance estimation

Combining textual and non-textual features for e-mail importance estimation Combining textual and non-textual features for e-mail importance estimation Maya Sappelli a Suzan Verberne b Wessel Kraaij a a TNO and Radboud University Nijmegen b Radboud University Nijmegen Abstract

More information

APPLYING MFCC-BASED AUTOMATIC SPEAKER RECOGNITION TO GSM AND FORENSIC DATA

APPLYING MFCC-BASED AUTOMATIC SPEAKER RECOGNITION TO GSM AND FORENSIC DATA APPLYING MFCC-BASED AUTOMATIC SPEAKER RECOGNITION TO GSM AND FORENSIC DATA Tuija Niemi-Laitinen*, Juhani Saastamoinen**, Tomi Kinnunen**, Pasi Fränti** *Crime Laboratory, NBI, Finland **Dept. of Computer

More information

Carla Simões, [email protected]. Speech Analysis and Transcription Software

Carla Simões, t-carlas@microsoft.com. Speech Analysis and Transcription Software Carla Simões, [email protected] Speech Analysis and Transcription Software 1 Overview Methods for Speech Acoustic Analysis Why Speech Acoustic Analysis? Annotation Segmentation Alignment Speech Analysis

More information

BLIND SOURCE SEPARATION OF SPEECH AND BACKGROUND MUSIC FOR IMPROVED SPEECH RECOGNITION

BLIND SOURCE SEPARATION OF SPEECH AND BACKGROUND MUSIC FOR IMPROVED SPEECH RECOGNITION BLIND SOURCE SEPARATION OF SPEECH AND BACKGROUND MUSIC FOR IMPROVED SPEECH RECOGNITION P. Vanroose Katholieke Universiteit Leuven, div. ESAT/PSI Kasteelpark Arenberg 10, B 3001 Heverlee, Belgium [email protected]

More information

How To Teach Technical English In English

How To Teach Technical English In English ICT IN TEACHING PROFESSIONAL ENGLISH FOR MECHANICAL ENGINEERING Michaela Vesela Brno University of Technology, Faculty of Mechanical Engineering, Institute of Foreign Languages Brno / Czech Republic E-mail:

More information

German Speech Recognition: A Solution for the Analysis and Processing of Lecture Recordings

German Speech Recognition: A Solution for the Analysis and Processing of Lecture Recordings German Speech Recognition: A Solution for the Analysis and Processing of Lecture Recordings Haojin Yang, Christoph Oehlke, Christoph Meinel Hasso Plattner Institut (HPI), University of Potsdam P.O. Box

More information

PDF hosted at the Radboud Repository of the Radboud University Nijmegen

PDF hosted at the Radboud Repository of the Radboud University Nijmegen PDF hosted at the Radboud Repository of the Radboud University Nijmegen The following full text is a publisher's version. For additional information about this publication click this link. http://hdl.handle.net/2066/60933

More information

CLARIN: Common Language Resources and Technology Infrastructure

CLARIN: Common Language Resources and Technology Infrastructure CLARIN: Common Language Resources and Technology Infrastructure Tamás Váradi, Peter Wittenburg, Steven Krauwer, Martin Wynne, Kimmo Koskenniemi Hungarian Academy of Sciences (Budapest), MPI for Psycholinguistics

More information

DBA International Programme

DBA International Programme DBA International Programme Fulfilling Ambitions www.shu.ac.uk www.bsn.eu/internationaldba Fulfilling Ambitions Sheffield Business School at Sheffield Hallam University and Business School Netherlands

More information

Giuseppe Riccardi, Marco Ronchetti. University of Trento

Giuseppe Riccardi, Marco Ronchetti. University of Trento Giuseppe Riccardi, Marco Ronchetti University of Trento 1 Outline Searching Information Next Generation Search Interfaces Needle E-learning Application Multimedia Docs Indexing, Search and Presentation

More information

Understanding Impaired Speech. Kobi Calev, Morris Alper January 2016 Voiceitt

Understanding Impaired Speech. Kobi Calev, Morris Alper January 2016 Voiceitt Understanding Impaired Speech Kobi Calev, Morris Alper January 2016 Voiceitt Our Problem Domain We deal with phonological disorders They may be either - resonance or phonation - physiological or neural

More information

Introduction to Pattern Recognition

Introduction to Pattern Recognition Introduction to Pattern Recognition Selim Aksoy Department of Computer Engineering Bilkent University [email protected] CS 551, Spring 2009 CS 551, Spring 2009 c 2009, Selim Aksoy (Bilkent University)

More information

Charles van Leeuwen Universiteit Maastricht Cercles Conference Frankfurt an der Oder September 2006

Charles van Leeuwen Universiteit Maastricht Cercles Conference Frankfurt an der Oder September 2006 Success & Failure in English Medium Teaching: the Maastricht Experience Charles van Leeuwen Universiteit Maastricht Cercles Conference Frankfurt an der Oder September 2006 The Maastricht context (1): English

More information

Post-doctoral researcher, Faculty of Translation Studies, University College Ghent

Post-doctoral researcher, Faculty of Translation Studies, University College Ghent Lieve Macken Faculty of Translation Studies Groot-Brittanniëlaan 45 B-9000, Ghent Belgium email: [email protected] url: lt3.hogent.be/en/people/lieve-macken/ Born: June 17, 1968 Belgium Nationality:

More information

CHARTES D'ANGLAIS SOMMAIRE. CHARTE NIVEAU A1 Pages 2-4. CHARTE NIVEAU A2 Pages 5-7. CHARTE NIVEAU B1 Pages 8-10. CHARTE NIVEAU B2 Pages 11-14

CHARTES D'ANGLAIS SOMMAIRE. CHARTE NIVEAU A1 Pages 2-4. CHARTE NIVEAU A2 Pages 5-7. CHARTE NIVEAU B1 Pages 8-10. CHARTE NIVEAU B2 Pages 11-14 CHARTES D'ANGLAIS SOMMAIRE CHARTE NIVEAU A1 Pages 2-4 CHARTE NIVEAU A2 Pages 5-7 CHARTE NIVEAU B1 Pages 8-10 CHARTE NIVEAU B2 Pages 11-14 CHARTE NIVEAU C1 Pages 15-17 MAJ, le 11 juin 2014 A1 Skills-based

More information

The SweDat Project and Swedia Database for Phonetic and Acoustic Research

The SweDat Project and Swedia Database for Phonetic and Acoustic Research 2009 Fifth IEEE International Conference on e-science The SweDat Project and Swedia Database for Phonetic and Acoustic Research Jonas Lindh and Anders Eriksson Department of Philosophy, Linguistics and

More information

Effect of Captioning Lecture Videos For Learning in Foreign Language 外 国 語 ( 英 語 ) 講 義 映 像 に 対 する 字 幕 提 示 の 理 解 度 効 果

Effect of Captioning Lecture Videos For Learning in Foreign Language 外 国 語 ( 英 語 ) 講 義 映 像 に 対 する 字 幕 提 示 の 理 解 度 効 果 Effect of Captioning Lecture Videos For Learning in Foreign Language VERI FERDIANSYAH 1 SEIICHI NAKAGAWA 1 Toyohashi University of Technology, Tenpaku-cho, Toyohashi 441-858 Japan E-mail: {veri, nakagawa}@slp.cs.tut.ac.jp

More information

Career Paths for the CDS Major

Career Paths for the CDS Major College of Education COMMUNICATION DISORDERS AND SCIENCES (CDS) Advising Handout Career Paths for the CDS Major Speech Language Pathology Speech language pathologists work with individuals with communication

More information

Evaluating grapheme-to-phoneme converters in automatic speech recognition context

Evaluating grapheme-to-phoneme converters in automatic speech recognition context Evaluating grapheme-to-phoneme converters in automatic speech recognition context Denis Jouvet, Dominique Fohr, Irina Illina To cite this version: Denis Jouvet, Dominique Fohr, Irina Illina. Evaluating

More information

The Development of Multimedia-Multilingual Document Storage, Retrieval and Delivery System for E-Organization (STREDEO PROJECT)

The Development of Multimedia-Multilingual Document Storage, Retrieval and Delivery System for E-Organization (STREDEO PROJECT) The Development of Multimedia-Multilingual Storage, Retrieval and Delivery for E-Organization (STREDEO PROJECT) Asanee Kawtrakul, Kajornsak Julavittayanukool, Mukda Suktarachan, Patcharee Varasrai, Nathavit

More information

Recent advances in Digital Music Processing and Indexing

Recent advances in Digital Music Processing and Indexing Recent advances in Digital Music Processing and Indexing Acoustics 08 warm-up TELECOM ParisTech Gaël RICHARD Telecom ParisTech (ENST) www.enst.fr/~grichard/ Content Introduction and Applications Components

More information

MASTER OF PHILOSOPHY IN ENGLISH AND APPLIED LINGUISTICS

MASTER OF PHILOSOPHY IN ENGLISH AND APPLIED LINGUISTICS University of Cambridge: Programme Specifications Every effort has been made to ensure the accuracy of the information in this programme specification. Programme specifications are produced and then reviewed

More information

DATABASES. http://db.cs.utwente.nl. Peter M.G. Apers

DATABASES. http://db.cs.utwente.nl. Peter M.G. Apers DATABASES http://db.cs.utwente.nl Peter M.G. Apers DATABASES ORGANIZATION Staff Peter Apers (full prof) Willem Jonker (full prof) Djoerd Hiemstra (associate prof; information retrieval) Maurice van Keulen

More information

Comparing Support Vector Machines, Recurrent Networks and Finite State Transducers for Classifying Spoken Utterances

Comparing Support Vector Machines, Recurrent Networks and Finite State Transducers for Classifying Spoken Utterances Comparing Support Vector Machines, Recurrent Networks and Finite State Transducers for Classifying Spoken Utterances Sheila Garfield and Stefan Wermter University of Sunderland, School of Computing and

More information

The First Online 3D Epigraphic Library: The University of Florida Digital Epigraphy and Archaeology Project

The First Online 3D Epigraphic Library: The University of Florida Digital Epigraphy and Archaeology Project Seminar on Dec 19 th Abstracts & speaker information The First Online 3D Epigraphic Library: The University of Florida Digital Epigraphy and Archaeology Project Eleni Bozia (USA) Angelos Barmpoutis (USA)

More information

Programme Specification (Undergraduate) Date amended: 28 August 2015

Programme Specification (Undergraduate) Date amended: 28 August 2015 Programme Specification (Undergraduate) Date amended: 28 August 2015 1. Programme Title(s) and UCAS code(s): BSc Mathematics and Actuarial Science (including year in industry option) 2. Awarding body or

More information

Domain Classification of Technical Terms Using the Web

Domain Classification of Technical Terms Using the Web Systems and Computers in Japan, Vol. 38, No. 14, 2007 Translated from Denshi Joho Tsushin Gakkai Ronbunshi, Vol. J89-D, No. 11, November 2006, pp. 2470 2482 Domain Classification of Technical Terms Using

More information

Research Portfolio. Beáta B. Megyesi January 8, 2007

Research Portfolio. Beáta B. Megyesi January 8, 2007 Research Portfolio Beáta B. Megyesi January 8, 2007 Research Activities Research activities focus on mainly four areas: Natural language processing During the last ten years, since I started my academic

More information