Gender assignment to English loanwords. (Variation in) Gender assignment: Status quo. Variation in gender assignment: What?

A cross-linguistic comparison of variation in gender assignment to English loanwords in German and Polish

Marcus Callies, Philipps Universität Marburg
Eva Ogiermann, Carl von Ossietzky Universität Oldenburg
Konrad Szczesniak, Uniwersytet Śląski Sosnowiec
DGfS Bamberg

Gender assignment to English loanwords

Onysko (2007): Interaction of gender rules and a default hierarchy
"Principle and rule approach": All anglicisms that do not take the default gender (i.e. masculine, considered the unmarked gender) receive their gender by specific rules (based on German):
- Primary rules: semantic and morphological rules (s-/m-rules)
- Secondary criteria: phonological rules and lexical-conceptual equivalence
Rules operate on top of an underlying default gender hierarchy:
- If s-/m-rules apply, secondary criteria are unimportant; conflicts are settled in favour of the default
- If no s-/m-rules apply, secondary criteria can influence gender assignment before the default applies

(Variation in) Gender assignment: Status quo

- Many studies do not explicitly address variation in gender assignment (also known as gender vacillation, gender wavering, German "Genusschwankung")
- Mostly (diachronic) dictionary studies; a few others use comparatively small corpora, often limited to a single newspaper or news magazine (e.g. Onysko 2007)
- Only rarely have dictionaries and/or corpora been supplemented by other types of data (Carstensen 1980, Schulte-Beckhausen 2002 and Fischer 2005 worked with native speaker informants)

Variation in gender assignment: What?

- 'True' instances of variation: Only those where the different genders do not indicate a difference in meaning, i.e. gender does not separate lexical items (Onysko 2007). Excluded, for example: das Crossover ('mix of styles, genres') vs. der Crossover ('type of car')
- "Genusschwankung par excellence": No morphophonological or semantic rule available, hence inter-speaker variation between masculine and neuter (Talanga 1987)

Carstensen 1980, Talanga 1987, Kilarski 2001, Schulte-Beckhausen 2002, Chan 2005, Fischer 2005, Onysko 2007
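The interaction of rules and default gender attributed to Onysko (2007) on the slide "Gender assignment to English loanwords" can be read as a small decision procedure. The following is only a minimal sketch of that reading, not Onysko's own formalisation: the function name `assign_gender`, the representation of rule outcomes as sets of gender labels, and the choice to fall back on the default when secondary criteria disagree are all assumptions made for illustration.

```python
DEFAULT_GENDER = "m"  # masculine is treated as the unmarked default on the slide


def assign_gender(primary, secondary):
    """Pick a gender from the predictions of two rule tiers (hypothetical helper).

    primary   -- set of genders ("m", "f", "n") predicted by semantic and
                 morphological rules (s-/m-rules)
    secondary -- set of genders predicted by phonological rules and
                 lexical-conceptual equivalents
    """
    if primary:
        # s-/m-rules apply: secondary criteria are ignored.
        if len(primary) == 1:
            return next(iter(primary))
        # Conflicting s-/m-rules are settled in favour of the default.
        return DEFAULT_GENDER
    # No s-/m-rule applies: secondary criteria may decide before the default.
    if len(secondary) == 1:
        return next(iter(secondary))
    # Assumption: no secondary prediction, or several competing ones,
    # leaves the default gender.
    return DEFAULT_GENDER


# A morphological rule predicting neuter overrides a secondary prediction.
print(assign_gender({"n"}, {"m"}))
# Without s-/m-rules, a single lexical equivalent can decide.
print(assign_gender(set(), {"f"}))
```

In this reading, the case with several competing secondary predictions is exactly where the slides locate variation between speakers, so a deterministic fallback is a simplification.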

Variation in gender assignment: How much?

Most studies claim that there is comparatively little variation in gender assignment to loanwords:
- Kilarski (2001): Variation between 3.5% for English loans in Swedish and 5.8% in Danish among all assigned nouns
- Schulte-Beckhausen (2002): Comparison of dictionary, corpus and informant data shows significant differences between the data types
- Nettmann-Multanowska (2003): Wavering more characteristic of German than of Polish; 57 (10%) vs. 9 (3%) instances in her corpus
- Chan (2005): Highest degree of variation between masculine and neuter (141 out of 3105 entries, approx. 5%), M/F 0.9%, F/N 0.6%, M/N/F 0.1%
- Fischer (2005): Highest degree of variation between masculine and neuter; variation is highest with simple (monosyllabic), unaffixed words without any formal marking, especially if the meaning of the word is unknown
- Onysko (2007): Only a minimal amount of gender variation in his SPIEGEL corpus (not quantified)

Variation in gender assignment: How much?

Why there is more variation than has been assumed to date:
- Bias towards dictionary data: Dictionaries have been shown to be inconsistent and often document normative, expert language use
- Using only dictionaries or one type of newspaper / news magazine increases the likelihood of bias towards a specific in-house writing style or policy on anglicisms (see e.g. Yang 1990 and Onysko 2007, who used Der Spiegel)
- Rules for gender assignment are usually explained on the basis of linguists' expert knowledge, but most native speakers are linguistically untrained: linguists' intuitions do not match those of other native speakers

Variation in gender assignment: Factors

Factors that are assumed to influence variation:
- Gender rules based on formal properties (morphophonological, semantic)
- Regional differences
- Recency of borrowing / diachronic factor
- Frequency
- Context of presentation
- Number of available lexical-conceptual equivalents in the recipient language (the more equivalents are available, the more variation: Login > die Anmeldung, das Passwort, der Benutzername)
- Bilingual competence / knowledge of word meaning (variation is higher when the meaning is unknown)

Carstensen 1980, Talanga 1987, Schulte-Beckhausen 2002, Fischer 2005, Onysko 2007

Variation in gender assignment: Why?

- Variation understood as conflicts between assignment rules, i.e. competition among rival factors (e.g. morphophonological and semantic criteria), and competition among lexical equivalents
- Variation also explained in terms of inter-speaker variation and indeterminacy of the closest lexical equivalent; thus relegated to arbitrariness and idiosyncrasies, or to non-linguistic factors ("Sprachgefühl")

Carstensen 1980, Kilarski 2001, Schulte-Beckhausen 2002, Fischer 2005, Onysko 2007

Research questions

1. Taking into account corpus and informant data, how much variation is there?
2. What are the factors that cause variation in gender assignment to loanwords, and which factors make variation less likely?
3. Do these factors differ in the two languages, and if so, how do they play out?

Corpus study (1)

- 10 of the initial test items later used in the experimental study were subjected to pilot corpus studies
- German: Berliner Zeitung newspaper corpus (in DWDS, 252 million words); low-frequency items also checked in COSMAS II (W-öff, Archiv der geschriebenen Korpora, alle öffentlichen Korpora, 2.2 billion words)
- Polish: web-as-corpus study using collocational patterns to retrieve instances marked for gender

Corpus study (2)

Corpus study (3)

Clearly, variation can be found in corpora. But...
- For some more recent borrowings, frequency counts are very low
- Many instances are inconclusive because they are not marked for gender or are ambiguous

Gender assignment in German (1)

Phonological rules
- Monosyllabic words: 24 phonological rules (Köpcke 1982); simplified version: masculine is the unmarked gender for monosyllabic words

Morphological rules
- Masc: -er / -ling / -rich
- Fem: -e / -keit / -ung / -schaft
- Neut: -sel / -tum / -nis

Gender assignment in German (2)

Semantic rules
a) semantic field analogy
- Masc: days of the week / alcoholic drinks / spices
- Fem: names of trees / numbers
- Neut: colours / town names / languages
b) hypernymy
- der Wagen > der Honda, der Twingo
- die Zigarette > die Marlboro, die Camel
- das Hotel > das Hilton, das Meryan

Biological gender (can be outranked by morphological rules)
- die Frau but das Fräulein

Gender assignment in Polish (1)

Phonological rules (Auslaut)
- Masc: all consonants: dom 'house'; /i/: dyżurny 'employee on call'; /a/: dentysta 'dentist'
- Fem: /a/: krowa 'cow'; some consonants: noc 'night'
- Neut: /o/: jajko 'egg'; /e/: słońce 'sun'; /ę/: niemowlę 'infant'; /um/: muzeum 'museum'

Morphological rules (suffixes echo phonological rules)
- Masc: -ik / -iciel / -izm
- Fem: -ka / -acja
- Neut: -anie / -cie / -stwo

Gender assignment in Polish (2)

Biological gender (outranks phonological rules)
- Masc: mężczyzna 'man'
- Fem: babsztyl 'woman' (pejorative)

Suggested hierarchy: biological gender > phonological rules > semantic rules
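The Polish Auslaut rules and the suggested ranking of biological gender above them can be illustrated with a short sketch. It is a deliberately crude approximation, not the authors' procedure: the function name `guess_polish_gender` and the `biological_gender` parameter are hypothetical, spelling is used as a stand-in for the final sound, and the minority patterns listed on the slide (masculine nouns in /a/ such as dentysta, feminine nouns in a consonant such as noc) are not modelled.

```python
# Word-final spellings taken from the slide's neuter examples
# (jajko, słońce, niemowlę, muzeum).
NEUTER_ENDINGS = ("um", "o", "e", "ę")


def guess_polish_gender(word, biological_gender=None):
    """Guess 'm', 'f' or 'n' from the end of a Polish noun (hypothetical helper).

    biological_gender -- optional 'm' or 'f'; if given it wins, mirroring the
                         ranking 'biological gender > phonological rules'.
    """
    if biological_gender is not None:
        return biological_gender
    lowered = word.lower()
    if lowered.endswith(NEUTER_ENDINGS):
        return "n"
    if lowered.endswith("a"):
        # Majority pattern only; masculine nouns in -a are not captured.
        return "f"
    # Everything else, including all consonants, defaults to masculine.
    return "m"


for noun in ("dom", "krowa", "jajko", "muzeum", "browser"):
    print(noun, guess_polish_gender(noun))
print("mężczyzna", guess_polish_gender("mężczyzna", biological_gender="m"))
```

Because the sketch works on spelling, it would treat a loanword such as movie as ending in "e", although the final sound is /i:/; a serious implementation would need a phonemic transcription.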

Hypotheses

1. Variation is low(er) with words that have a marker that is a strong trigger for a specific gender (morphophonological and semantic rules are so strong that variation is marginal)
   - {-er} = masculine, {-ing} = neuter
   - bitch = feminine, coach = masculine
   - ending in a consonant = masculine (Polish)
2. Variation is high(er) with words that have no marker/feature that determines a specific gender
3. Variation increases if there is no single clear lexical equivalent (i.e. a broad range of possible lexical equivalents or none at all)
4. Variation increases if the meaning of a word is unknown

Experimental study (1)

26 loanwords (nouns), selected according to formal and semantic criteria so as to be applicable as test items in both languages:
- biological gender / semantic field / cognate: bitch, coach; alcopop, shake, techno, domain
- words with morphological marking: browser, voucher, casting, posting
- deverbal nouns with particle: download, update, take-off, login
- words ending in a special sound: preview, crew; movie, cookie; badge, stage; label, jingle
- simplex, monosyllabic words: gate, sale, slot, gig

Experimental study (2)

- Format: Gender assignment by providing the definite article (in German) or inflectional suffix(es) (in Polish) for words in contextualised sentences (translational equivalents)
- Further questions about informants' knowledge of the meaning of the word (known, unknown, not sure) and potential lexical equivalents in the native language
- Questionnaire administered to 146 German and 100 Polish native speaker informants, all university students of English in their mid-twenties

Results (1)

Variation is measured in terms of a diversity index (Simpson's D) that takes into account
- the range of gender categories present among the answers (how many), and
- the relative abundances, i.e. the evenness or equitability with which the answers are distributed among the different gender categories (how often a gender is represented)

D is a figure between 0 and 1:
- If it is 1, the answers are spread equally across the given categories (e.g. 50 masc., 50 fem., 50 neut.)
- If it is 0, all answers fall into one category; there is no variation at all
In short: the higher the D value, the more variation

Results (2)

Words for which there is a high degree of variation
- the most frequently mentioned gender category does not exceed 90%
- have a broad range of genders mentioned (range between 3 and 7)
- D value is higher than 0.4
- show intra-speaker variation (mostly masculine/neuter)

Results (3)

Results (4)

- Variation only in German: voucher, take-off, login, techno
- Variation only in Polish: browser, download, update, domain, crew, stage, label, gate, sale
  (indicating the different weight that gender rules have in the two languages)
- Variation in both languages: alcopop, preview, movie, cookie, badge, jingle
- High variation usually correlates with uncertainty about, or lack of knowledge of, the word's meaning (few exceptions)
- /u:/, /ɪ, i:/, /dʒ/ and /əl/ are sounds that trigger variation in Polish
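The diversity index described under Results (1) can be sketched in code. The slides do not give the formula, so this is an assumption: the plain Gini-Simpson value 1 − Σp² reaches only about 0.67 for three equally frequent categories, so the sketch rescales it by its maximum, (1 − Σp²) / (1 − 1/k) for k categories, which matches the stated behaviour (1 for an even spread, 0 when all answers agree). The names `simpson_d` and `is_high_variation` are hypothetical; the 90% and 0.4 thresholds are the ones listed under Results (2).

```python
from collections import Counter


def simpson_d(answers, categories=("m", "f", "n")):
    """Normalised Simpson diversity of gender answers (assumed formula).

    answers    -- iterable of category labels, one per informant
    categories -- the k categories offered; must contain at least two
    """
    counts = Counter(answers)
    unknown = set(counts) - set(categories)
    if unknown:
        raise ValueError(f"unexpected categories: {sorted(unknown)}")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no answers given")
    k = len(categories)
    sum_of_squares = sum((counts[c] / total) ** 2 for c in categories)
    # Rescale so that an even spread over all k categories yields 1.
    return (1 - sum_of_squares) / (1 - 1 / k)


def is_high_variation(answers, categories=("m", "f", "n")):
    """Apply two of the slide's criteria: top category <= 90% and D > 0.4."""
    counts = Counter(answers)
    top_share = max(counts.values()) / sum(counts.values())
    return top_share <= 0.9 and simpson_d(answers, categories) > 0.4


even = ["m"] * 50 + ["f"] * 50 + ["n"] * 50
uniform = ["m"] * 150
print(round(simpson_d(even), 3), round(simpson_d(uniform), 3))
```

With this normalisation, a word answered as masculine by 80 informants and as neuter by 20 gets D = 0.48 and counts as highly variable, whereas a 95/5 split does not.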

Results (5)

Conclusion

Hierarchy of factors that determine variation in gender assignment?
gender marker > lexical equivalent > knowledge of word meaning

Variation is highest if
a) rules that operate on gender markers are unavailable and cannot be applied
b) there is no single clear lexical equivalent in the respective language (broad range of lexical equivalents mentioned and a high percentage of answers in the category "no lexical equivalent given")
c) the meaning of the word is unknown or uncertain

Thank you! Danke! Dziękujemy bardzo!

References

Baran, Dominika (2003), "English loanwords in Polish and the question of gender assignment", Penn Working Papers in Linguistics 8:1.
Carstensen, Broder (1980), "Das Genus englischer Fremd- und Lehnwörter im Deutschen", in Viereck, Wolfgang (ed.), Studien zum Einfluß der englischen Sprache auf das Deutsche. Tübingen: Narr.
Chan, Sze-Mun (2005), Genusintegration: eine systematische Untersuchung zur Genuszuweisung englischer Entlehnungen in der deutschen Sprache. München: Iudicium.
Fischer, Rudolf-Josef (2005), Genuszuordnung. Theorie und Praxis am Beispiel des Deutschen. Frankfurt/Main: Peter Lang.
Gregor, Bernd (1983), Genuszuordnung: Das Genus englischer Lehnwörter im Deutschen. Tübingen: Niemeyer.
Kilarski, Marcin (2001), Gender assignment of English loanwords in Danish, Swedish and Norwegian. PhD dissertation, Adam Mickiewicz University.
Köpcke, Klaus-Michael (1982), Untersuchungen zum Genussystem der deutschen Gegenwartssprache. Tübingen: Niemeyer.
Nettmann-Multanowska, Kinga (2003), English Loanwords in Polish and German after 1945: Orthography and Morphology. Frankfurt/Main: Peter Lang.
Onysko, Alexander (2007), Anglicisms in German. Borrowing, Lexical Productivity, and Written Codeswitching. Berlin: Walter de Gruyter.
Schulte-Beckhausen, Marion (2002), Genusschwankung bei englischen, französischen, italienischen und spanischen Lehnwörtern im Deutschen: Eine Untersuchung auf der Grundlage deutscher Wörterbücher seit Frankfurt/Main: Peter Lang.
Talanga, Tomislav (1987), Das Phänomen der Genusschwankung in der deutschen Gegenwartssprache untersucht nach Angaben neuerer Wörterbücher der deutschen Standardsprache. PhD dissertation, University of Bonn.


More information

Efficient diphone database creation for MBROLA, a multilingual speech synthesiser

Efficient diphone database creation for MBROLA, a multilingual speech synthesiser Efficient diphone database creation for, a multilingual speech synthesiser Institute of Linguistics Adam Mickiewicz University Poznań OWD 2010 Wisła-Kopydło, Poland Why? useful for testing speech models

More information

Contemporary Linguistics

Contemporary Linguistics Contemporary Linguistics An Introduction Editedby WILLIAM O'GRADY MICHAEL DOBROVOLSKY FRANCIS KATAMBA LONGMAN London and New York Table of contents Dedication Epigraph Series list Acknowledgements Preface

More information

Accessibility and simple language: experiences with automatic compliance tools

Accessibility and simple language: experiences with automatic compliance tools Accessibility and simple language: experiences with automatic compliance tools Dr. Carlos A Velasco Fraunhofer Institute for Applied Information Technology FIT http://www.fit.fraunhofer.de/ W3C-Tag: Das

More information

COMM 104 Introduction to Communications Fall 2014 3 credits Core E&C GE-AH for BAB and CS COMM 130 Introduction to Journalism Fall 2014 3 credits

COMM 104 Introduction to Communications Fall 2014 3 credits Core E&C GE-AH for BAB and CS COMM 130 Introduction to Journalism Fall 2014 3 credits COMM 104 COMM 130 COMM 238 Introduction to Communications This course provides a comprehensive introduction to the field of communication studies. Students will examine the components of human communication

More information

The Vocabulary Size Test Paul Nation 23 October 2012

The Vocabulary Size Test Paul Nation 23 October 2012 The Vocabulary Size Test Paul Nation 23 October 2012 Available versions There is a 14,000 version containing 140 multiple-choice items, with 10 items from each 1000 word family level. A learner s total

More information

The shape of things to come: Young researchers in Germany

The shape of things to come: Young researchers in Germany The shape of things to come: Young researchers in Germany Universitätsverband zur Qualifizierung des wissenschaftlichen Nachwuchses in Deutschland German University Association of Advanced Graduate Training

More information

ARABIC PERSON NAMES RECOGNITION BY USING A RULE BASED APPROACH

ARABIC PERSON NAMES RECOGNITION BY USING A RULE BASED APPROACH Journal of Computer Science 9 (7): 922-927, 2013 ISSN: 1549-3636 2013 doi:10.3844/jcssp.2013.922.927 Published Online 9 (7) 2013 (http://www.thescipub.com/jcs.toc) ARABIC PERSON NAMES RECOGNITION BY USING

More information

German Language Support Package

German Language Support Package German Language Support Package September 2014 Dear Parents and Students of Goethe International Charter School, Welcome to a new and exciting school year of great learning experiences and success! The

More information

Projektgruppe. Categorization of text documents via classification

Projektgruppe. Categorization of text documents via classification Projektgruppe Steffen Beringer Categorization of text documents via classification 4. Juni 2010 Content Motivation Text categorization Classification in the machine learning Document indexing Construction

More information

Green Building Water Technology: Use of Renewable Water Resources in Multi-Storey Buildings

Green Building Water Technology: Use of Renewable Water Resources in Multi-Storey Buildings Green Building Water Technology: Use of Renewable Water Resources in Multi-Storey Buildings Speaker: Erwin Nolde, Berlin email: [email protected] Wasserforum für f r die EMA-Region 11. und 12. März

More information

Dial-Up VPN auf eine Juniper

Dial-Up VPN auf eine Juniper Dial-Up VPN auf eine Juniper Gateway Konfiguration Phase 1 Konfiguration Create a user that is used to define the phase1 id parameters. Navigate to the following screen using the tree pane on the left

More information

Literacy and Numeracy for Learning and Life

Literacy and Numeracy for Learning and Life Literacy and Numeracy for Learning and Life Literacy Session 3 Hospital Schools Literacy and Numeracy for Learning and Life Link Teacher Communication Leading Literacy Core Team Collaboration Assessment

More information

Psychology G4470. Psychology and Neuropsychology of Language. Spring 2013.

Psychology G4470. Psychology and Neuropsychology of Language. Spring 2013. Psychology G4470. Psychology and Neuropsychology of Language. Spring 2013. I. Course description, as it will appear in the bulletins. II. A full description of the content of the course III. Rationale

More information

Pragmatic analysis of hotel websites in terms of interpersonal relationships. Theses of the PhD dissertation by. Kovács Péterné Dudás Andrea

Pragmatic analysis of hotel websites in terms of interpersonal relationships. Theses of the PhD dissertation by. Kovács Péterné Dudás Andrea Pragmatic analysis of hotel websites in terms of interpersonal relationships Theses of the PhD dissertation by Kovács Péterné Dudás Andrea Eötvös Loránd University Faculty of Humanities Doctoral School

More information

Acquisition of German pluralization rules in monolingual and multilingual children

Acquisition of German pluralization rules in monolingual and multilingual children Studies in Second Language Learning and Teaching Department of English Studies, Faculty of Pedagogy and Fine Arts, Adam Mickiewicz University, Kalisz SSLLT 3 (4). 2013. 551-580 http://www.ssllt.amu.edu.pl

More information

An Incrementally Trainable Statistical Approach to Information Extraction Based on Token Classification and Rich Context Models

An Incrementally Trainable Statistical Approach to Information Extraction Based on Token Classification and Rich Context Models Dissertation (Ph.D. Thesis) An Incrementally Trainable Statistical Approach to Information Extraction Based on Token Classification and Rich Context Models Christian Siefkes Disputationen: 16th February

More information

Studienverlaufspläne (Stand Oktober 2013)

Studienverlaufspläne (Stand Oktober 2013) Studienverlaufspläne (Stand Oktober 0) STUDIENVERLAUFSPLÄNE BA/BED Gültig nur für BA- und BEd-Studierende, die seit dem Winterester 0/ an der Universität Trier immatrikuliert sind. Alle anderen Studierenden

More information

Support verb constructions

Support verb constructions Support verb constructions Comments on Angelika Storrer s presentation Markus Egg Rijksuniversiteit Groningen Salsa-Workshop 2006 Outline of the comment Support-verb constructions (SVCs) and textual organisation

More information

CURRICULUM VITAE. M. Sc. Anne-Katharina Schiefele

CURRICULUM VITAE. M. Sc. Anne-Katharina Schiefele CURRICULUM VITAE Address: Department of Clinical Psychology and Psychotherapy, University of Trier, 54286 Trier, Germany TEL 0049 (0)651 201 2882 E-mail: [email protected] Birthday: November 30, 1987

More information

The Rise of Documentary Linguistics and a New Kind of Corpus

The Rise of Documentary Linguistics and a New Kind of Corpus The Rise of Documentary Linguistics and a New Kind of Corpus Gary F. Simons SIL International 5th National Natural Language Research Symposium De La Salle University, Manila, 25 Nov 2008 Milestones in

More information

Comparative Analysis on the Armenian and Korean Languages

Comparative Analysis on the Armenian and Korean Languages Comparative Analysis on the Armenian and Korean Languages Syuzanna Mejlumyan Yerevan State Linguistic University Abstract It has been five years since the Korean language has been taught at Yerevan State

More information

Coffee Break German. Lesson 09. Study Notes. Coffee Break German: Lesson 09 - Notes page 1 of 17

Coffee Break German. Lesson 09. Study Notes. Coffee Break German: Lesson 09 - Notes page 1 of 17 Coffee Break German Lesson 09 Study Notes Coffee Break German: Lesson 09 - Notes page 1 of 17 LESSON NOTES ICH SPRECHE EIN BISSCHEN DEUTSCH In this lesson you will learn how to deal with language problems

More information

WESTERNACHER OUTLOOK E-MAIL-MANAGER OPERATING MANUAL

WESTERNACHER OUTLOOK E-MAIL-MANAGER OPERATING MANUAL TABLE OF CONTENTS 1 Summary 3 2 Software requirements 3 3 Installing the Outlook E-Mail Manager Client 3 3.1 Requirements 3 3.1.1 Installation for trial customers for cloud-based testing 3 3.1.2 Installing

More information

Introduction. BM1 Advanced Natural Language Processing. Alexander Koller. 17 October 2014

Introduction. BM1 Advanced Natural Language Processing. Alexander Koller. 17 October 2014 Introduction! BM1 Advanced Natural Language Processing Alexander Koller! 17 October 2014 Outline What is computational linguistics? Topics of this course Organizational issues Siri Text prediction Facebook

More information

Sense-Tagging Verbs in English and Chinese. Hoa Trang Dang

Sense-Tagging Verbs in English and Chinese. Hoa Trang Dang Sense-Tagging Verbs in English and Chinese Hoa Trang Dang Department of Computer and Information Sciences University of Pennsylvania [email protected] October 30, 2003 Outline English sense-tagging

More information

ICAME Journal No. 24. Reviews

ICAME Journal No. 24. Reviews ICAME Journal No. 24 Reviews Collins COBUILD Grammar Patterns 2: Nouns and Adjectives, edited by Gill Francis, Susan Hunston, andelizabeth Manning, withjohn Sinclair as the founding editor-in-chief of

More information