Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 1]



Similar documents
Making a Dictionary in Ulaanbaatar:

Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov [Folie 1]

Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov [Folie 1]

The Use of Text Corpora in Lexical Research

AP WORLD LANGUAGE AND CULTURE EXAMS 2012 SCORING GUIDELINES

FOR TEACHERS ONLY The University of the State of New York

Search Engines Chapter 2 Architecture Felix Naumann

Elena Chiocchetti & Natascia Ralli (EURAC) Tanja Wissik & Vesna Lušicky (University of Vienna)

Successful Collaboration in Agile Software Development Teams

Name: Klasse: Datum: A. Was wissen Sie schon? What do you know already from studying Kapitel 1 in Vorsprung? True or false?

The finite verb and the clause: IP

Exemplar for Internal Assessment Resource German Level 1. Resource title: Planning a School Exchange

Exemplar for Internal Achievement Standard. German Level 1

German Language Resource Packet

Complex Predications in Argument Structure Alternations

HIERARCHICAL HYBRID TRANSLATION BETWEEN ENGLISH AND GERMAN

AP GERMAN LANGUAGE AND CULTURE EXAM 2015 SCORING GUIDELINES

German for beginners in 7 lessons

All the English here is taken from students work, both written and spoken. Can you spot the errors and correct them?

Mit einem Auge auf den mathema/schen Horizont: Was der Lehrer braucht für die Zukun= seiner Schüler

O CONNOR ebook How do I say that in English?

Coffee Break German Lesson 06

An Incrementally Trainable Statistical Approach to Information Extraction Based on Token Classification and Rich Context Models

Modalverben Theorie. learning target. rules. Aim of this section is to learn how to use modal verbs.

Machine Learning for natural language processing

Markus Dickinson. Dept. of Linguistics, Indiana University Catapult Workshop Series; February 1, 2013

Coffee Break German. Lesson 09. Study Notes. Coffee Break German: Lesson 09 - Notes page 1 of 17

Superiority: Syntax or Semantics? Düsseldorf Jul02. Jill devilliers, Tom Roeper, Jürgen Weissenborn Smith,Umass,Potsdam

Get Related Schatzsuche in SciFinder Scholar

GCE EXAMINERS' REPORTS. GERMAN AS/Advanced

Software / FileMaker / Plug-Ins Mailit 6 for FileMaker 10-13

Department of Geography - Birgit Sattler - University of Duisburg-Essen ILIAS. in geography and landscape architecture

Varieties of specification and underspecification: A view from semantics

LINGUISTIC SUPPORT IN "THESIS WRITER": CORPUS-BASED ACADEMIC PHRASEOLOGY IN ENGLISH AND GERMAN

Student Booklet. Name.. Form..

TIn 1: Lecture 3: Lernziele. Lecture 3 The Belly of the Architect. Basic internal components of the Pointers and data storage in memory

Supervisory Disclosure during a Financial Crisis: Evidence from the EU-wide Stress-Testing Exercises

Jetzt können Sie den Befehl 'nsradmin' auch für diverse Check-Operationen verwenden!

Vergleich der Versionen von Kapitel 1 des EU-GMP-Leitfaden (Oktober 2012) 01 July November Januar 2013 Kommentar Maas & Peither

Checklist Use this checklist to find out how much English you already know. Grundstufe 1 (Common European Framework: A1 Level)

Hände weg von Mississippi - Hands off Mississippi

Joseph Beuys. Selection of 30 prints from three large suites Suite Schwurhand (1980) Suite Zirkulationszeit (1982) Suite Tränen (1985)

NATIVE ADVERTISING, CONTENT MARKETING & CO. AUFBRUCH IN EIN NEUES GOLDENES ZEITALTER DES MARKETINGS?

How To Make A Germanian Stationery Brand From Japanese Quality Germanic Style

Voraussetzungen/ Prerequisites *for English see below*

(Incorporated as a stock corporation in the Republic of Austria under registered number FN m)

MODERN MATHEMATICS International Summer School for Students Participation Agreement

HYPO TIROL BANK AG. EUR 5,750,000,000 Debt Issuance Programme (the "Programme")

QCF Qualifications in Languages. German. Level 1

Formeller Brief Schreiben

Vorläufiges English Programme im akademischen Jahr 2015/2016 Preliminary English Programme in the Academic Year 2015/2016 *for English see below*

Vorläufiges English Programme im akademischen Jahr 2015/2016 Preliminary English Programme in the Academic Year 2015/2016 *for English see below*

Multilingual Term Extraction as a Service from Acrolinx. Ben Gottesman Michael Klemme Acrolinx CHAT2013

Lernsituation 9. Giving information on the phone. 62 Lernsituation 9 Giving information on the phone

It is also possible to combine courses from the English and the German programme, which is of course available for everyone!

English Programme im akademischen Jahr 2014/2015 English Programme in the Academic Year 2014/2015 *for English see below*

SYNTAX AND SEMANTICS OF CAUSAL DENN IN GERMAN TATJANA SCHEFFLER

or M W 11:00 a.m. 12:00 noon and by appointment 10 units of college German or equivalent

Benutzerfreundlich, tiefe Betriebskosten und hohe Sicherheit. Warum sich diese Ziele nicht widersprechen müssen

German Beginners. Stage 6 Syllabus. Preliminary and HSC Courses

Microsoft Nano Server «Tuva» Rinon Belegu

:09: [scheduler thread(5)]: AdvancedCardAllocation.GetAvailableCardsForChannel took 7 msec

Multipurpsoe Business Partner Certificates Guideline for the Business Partner

GCE EXAMINERS' REPORTS

How To Talk To A Teen Help

Information Systems 2

Replicating Portfolios Complex modelling made simple

Finest Laboratory Products

Linux & Docker auf Azure

FOR TEACHERS ONLY The University of the State of New York

GM0101S HEADSTART STUDENT GUIDE PREPARED IY THE DEFENSE LANGUAGE INSTITUTE FOREIGN LANGUAGE CENTER

Real-Time Identification of MWE Candidates in Databases from the BNC and the Web

How to Design a Scientific Poster

bound Pronouns

German Language Support Package

quick documentation Die Parameter der Installation sind in diesem Artikel zu finden:

I Textarbeit. Text 1. I never leave my horse

The University of Toronto. Fall 2009/German 100 Y

Year 7. Home Learning Booklet. French and German Skills

LEJ Langenscheidt Berlin München Wien Zürich New York

Course: German 1 Designated Six Weeks: Weeks 1 and 2. Assessment Vocabulary Instructional Strategies

Functional Analysis II Final Test, Funktionalanalysis II Endklausur,

Version 1.0: General Certificate of Secondary Education June German. (Specification 4665) Unit 4: Writing. Mark Scheme

Definition Science meets Business Conclusion. generated by en.wikipedia.org/serious games

Transcription:

Content 1. Empirical linguistics 2. Text corpora and corpus linguistics 3. Concordances 4. Application I: The German progressive 5. Part-of-speech tagging 6. Fequency analysis 7. Application II: Compounds 8. Co-occurrence analysis 9. Application III: Word senses in lexicography 10. Keyword analysis 9.1 Word senses in a bilingual dictionary 9.2 Sense detection by co-occurrence analysis 9.3 Corpus analysis software V: KWIC Finder Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 1] 9.1 Word senses in a bilingual dictionary Identification of word senses in lexicography Article for abziehen (literally: to pull off). Vietze, Hans-Peter: Wörterbuch Deutsch-Mongolisch. Berlin: DAO-Verlag 2002 [first edition, Leipzig 1981]. Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 2] 1

Lemma: abziehen Inflection: <32a> Grammatical variants 1: tr Translations general: Structure of the article 9.1 Word senses in a bilingual dictionary ( pull off?) specific 1: Fell ( coat/fur ) skin 2: Flüssigkeit ( liquid ) ( bottle?) 3: Math ( mathematics ) subtract 4: Typ ( typography ) run off Examples 1: das Rasiermesser ~ ( the straight blade razor ) sharpen 2: Rinde ~ ( bark ) pull off? 3: den Schlüssel ~ ( the key ) take out 2: intr Translations specific 1: sich entfernen go away 2: sich zurückziehen withdraw Examples 1: unverrichteterdinge ~ ( go away without achieving anything ) Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 3] 9.2 Sense detection by co-occurrence analysis Task: Find contemporary word senses of abziehen. Procedure: 1) Carry out co-occurrence analysis for abziehen (CCDB, Deutsches Referenzkorpus ). 2) Explore colloquial senses of abziehen by extracting concordances from internet chat rooms using KWIC finder. 3) Check internet-based neologism dictionaries. Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 4] 2

Korpusrecherchemethoden Step 1: co-occurrence analysis for abziehen (CCDB); function words not considered. meanings covered in Vietze meanings not covered in Vietze Truppen abziehen, to withdraw troups unverrichteter Dinge wieder abziehen, to go away without having achieved anything wurden zwei Punkte abgezogen, two points were deducted eine Show abziehen, to make a scene die Haut abziehen, peel (fruit), skin (an animal) Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 5] Korpusrecherchemethoden vom Einkommen abziehen, to deduct from the income den Zündschlüssel abziehen, to take out the ignition key Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 6] 3

Korpusrecherchemethoden aus 20 Metern abziehen, to shoot (a ball) vigorously from 20 m distance Botschafter (aus ) abziehen, to withdraw the ambassador (from ) Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 7] Korpusrecherchemethoden Kapital (aus ) abziehen, to withdraw capital (from) Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 8] 4

Korpusrecherchemethoden den Rauch abziehen lassen, to let the smoke escape Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 9] 9.3 Software V: KWIC Finder Corpus linguistics and the WWW, two options: Web as a corpus Development of corpus analysis programs that use the WWW as a corpus. Corpus from the web Development of programs that extract texts from the web, process them and integrate them into corpus collections. Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 10] 5

Corpus analysis software IV: KWICFinder 9.3 Software V: KWIC Finder KWICFinder Key Word in Context Research Tool and Concordancer for the Web Developer: William Fletcher. Version: 0.98.22 (Beta Version), 11. Dec. 2006 (Windows). Search: online. Software: locally installed. Access: free download. Corpora: WWW. Languages: ca. 20 languages on the basis of the Latin script. URL: http://www.kwicfinder.com/kwicfinder.html. Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 11] 9.3 Software V: KWIC Finder produces concordances on the basis of WWW pages search can be restricted to pages with particular titles or in particular domains can be used to find examples for colloquial language (chat rooms) or examples for special / technical language Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 12] 6

Step 2: using KWICFinder to collect concordances reflecting colloquial German from the internet enter search term: abziehen Search in pages that show chat in their title. Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 13] Results 1 Mongolia / Languages 2 Publishing dictionaries 3 Corpus linguistics 4 Improving dictionaries 5 Outlook Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 14] 7

more meanings not covered in Vietze (1) Die Leute die mich kennen, wissen, daß ich eigentlich eine ganz Friedfertige und Versöhnliche bin. Aber was hier einige Leute abziehen... echt therapiebedürftig!!! [ ] what some people are pulling off here [ ] abziehen to pull off something (coll.) Was ziehst du hier ab? What are you pulling off here? (2) Die Suppe mit Salz abschmecken, mit verquirltem Eigelb abziehen und die Spargelstückchen hineingeben. [ ] thicken the soup with beaten egg yolk [ ] abziehen to thicken (gastr.) er zieht die Suppe mit Eigelb ab he thickens the soup with egg yolk Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 15] (3) ich finde auch den preis etwas niedrig und der ebayer hat auch nur 2 bewertungen,habe deshalb ihn gefragt,ob wir das geschäft über den treuhandservice abwickeln können.jetzt warte ich auf seine antwort.nicht das der mich abziehen will,nur weil vielleicht zu wenig für das board geboten wurde.nicht mein problem. [ ] that he wants to swindle me [ ] to swindle / cheat (coll.) er versuchte mich abzuziehen he tried to swindle me (4) Bieretiketten kann mein einfach von der Flasche abziehen. [ ] Beer labels can be easily pulled off the bottle [ ] abziehen to pull off sie zog das Etikett von der Bierflasche ab she pulled the label off the beer bottle Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 16] 8

more meanings not covered in Vietze meanings covered in Vietze Relevant for a general bilingual dictionary intr. withdraw take out (key) intr. go away (math.) subtract skin (coat/fur) itr. escape (of smoke (coll.) shoot vigorously tr. withdraw (troups, ambassador) (coll.) swindle, cheat (coll., neg.) do (something) pull off (label) deduct (something from income) withdraw (capital) deduct (points) Irrelevant for a general bilingual dict. sharpen (a straight blade razor) pull off (bark) (youth) tear off and rob (typogr.) run off (youth) extort (gastr) thicken (soup) Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 17] 9