Doe wat je niet laten kan: A usage-based analysis of Dutch causative constructions



Similar documents
Doe wat je niet laten kan: A usage-based analysis of Dutch causative constructions. Natalia Levshina

Comparing constructicons: A cluster analysis of the causative constructions with doen in Netherlandic and Belgian Dutch.

University of Marburg, RC Deutscher Sprachatlas University of Leuven, RU Quantitative Lexicology and Variational Linguistics

An integrative exemplar-based model of semantic structure: The Dutch causative construction with laten 1

Doe wat je niet laten kan: A usage-based analysis of Dutch causative constructions

SAND: Relation between the Database and Printed Maps

total dutch speak Dutch instantly no books no writing absolute confi dence

How To Write A Sentence In Germany

Tracking change in word meaning

Annotation Guidelines for Dutch-English Word Alignment

~ We are all goddesses, the only problem is that we forget that when we grow up ~

Example-Based Treebank Querying. Liesbeth Augustinus Vincent Vandeghinste Frank Van Eynde

Risk-Based Monitoring

The information in this report is confidential. So keep this report in a safe place!

Lexical convergence in the Dutch lexicon

Examen Software Engineering /09/2011

How To Test A Website On A Web Browser

Degrees of adverbialization

Management control in creative firms

Shopper Marketing Model: case Chocomel Hot. Eric van Blanken 20th October 2009

Dutch Foundation Course

CMMI version 1.3. How agile is CMMI?

Virtualisatie. voor desktop en beginners. Gert Schepens Slides & Notities op gertschepens.be

Research Report. Ingelien Poutsma Marnienke van der Maal Sabina Idler

Title page. Title: A conversation-analytic study of time-outs in the Dutch national volleyball. competition. Author: Maarten Breyten van der Meulen

CLARIN project DiscAn :

NAAR NEDERLAND HANDLEIDING

Lean in het digitale tijdperk. Hans Toebak, Arjen Markus, 13 november 2013

LEXICOGRAPHIC ISSUES IN COMBINATORICS

Load Balancing Lync Jaap Wesselius

CO-BRANDING RICHTLIJNEN

Cambridge International Examinations Cambridge International General Certificate of Secondary Education

Acquiring grammatical gender in northern and southern Dutch. Jan Klom, Gunther De Vogelaer

If farming becomes surviving! Ton Duffhues Specialist Agriculture and society ZLTO Director Atelier Waarden van het Land 4 juni 2014, Wageningen

Hoorcollege marketing 5 de uitgebreide marketingmix. Sunday, December 9, 12

Varieties of lexical variation

The Chat Box Revelation On the chat language of Flemish adolescents and young adults

IP-NBM. Copyright Capgemini All Rights Reserved

GMP-Z Annex 15: Kwalificatie en validatie

employager 1.0 design challenge

THE EMOTIONAL VALUE OF PAID FOR MAGAZINES. Intomart GfK 2013 Emotionele Waarde Betaald vs. Gratis Tijdschrift April

Asking what. a person looks like some persons look like

Cultural Trends and language change

COMPUTATIONAL DATA ANALYSIS FOR SYNTAX

De tarieven van Proximus Niet meer gecommercialiseerde Bizz packs

IT-waardeketen management op basis van eeuwenoude supply chain kennis

101 Inspirerende Quotes - Eelco de boer - winst.nl/ebooks/ Inleiding

Ons privacybier. Derde privacycafé Data Protection Institute 13 januari 2016 Thomas More Mechelen 21/01/16

Relationele Databases 2002/2003

Corporate Security & Identity

Duurzaam Supply Management

Nominative-Dative Inversion and the Decline of Dutch

VIDEO CREATIVE IN A DIGITAL WORLD Digital analytics afternoon. Hugo.schurink@millwardbrown.com emmy.brand@millwardbrown.com

Making Leaders Successful Every Day

BEGRIJP JE DOELGROEP BETER DOOR KENNIS OVER TRENDS. Spreker: Aljan de Boer Moderator: Mascha Buiting

Advanced Metering Infrastructure

Met wie moet je als erasmusstudent het eerst contact opnemen als je aankomt?

Sentence Semantics. General Linguistics Jennifer Spenader, February 2006 (Most slides: Petra Hendriks)

Lexical Competition: Round in English and Dutch

Hoofdstuk 2 Samenwerking en afstemming in de zorgketen

Duurzaam Supply Management

Corpus data in usage-based linguistics

Pronouns: A case of production-before-comprehension

HOE WERKT CYBERCRIME EN WAT KAN JE ER TEGEN DOEN? Dave Maasland Managing Director ESET Nederland

Dutch Advanced Course

UNIT ONE A WORLD OF WONDERS

Belgium (30,528 km 2 = 1/317 of China)!

IC Rating NPSP Composieten BV. 9 juni 2010 Variopool

Sense-Tagging Verbs in English and Chinese. Hoa Trang Dang

Specification by Example (methoden, technieken en tools) Remco Snelders Product owner & Business analyst

Causality marking across levels of language structure. A cognitive semantic analysis of causal verbs and causal connectives in Dutch

It s a small word. Johan Rooryck & Guido Vanden Wyngaerd HIL Leiden/FWO - KUBrussel. 1. Introduction *

Term 2 - Form 1. Dutch

COOLS COOLS. Cools is nominated for the Brains Award! Coen Danckmer Voordouw

Kantoor of Koffiebar. Alain Swolfs Market Unit Leader Banking & Insurance alain.swolfs@capgemini.com

EDQM indicators on Pharmaceutical Care

INSIDER TRADING POLICY ZIGGO N.V. EFFECTIVE AS OF Declaration of agreement with the Insider Trading Policy

Timeline (1) Text Mining Master TKI. Timeline (2) Timeline (3) Overview. What is Text Mining?

Dutch Mortgage Market Pricing On the NMa report. Marco Haan University of Groningen November 18, 2011

NL VMUG UserCon March

Transcription:

Doe wat je niet laten kan: A usage-based analysis of Dutch causative constructions Supervisors: Natalia Levshina Dirk Geeraerts Dirk Speelman University of Leuven RU Quantitative Lexicology and Variational Linguistics

Outline 1. Theoretical and methodological background 2. Dutch causative constructions 3. Data and method 4. Semasiological variation (polysemy) - semantic and contextual variation of doen - semantic and contextual variation of laten 5. Onomasiological variation (near-synonymy) - variation of doen and laten - contextual factors 6. Conclusions and future research

Empirical Cognitive Semantics Meaning is a concept Meaning reveals itself in usage Meaning is subject to social variation

Empirical Cognitive Semantics Meaning is a concept categorization and comprehension categories and exemplars Meaning reveals itself in usage Meaning is subject to social variation

Empirical Cognitive Semantics Meaning is a concept categorization and comprehension categories and exemplars Meaning reveals itself in usage distributional bottom-up approach Meaning is subject to social variation

Empirical Cognitive Semantics Meaning is a concept categorization and comprehension categories and exemplars Meaning reveals itself in usage distributional bottom-up approach Meaning is subject to social variation lectally diverse data

State of the Art bias towards alternations in studies of constructions (cf. Goldberg 2002: each construction should [also] be studied on its own right) main focus on generalizations and prototypes, although exemplar effects in human categorization are proved to be powerful insufficient attention paid to contextual (social and genre-related) variation; some voices that it is too problematic to be studied as a factor (Gries & Divjak 2010) lack of theoretically interpretable yet robust statistical techniques that can integrate different sources of variation

Outline 1. Theoretical and methodological background 2. Dutch causative constructions 3. Data and method 4. Semasiological variation (polysemy) - semantic and contextual variation of doen - semantic and contextual variation of laten 5. Onomasiological variation (near-synonymy) - variation of doen and laten - contextual factors 6. Conclusions and future research

Dutch Causative Constructions De tovenaar deed zijn dienaars een kasteel bouwen. liet The magician CAUSER made made/let AUX his servants CAUSEE a castle AFFECTEE* build EFFECTED PREDICATE slot names: Kemmer & Verhagen (1994)

doen vs. laten doen Direct causation laten Indirect causation «The initiator produces the effected event directly; there is no intervening energy source downstream» «Some other force besides the initiator is the most immediate source of energy in the effected event» Verhagen & Kemmer (1997)

Other dimensions of variation Speelman & Geeraerts 2009: doen is used more frequently in formal registers doen is used more frequently in Belgian Dutch doen is more prone to collocational fixation doen is an obsolescent form with a tendency towards semantic and lexical specialization

Outline 1. Theoretical and methodological background 2. Dutch causative constructions 3. Data and method 4. Semasiological variation (polysemy) - semantic and contextual variation of doen - semantic and contextual variation of laten 5. Onomasiological variation (near-synonymy) - variation of doen and laten - contextual factors 6. Conclusions and future research

Data 5768 observations with causative doen and laten corpus - formal written genre: articles from Dutch and Belgian quality newspapers in TwNC and LeNC - informal written genre: posts from Usenet.nl and Usenet.be in Dutch, spam-checked - informal spoken genre: Dutch and Flemish spontaneous faceto-face conversations from CGN retrieval: XML parser (newspapers), regular expressions, manual cleaning-up (e.g. laten we zeggen)

37 Variables, 731 features General context: sentence type, clause type, clause mood, clause tense, Cx syntactic function Causer: semantic class, syntactic expression, POS, person, number, definiteness Causee: see Causer; intentionality, thematic proto-role Affectee: see Causer Effected Predicate: semantic class, semantic source and target domain, prefix, root, extended transitivity, prepositional complements Other: coreferentiality, possession, negation, modals, adverbials

Some corpus methods in CogSem Polysemy Synonymy Exemplars Features Bi-and multinomial Regression Collostructional Analysis Behavioral Profiles + +? + predef. senses + +?? + +? + predef. senses Exemplar Spaces + + + +

Exemplar Spaces Method Example of Data Var 1 Var 2 Var 3 Exemplar 1 A yes X Exemplar 2 A yes Y Exemplar 3 B no X Partitioning into k Clusters Cluster Exemplar 1 1 Exemplar 2 1 Exemplar 3 2 Distances (Gower) Ex. 1 Ex. 2 Ex.3 Exemplar 1 0 0.33 0.66 Exemplar 2 0.33 0 1 Exemplar 3 0.66 1 0 Interpretation: Cluster profiles Cluster A B yes no X Y Cluster 1 1 0 1 0 0.5 0.5 Cluster 2 0 1 0 1 1 0 distinctive features

Outline 1. Theoretical and methodological background 2. Dutch causative constructions 3. Data and method 4. Semasiological variation (polysemy) - semantic and contextual variation of doen - semantic and contextual variation of laten 5. Onomasiological variation (near-synonymy) - variation of doen and laten - contextual factors 6. Conclusions and future research

Exemplar Space of doen SMACOF MDS, doen dim 2 dim 1 stress: 0.07

2 Clusters of doen 2 clusters of doen dim 2 dim 1

Clusters of doen: Top distinctive features Red Cluster Blue Cluster Psych. Caused Event (fig.) 3 rd Person Causee Psych. Caused Event (lit) Social Caused Event (fig.) Human Causee Nominal Causee Pronominal Causee Social Caused Event (lit.) 1 st Person Causee Causee undergoes change No change occurs No prepositional complements Verb class Mental Processes Explicit Causee Root denk- Explicit NP Affectee Preposition aan Verb class Motion Implicit Causee Indefinite NP Affectee

Causative doen: A dictionary entry 1. Cause a change in the world, especially a social 1 change conceptualized as an abstract objective event. Betere booklets zijn er geregeld (maar dat doet de prijs stijgen). 22. Trigger a psychological reaction. Dacht ook dat ik het wist, tot je vraag me deed twijfelen, en terecht... doen denken aan remind of " Groovin on E " doet met zijn fris jengelende gitaren en neuzelende fluisterzang denken aan The Kinks.

Contextual variation of doen Frequency of causative doen, per million words Belgium Netherlands doen is more frequent in Belgium in formal genres (Chi-test, α = 0.05) 0 50 100 150 Newspapers Usenet Spoken dialogues

doen: Interaction Context*Semantics newspapers Usenet dialogues Belgium Netherlands

Belgium, newspapers doen, Flemish newspapers dim 2 dim 1

The Netherlands, newspapers doen, Dutch newspapers dim 2 dim 1

Belgium, Usenet doen, Flemish Usenet dim 2 dim 1

The Netherlands, Usenet doen, Dutch Usenet dim 2 dim 1

Belgium, conversations doen, Flemish conversations dim 2 dim 1

The Netherlands, conversations doen, Dutch conversations dim 2 dim 1

doen: Lectally enriched entry 1. esp. Belg. and formal Cause a change in the world, 1 especially a social change conceptualized as an abstract objective event. Betere booklets zijn er geregeld (maar dat doet de prijs stijgen). 22. Trigger a psychological reaction. Dacht ook dat ik het wist, tot je vraag me deed twijfelen, en terecht... doen denken aan remind of " Groovin on E " doet met zijn fris jengelende gitaren en neuzelende fluisterzang denken aan The Kinks.

Outline 1. Theoretical and methodological background 2. Dutch causative constructions 3. Data and method 4. Semasiological variation (polysemy) - semantic and contextual variation of doen - semantic and contextual variation of laten 5. Onomasiological variation (near-synonymy) - variation of doen and laten - contextual factors 6. Conclusions and future research

Exemplar Space of laten SMACOF MDS, laten dim 2 dim 1 stress: 0.09

3 Clusters of laten SMACOF MDS, laten dim 2 dim 1

Clusters of laten: Top distinctive features Red Cluster Blue Cluster Green Cluster Psych. Caused Event (lit) Explicit NP Causee Human Undefined Causee Psych. Caused Event (fig) Intransitive EP Intentional Causee Human Causee V. Motion Causee causes change V. Perception Abstract Causee V Creation Root zie- Causee is changed V. Phys Manipulation Pronominal Causee Indefinite NP Affectee V. Social Interaction V. Factive V. Existence V. Change of Possession Root weet- Plural Affectee Material Object Affectee Affectee Clause No coreferentiality Sing. Number Causee 1 st Person Causee Nominal Affectee Explicit NP Affectee

Causative laten: An entry 1 2 3 Stop or fail to carry out impingement, abandon, leave. Ook Rombouts laat Van Hecke vallen. Suffer or benefit from someone s actions, e.g. services. En als ik het geschreven heb ga ik het eerst laten registreren bij ISBN. Ref Allaerts laat zich vangen. Provide information, show. Nou dat heeft ie laten zien.

Contextual variation of laten Frequency of causative laten, per million words Belgium Netherlands Frequency of laten NL > BE newspapers overall written > spoken (Chi-test, α = 0.05) 0 200 400 600 Newspapers Usenet Spoken dialogues

laten: Interaction Context*Semantics newspapers Usenet dialogues Belgium Netherlands

Belgium, newspapers laten, Flemish newspapers dim 2 dim 1

The Netherlands, newspapers laten, Dutch newspapers dim 2 dim 1

Belgium, Usenet laten, Flemish Usenet dim 2 dim 1

The Netherlands, Usenet laten, Dutch Usenet dim 2 dim 1

Belgium, conversations laten, Flemish conversations dim 2 dim 1

The Netherlands, conversations laten, Dutch conversations dim 2 dim 1

laten: Lectally enriched entry 1 2 3 Stop or fail to carry out impingement, abandon, leave. Ook Rombouts laat Van Hecke vallen. Suffer or benefit from someone s actions, e.g. services. En als ik het geschreven heb ga ik het eerst laten registreren bij ISBN. Ref Allaerts laat zich vangen. esp. Neth. Dutch Provide information, show. Nou dat heeft ie laten zien.

Outline 1. Theoretical and methodological background 2. Dutch causative constructions 3. Data and method 4. Semasiological variation (polysemy) - semantic and contextual variation of doen - semantic and contextual variation of laten 5. Onomasiological variation (near-synonymy) - variation of doen and laten - contextual factors 6. Conclusions and future research

A Challenge Het is moeilijk een scherp onderscheid te maken tussen het gebruik van doen en dat van laten, al zijn er wel enkele tendensen aan te wijzen. Bij de keuze voor één van beide werkwoorden kunnen niet alleen betekenisverschillen, maar ook geografische en stilistische verschillen een rol spelen. In ieder geval zijn doen en laten niet altijd zonder meer uitwisselbaar. ANS 1997

doen vs laten: Onomasiological Variation SMACOF MDS, doen and laten dim 2 doen laten dim 1 stress: 0.09

doen vs laten: Top distinctive features doen laten abstract Causer human Causer intransitive predicates transitive predicates 3rd person Causer pronominal Affectee V. denoting Mental Processes no prepositional complements root denk- intentional Causee nominal Affectee 3rd person Causee preposition aan explicit NP Affectee explicit NP Causee V. of Perception abstract Affectee implicit Causee 1st person Causee causing (agentive) Causee

Confirmatory Regression: Design Response: doen or laten General semantic predictors: - human/abstract/other Causer - transitive/intransitive effected predicate - Causee causes or undergoes a change - intentional or unintentional Causee - implicit or explicit Causee Contextual variables: Country and Genre Exemplar effects?

Modelling Exemplar Effects Speelman & Geeraerts (2009): lexical fixation operationalized with the help of collostructional analysis (Stefanowitsch & Gries 2003) Bybee (2010): rare collexemes can be semantically similar to frequent collexemes and thus typical of Cx; see also Schmid (2010) solution (cf. exemplar model by Nosofsky 1986): similarity to neighbours = e -d^2 where d is distance between exemplars of the same category (doen or laten)

proxdoen and proxlaten dim 2 doen laten dim 1

Main Effects Estimate Std. Error Pr(> z ) (Intercept) -4.02 0.31 < 2e-16 *** CrSem=Abstr 1.86 0.17 < 2e-16 *** CrSem=Oth 2.1 0.24 < 2e-16 *** proxdoen 0.06 0.004 < 2e-16 *** proxlaten -0.01 0.001 < 2e-16 *** Genre=spont_dial -0.91 0.24 <0.001*** Genre=Usenet -0.16 0.13 0.23 CeEnergy=Change 1.16 0.32 <0.001*** CeEnergy=Oth 1.26 0.29 <0.001*** CeIntent=No 0.62 0.18 <0.001*** CeIntent=Undef -0.04 0.24 0.85 Country=BE 0.33 0.12 0.008** CeSynt=Expl 0.37 0.17 0.029* EPTrans = Intr 0.01 0.17 0.964 full model with interactions: C=0.96 Gamma=0.91 pseudo-r 2 =0.68

Interpretation of the Model doen involves direct causation, laten indirect? - abstract Causers (doen) can be only direct causes, whereas human Causers (laten) can affect the situation indirectly - implicit Causees (laten) are often intermediaries - agent-like (intentionally causing) Causees (laten) allow the Causer to influence the world indirectly doen is more popular in formal genres and Belgium

Interaction Country*Genre newspapers Usenet dialogues Belgium Netherlands 1 1.0 1.2 1.4 doen laten

Average probability of doen clusters of doen in common space dim 2 77% 53% dim 1 stress: 0.09

doen: Onomasiologically enriched entry 1. esp. Belg. and formal Cause a change in the world, 1 especially a social change conceptualized as an abstract objective event. Syn. laten Betere booklets zijn er geregeld (maar dat doet de prijs stijgen). 22. Trigger a psychological reaction. Dacht ook dat ik het wist, tot je vraag me deed twijfelen, en terecht... Syn. laten (?) doen denken aan remind of " Groovin on E " doet met zijn fris jengelende gitaren en neuzelende fluisterzang denken aan The Kinks.

Cf. Van Dale 2005 doen als oorzakelijk hulpww. synoniem: laten ik doe opmerken dat enz. iets doen of doen doen een steen deed hem struikelen doen te weten

Average probability of laten clusters of laten in common space dim 2 97% 98% 93% dim 1

Cf. Van Dale 2005 laten (both auxiliary and notional) 16. maken dat het object de werking verricht die door de onbep. ww. wordt uitgedrukt. synoniem: doen zij liet mij vragen of ik meeging, iem. laten struikelen zijn gedachten, zijn oog laten gaan iem. iets laten weten?

Outline 1. Theoretical and methodological background 2. Dutch causative constructions 3. Data and method 4. Semasiological variation (polysemy) - semantic and contextual variation of doen - semantic and contextual variation of laten 5. Onomasiological variation (near-synonymy) - variation of doen and laten - contextual factors 6. Conclusions and future research

doen: Summary Global semasiological variation: two main senses related to semantic domain of causation (cf. Verhagen & Kemmer 1997, but rather social than physical causation) Contextual semasiological variation: doen is quantitatively and qualitatively poorer in informal genres and in Netherlandic Dutch (cf. Speelman & Geeraerts 2009), biased towards affective causation, esp. fixed doen denken aan

A historical perspective Verhagen 2000: doen used to have a more diverse semantic palette Millenium Sense 18th C. Belgium Netherlands Interpersonal +???? Physical + +? Affective + + + Belgian Dutch is lagging behind (cf. Geeraerts et al. 1999)

laten: Summary Global semasiological variation: three main senses related to different semantic roles of Causer and Causee (cf. Verhagen & Kemmer 1997) Contextual semasiological variation: sense laten3 (laten zien/weten/horen) is unpopular in Belgian newspapers

doen vs. laten: Summary On the general semantic level, the difference between doen and laten can be formulated in terms of causation (in)directness (cf. Kemmer & Verhagen). Cf. polysemy: in all three senses of laten the Causer does not actually affect, and the Causee is not affected (indirectntess); in the two senses of doen the Causer is the (inevitable) cause/stimulus (directness). Exemplar effects play an important role, too.

doen vs. laten: Summary Non-mental intransitive doen and laten are more prone to synonymy than the other senses. At the same time, doen can be readily replaced by laten, but not the other way round. Laat wat je doen kan, maar doe niet wat je laten kan!

Conclusions unsupervised bottom-up exemplar spaces are a way of integrating different interrelated types of variation in one semantic model (complete recontexualization) the semasiological and onomasiological perspectives provide specific yet compatible semantic information both exemplar effects and more general features are found to be important in categorization contextual variation has a large influence on the semasiological structure, but less so on the division of labour between doen and laten

Future research experimental confirmation of findings comparison with unsupervised distributional methods (Vector Space Models) a diachronic study of exemplar space evolution

Vragen? Suggesties? Laat je maar gaan! for further information: http://wwwling.arts.kuleuven.be/qlvl natalia.levshina@arts.kuleuven.be