Lexical convergence in the Dutch lexicon
|
|
|
- Amanda Strickland
- 10 years ago
- Views:
Transcription
1 Overview Introduction Dutch Method Results Lexical convergence in the Dutch lexicon Jocelyne Daems Kris Heylen Dirk Geeraerts University of Leuven RU Quantitative Lexicology and Variational Linguistics Conclusion
2 Overview 1. Introduction 2. Dutch 3. Method Uniformity measure Data 4. Results 5. Conclusion References
3 1. Introduction Variation in Dutch Lectal variation register region gender generation profession and education Linguistic variation vrachtwagen of vrachtauto (of camion of truck) dat ken/kan/kun jij wel zeggen zeven of zeuven
4 1. Introduction Variation in Dutch Lectal variation register region gender generation profession and education Linguistic variation vrachtwagen of vrachtauto dat ken/kan jij wel zeggen zeven of zeuven A specific context can come with its own set of linguistic choices lect
5 1. Introduction Variation in Dutch in this study Lectal variation region and register geographical and stylistic/stratificational question Linguistic variation lexicon onomasiological question
6 1. Introduction Research aim Measure lexical convergence between Dutch in Belgium and Dutch in the Netherlands by means of the word choice for emotion-, IT- and traffic concepts. Cf. Geeraerts, Grondelaers & Speelman 1999
7 2. Dutch Dutch Two national varieties (pluricentric language (Clyne 1992)) Netherlandic Dutch Belgian Dutch Stratification Poldernederlands? Verkavelingsvlaams, soapvlaams, tussentaal in-between language
8 2. Dutch Standardisation Different development of the standardisation process the Netherlands: indepdent development of a/the standard variant of Dutch Belgium: after a long delay, promotion of convergence with the (established) Netherlandic Dutch norm Two opposed views in Belgium Integrationism: Netherlandic Dutch as the norm Particularism: room for a Belgian Dutch standard variant
9 3. Method Geeraerts, Grondelaers & Speelman 1999 uniformity measure for onomasiological variation to what extent do people for a given concept choose the same words? how often do Belgians and Dutchmen express vrachtwagen ( truck ) in the same way?
10 3. Method - Uniformity measure Onomasiological profile concept synonym1 synonym2 synony3 synonym4 absfreq absfreq absfreq absfreq relfreq relfreq relfreq relfreq
11 3. Method - Uniformity measure Onomasiological profile: concrete vrachtwagen vrachtwagen vrachtauto truck camion % 1% 12% 1%
12 3. Method - Uniformity measure Measure lexical convergence: five steps 1. Per concept build the onomasiological profile 2. Per variety build the onomasiological profile 3. Measure uniformity 4. Aggregate over all concepts 5. (Weigh the uniformity measure by means of the concept frequency)
13 3. Method - Uniformity measure Measure lexical convergence: concrete vrachtwagen vrachtwagen vrachtauto truck camion BE NL BE NL BE NL BE NL 86% 74% 1% 12% 12% 14% 1% 0%
14 3. Method - Uniformity measure Measure lexical convergence: five steps 1. Per concept build the onomasiological profile 2. Per variety build the onomasiological profile 3. Measure uniformity = overlap in lexicalisation preferences = sum of the smallest relative value for each term 4. Aggregate over all concepts 5. (Weigh the uniformity measure by means of the concept frequency)
15 3. Method - Uniformity measure Measure lexical convergence: concrete vrachtwagen vrachtwagen vrachtauto truck camion BE NL BE NL BE NL BE NL 86% 74% 1% 12% 12% 14% 1% 0% 74% + 1% + 12% + 0% = 87% measure of uniformity
16 3. Method - Uniformity measure Measure lexical convergence: five steps 1. Per concept build the onomasiological profile 2. Per variety build the onomasiological profile 3. Measure uniformity 4. Aggregate over all concepts = repeat for each concept in the semantic field + compute the mean 5. (Weigh the uniformity measure by means of the concept frequency)
17 3. Method - Uniformity measure Measure lexical convergence: concrete U vrachtwagen 87% metro 97% trein 100% stoep 61% bestelwagen 45% mean 78%
18 3. Method - Uniformity measure Measure lexical convergence: five steps 1. Per concept build the onomasiological profile 2. Per variety build the onomasiological profile 3. Measure uniformity 4. Aggregate over all concepts 5. Weigh the uniformity measure by means of the concept frequency = less frequent concepts weigh less
19 3. Method - Uniformity measure Measure lexical convergence: concrete U W U vrachtwagen 87% 32% 28% metro 97% 8% 7% trein 99% 46% 45% stoep 61% 10% 6% bestelwagen 45% 4% 2% mean 78% sum 89%
20 3. Method - Data Data gathering Four steps Select profiles Lemmatised corpora Script frequencies and concordances Manual disambiguation
21 Profiles traffic 3. Method - Data
22 Profiles IT 3. Method - Data
23 Profiles emotions 3. Method - Data
24 3. Method - Data A sample of annotated text by Alpino [ Na, prep, na, 3849, 0], [ een, det, een, 3849, 1], [ jaar, noun, jaar, 3849, 2], [ -, punct, -, 3849, 3], [ ik, pron, ik, 3849, 4], [ deed, verb, doe, 3849, 5], [ alles, noun, alles, 3849, 6], [,, punct,,, 3849, 7], [ van, prep, van, 3849, 8], [ was, noun, was, 3849, 9], [ ophangen, verb, hang op, 3849, 10], [ tot, prep, tot, 3849, 11], [ de, det, de, 3849, 12], [ beenhouwerij, noun, beenhouwerij, 3849, 13], [ kuisen, verb, kuis, 3849, 14], [ -, punct, -, 3849, 15], [ vroeg, verb, vraag, 3849, 16], [ ik, pron, ik, 3849, 17], [ of, comp, of, 3849, 18], [ ik, pron, ik, 3849, 19], [ meer, det, meer, 3849, 20], [ kon, verb, kan, 3849, 21], [ verdienen, verb, verdien, 3849, 22], [., punct,., 3849, 23], [ De, det, de, 3850, 0], [ slager, noun, slager, 3850, 1], [ wees, verb, wijs af, 3850, 2], [ dat, det, dat, 3850, 3], [ verzoek, noun, verzoek, 3850, 4], [ af, part, af, 3850, 5], [., punct,., 3850, 6], [ Ik, pron, ik, 3851, 1], [ pakte, verb, pak, 3851, 2], [ m n, det, mijn, 3851, 3], [ velo, noun, velo, 3851, 4], [ en, vg, en, 3851, 5], [ ik, pron, ik, 3851, 6], [ was, verb, ben, 3851, 7], [ weg, adv, weg, 3851, 8], [., punct,., 3851, 9] Na een jaar ik deed alles, van was ophangen tot de beenhouwerij kuisen vroeg ik of ik meer kon verdienen. De slager wees dat verzoek af. Ik pakte m n velo en ik was weg.
25 3. Method - Data Manual desambiguation Stevaert vindt de sluiting van de kleine ring rondom Antwerpen een topprioriteit. relevant meaning Al draagt een aap een gouden ring, het is en blijft een lelijk ding. wrong meaning: jewellery Wat is de kick? Als ik in de ring oog in oog sta met mijn tegenstander. wrong meaning: boxing ring
26 Profiles 80 concepts 2 parts of speech 3 lexical fields 3. Method - Data traffic: vrachtwagen ( truck ) IT: gsm ( cell phone ) emotions: verlegen ( shy ), oprechtheid ( sincerity ) Corpora newspaper Usenet Belgium 373M 18M the Netherlands 161M 24M
27 4. Results Hypotheses 1. Diachronic convergence = word choice in Belgium and the Netherlands becomes more uniform 2. Synchronic stratification = word choice in the Netherlands is more uniform than in Belgium = word choice in newspapers is more uniform than in Usenet 3. Role of the semantic field 4. Role of the concept frequency
28 4. Results Hypothesis 1: Diachronic convergence Study : Geeraerts, Grondelaers & Speelman 1999 Data : clothing terms in fashion magazines U (B50,N50) 70% U (B70,N70) 75% U (B90,N90) 82% Study : De Cnodder 2013 Data : clothing terms in fashion magazines U (B12,N12) 78%
29 4. Results Hypothesis 1: Diachronic convergence Study : Geeraerts, Grondelaers & Speelman 1999 Data : football terms in magazines in 1990 U (B50,N50) 66% U (B70,N70) 72% U (B90,N90) 77%
30 4. Results Hypothesis 1: Diachronic convergence Study : Grondelaers, Van Aken, Speelman & Geeraerts 2001 Data : clothing terms in newspapers in 1998 U (B58,N58) 71% U (B78,N78) 64% U (B98,N98) 82% Study : Grondelaers, Van Aken, Speelman & Geeraerts 2001 Data : prepositions in newspapers in 1998 U (B58,N58) 59% U (B78,N78) 62% U (B98,N98) 67%
31 4. Results Hypothesis 1: Diachronic convergence Convergence for the supraregional/formal language use / Shift in Belgian Dutch, which converges towards Netherlandic Dutch! Situation in 2012: stagnation or divergence? Informalisation of Netherlandic Dutch?! Balanced composition of the corpus! Content words vs. function words
32 4. Results Hypothesis 2: Synchronic stratification Study : Geeraerts, Grondelaers & Speelman 1999 Data : clothing terms in fashion magazines and shop windows in 1990 U (Bmag,B etal ) 50% U (Nmag,N etal ) 69% Study : De Cnodder 2013 Data : clothing terms in fashion magazines and shop windows in 2012 U (Bmag,B etal ) 63% U (Nmag,N etal ) 79%
33 U (Bkrant,B Usenet ) 82% U (Bkrant,B IRC ) 75% U (Nkrant,N Usenet ) 97% U (N,N ) 94% 4. Results Hypothesis 2: Synchronic stratification Study : Grondelaers, Van Aken, Speelman & Geeraerts 2001 Data : clothing terms in newspapers, Usenet and IRC in 1998 U (Bkrant,B Usenet ) 90% U (Bkrant,B IRC ) 83% U (Nkrant,N Usenet ) 92% U (Nkrant,N IRC ) 90% Study : Grondelaers, Van Aken, Speelman & Geeraerts 2001 Data : prepositions in newspapers, Usenet and IRC in 1998
34 4. Results Hypothesis 2: Synchronic stratification Study : Daems, Heylen & Geeraerts (In prep.) Data : traffic terms in newspapers and Usenet in U (Bkrant,B Usenet ) 87% U (Nkrant,N Usenet ) 86% Study : Daems, Heylen & Geeraerts (In prep.) Data : IT terms in newspapers and Usenet in U (Bkrant,B Usenet ) 72% mailinglist, provider, database U (Nkrant,N Usenet ) 70% mailinglist, provider, password
35 4. Results Hypothesis 2: Synchronic stratification Uniformity is lower in Belgian Dutch In Belgian Dutch there is a continuum from regional/informal (IRC) to supraregional/formal (newspapers) language use in which Usenet takes an intermediary position! Importance semantic field
36 4. Results Hypothesis 2: Synchronic stratification (bis) Studies : Geeraerts, Grondelaers & Speelman 1999 Data : clothing terms in fashion magazines in 1990 and 2012, newspapers, Usenet and IRC in 1998 U (Bmag,90,N mag,90 ) 82% U (Bmag,12,N mag,12 ) 78% U (Bkrant,N krant ) 82% U (BUsenet,N Usenet ) 73% U (BIRC,N IRC ) 75%
37 4. Results Hypothesis 2: Synchronic stratification (bis) Study : Daems, Heylen & Geeraerts (In prep.) Data : traffic terms in newspapers and Usenet in U (Bkrant,N krant ) 78% U (BUsenet,N Usenet ) 72% bus, trein, vrachtwagen Study : Daems, Heylen & Geeraerts (In prep.) Data : IT terms in newspapers and Usenet in U (Bkrant,N krant ) 74% U (BUsenet,N Usenet ) 91% website, link, provider
38 4. Results Hypothesis 2: Synchronic stratification (bis) On a supraregional level the standard variant is more uniform than the informal variant(s)! Importance of the semantic field
39 4. Results Hypothesis 3: Role of the semantic field Studies : Geeraerts, Grondelaers & Speelman 1999, Grondelaers, Van Aken, Speelman & Geeraerts 2001 Data : football terms in magazines in 1990, prepositions and clothing terms in newspapers in 1998 U (Bkleding,N kleding ) 82% U (voorzetsel,n voorzetsel ) 67% U (Bvoetbal,N voetbal ) 77%
40 4. Results Hypothesis 3: Role of the semantic field Study : Daems, Heylen & Geeraerts (In prep.) Data : traffic, IT and emotion terms in newspapers in U (Bverkeer,N verkeer ) 78% U (BICT,N ICT ) 74% U (Bemotie,noun,N emotie,noun ) 81% U (Bemotie,adj,N emotie,adj ) 79%
41 4. Results Hypothesis 3: Role of the semantic field Age of the field, degree of supraregional contact, register! Statistic significance: requires enough data
42 4. Results Hypothesis 4: Role of the concept frequency Study : Daems, Heylen & Geeraerts (In prep.) Data : traffic, IT and emotion terms in newspapers in non-weighted (U) weighted (U ) (B verkeer,n verkeer ) 70% 78% (B ICT,N ICT ) 79% 74% (B emotie,noun,n emotie,noun ) 83% 81% (B emotie,adj,n emotie,adj ) 78% 79%
43 4. Results Hypothesis 4: Role of the concept frequency Study : Daems, Heylen & Geeraerts (In prep.) Data : traffic, IT and emotion terms in newspapers in non-weighted (U) weighted (U ) (B verkeer,n verkeer ) 70% 78% (B ICT,N ICT ) 79% 74% traffic 70% 78% < bus, fiets, trein, vrachtwagen IT 79% 74% < mobiele telefoon, e(-)mail
44 4. Results Hypothesis 4: Role of the concept frequency Know your data qualitatively
45 5. Conclusion Confirmation Diachronic convergence Synchronic stratification Attention! Importance of the semantic field! Importance of the concept frequency
46 5. Conclusion Future Semasiological perspective Correlation between onomasiological and semasiological perspective Automation: techniques from distributional semantics Vector Space Models (VSMs) Type-based VSMs Token-based VSMs
47 5. Conclusion Future - Semasiological perspective Onomasiology: given a concept - how is it named? Semasiology: given a word - what does it mean? Bv. autosnelweg vs. ring ring sieraad boksring ringweg jaarring kring
48 5. Conclusion Future - Semasiological perspective Case study traffic: Onomasiologically a higher uniformity between Belgian Dutch and Netherlandic Dutch than semasiologically, in other words, we rather name things the same than that words have the same meaning.
49 5. Conclusion Future - Distributional semantics - Vector Space Models You shall know a word by the company it keeps (Firth) Type-based onomasiology: detect synonyms Token-based semasiology: detect senses
50 5. Conclusion
51 5. Conclusion
52 Thank you! Questions? Further information?
53 References 1/2 Bouma, Gosse, Gertjan van Noord & Rob Malouf Alpino: wide-coverage computational analysis of Dutch. In: Walter Daelemans, et al. (eds). Computational Linguistics in the Netherlands Amsterdam: Rodopi, Clyne, Michael Pluricentric languages: differing norms in different nations. Berlin/New York: Mouton de Gruyter. De Cnodder, Tine Convergentie en divergentie in de Nederlandse woordenschat anno Unpublished MA thesis. KU Leuven. Geeraerts, Dirk, Stefan Grondelaers & Dirk Speelman Convergentie en divergentie in de Nederlandse woordenschat: een onderzoek naar kleding- en voetbaltermen. Amsterdam: P.J. Meertens-Instituut. Grondelaers, Stefan, Dirk Geeraerts, Dirk Speelman and José Tummers Lexical standardisation in internet conversations. Comparing Belgium and The Netherlands. In: Fontana, McNally, Turell and Enric Vallduví (eds.), Proceedings of the First International Conference on Language Variation in Europe, Barcelona: Universitat Pompeu Fabra, Institut Universitari de Lingüística Aplicada, Unitat de Investigació de Variació Lingüística.
54 References 2/2 Grondelaers, Stefan, Hilde Van Aken, Dirk Speelman & Dirk Geeraerts Inhoudswoorden en preposities als standaardiseringsindicatoren. De diachrone en synchronic status van het Belgische Nederlands. Nederlandse Taalkunde 6: Heylen, Kris, Dirk Speelman & Dirk Geeraerts Looking at word meaning. An interactive visualization of Semantic Vector Spaces for Dutch synsets. In Miriam Butt, Sheelagh Carpendale, Gerald Penn, Jelena Prokic & Michael Cysouw (eds.), Proceedings of the EACL-2012 joint workshop of LINGVIS & UNCLH: Visualization of Language Patters and Uncovering Language History from Multilingual Resources, Stroudsburg: Association for Computational Linguistics. Speelman, Dirk, Stefan Grondelaers & Dirk Geeraerts Profile-based linguistic uniformity as a generic method for comparing language varieties. Computers and the Humanities 37 (3): Wielfaert, Thomas, Kris Heylen & Dirk Speelman Visualisations interactives des espaces vectoriels sémantiques pour l analyse lexicologique. Actes de SemDis 2013 : Enjeux actuels de la sémantique distributionnelle:
Tracking change in word meaning
Overview Intro DisSem Previous Case Visualisation Conclusion References Tracking change in word meaning A dynamic visualization of diachronic distributional semantics Kris Heylen, Thomas Wielfaert & Dirk
Doe wat je niet laten kan: A usage-based analysis of Dutch causative constructions. Natalia Levshina
Doe wat je niet laten kan: A usage-based analysis of Dutch causative constructions Natalia Levshina RU Quantitative Lexicology and Variational Linguistics Faculteit Letteren Subfaculteit Taalkunde K.U.Leuven
Distributional Semantic Modelling in Cognitive Sociolinguistics: QLVL probes Semantic Space
Overview CogSoLx Onomas Semas Conclusion Distributional Semantic Modelling in Cognitive Sociolinguistics: QLVL probes Semantic Space Kris Heylen, Dirk Geeraerts & Dirk Speelman KU Leuven Quantitative Lexicology
Comparing constructicons: A cluster analysis of the causative constructions with doen in Netherlandic and Belgian Dutch.
Comparing constructicons: A cluster analysis of the causative constructions with doen in Netherlandic and Belgian Dutch Natalia Levshina Outline 1. Dutch causative Cx with doen 2. Data and method 3. Quantitative
Doe wat je niet laten kan: A usage-based analysis of Dutch causative constructions
Doe wat je niet laten kan: A usage-based analysis of Dutch causative constructions Supervisors: Natalia Levshina Dirk Geeraerts Dirk Speelman University of Leuven RU Quantitative Lexicology and Variational
Varieties of lexical variation
Dirk Geeraerts University of Leuven Varieties of lexical Abstract This paper presents the theoretical backgr ound of a large-scale lexicological research project on lexical that was carried out at the
The Chat Box Revelation On the chat language of Flemish adolescents and young adults
!"#$%&'(*+,&(-,.,+/$"#('0 1**234567875549:#,(-; &81**2456787
Acquiring grammatical gender in northern and southern Dutch. Jan Klom, Gunther De Vogelaer
Acquiring grammatical gender in northern and southern Acquring grammatical gender in southern and northern 2 Research questions How does variation relate to change? (transmission in Labov 2007 variation
LASSY: LARGE SCALE SYNTACTIC ANNOTATION OF WRITTEN DUTCH
LASSY: LARGE SCALE SYNTACTIC ANNOTATION OF WRITTEN DUTCH Gertjan van Noord Deliverable 3-4: Report Annotation of Lassy Small 1 1 Background Lassy Small is the Lassy corpus in which the syntactic annotations
Cultural Trends and language change
Cultural Trends and language change Gosse Bouma [email protected] Information Science University of Groningen NHL 2015/03 Gosse Bouma 1/25 Popularity of Wolf in English books Gosse Bouma 2/25 Google Books
A chart generator for the Dutch Alpino grammar
June 10, 2009 Introduction Parsing: determining the grammatical structure of a sentence. Semantics: a parser can build a representation of meaning (semantics) as a side-effect of parsing a sentence. Generation:
Annotation Guidelines for Dutch-English Word Alignment
Annotation Guidelines for Dutch-English Word Alignment version 1.0 LT3 Technical Report LT3 10-01 Lieve Macken LT3 Language and Translation Technology Team Faculty of Translation Studies University College
Explaining register and sociolinguistic variation in the lexicon: Corpus studies on Dutch
Explaining register and sociolinguistic variation in the lexicon: Corpus studies on Dutch Published by LOT phone: +31 30 253 6006 Trans 10 e-mail: [email protected] 3512 JK Utrecht http://www.lotschool.nl The
Applying quantitative methods to dialect Dutch verb clusters
Applying quantitative methods to dialect Dutch verb clusters Jeroen van Craenenbroeck KU Leuven/CRISSP [email protected] 1 Introduction Verb cluster ordering is a well-known area of microparametric
Degrees of adverbialization
Hendrik De Smet (KU Leuven) [email protected] Muriel Norde (Humboldt Universität zu Berlin) [email protected] Kristel Van Goethem (F.R.S.-FNRS & Université catholique de Louvain)
Crossing Corpora. Modelling Semantic Similarity across Languages and Lects.
Distributional Models Bilectal Bilingual Crossing Corpora. Modelling Semantic Similarity across Languages and Lects. Yves Peirsman Supervisors: Dirk Geeraerts & Dirk Speelman Quantitative Lexicology and
LEXICOGRAPHIC ISSUES IN COMBINATORICS
LEXICOGRAPHIC ISSUES IN COMBINATORICS Editing multiple idiomatic phraseological units for book and CD-ROM Lineke OPPENTOCHT, Utrecht, The Netherlands Abstract In dictionaries dating from pre-computerized
Study programmes 2011-2012 Faculty of Philosophy and Letters Master in Modern Languages and Literatures : German, Dutch and English, Teaching Focus
First Year Core courses common to the study paths Choose two courses out of the following : LITT0003-1 Issues in comparative Modern Literature - Michel DELVILLE - Suppl : Véronique BRAGARD LING0012-1 Semantics
Thematic File Road Safety 5 Distraction in Traffic
Summary Thematic File Road Safety 5 Distraction in Traffic Summary Thematic File Road Safety 5 - Distraction in Traffic Thematic File Road Safety 5 Distraction in Traffic (2015) D/2015/0779/51 Author:
Doe wat je niet laten kan: A usage-based analysis of Dutch causative constructions
FACULTEIT LETTEREN SUBFACULTEIT TAALKUNDE KATHOLIEKE UNIVERSITEIT LEUVEN Doe wat je niet laten kan: A usage-based analysis of Dutch causative constructions Proefschrift ingediend tot het behalen van de
Distinguishing Prepositional Complements from Fixed Arguments
POSTER SESSION Distinguishing Prepositional Complements from Fixed Arguments M. Begoña Villada Moirón Alfa-mformatica. University of Groningen P.O. Box 716. 9700 AS Groningen The Netherlands [email protected]
Project notes of CLARIN project DiscAn: Towards a Discourse Annotation system for Dutch language corpora
Project notes of CLARIN project DiscAn: Towards a Discourse Annotation system for Dutch language corpora Ted Sanders University Utrecht Utrecht Institute of Linguistics Trans 10 NL-3512 JK Utrecht [email protected]
Processing: current projects and research at the IXA Group
Natural Language Processing: current projects and research at the IXA Group IXA Research Group on NLP University of the Basque Country Xabier Artola Zubillaga Motivation A language that seeks to survive
~ We are all goddesses, the only problem is that we forget that when we grow up ~
~ We are all goddesses, the only problem is that we forget that when we grow up ~ This brochure is Deze brochure is in in English and Dutch het Engels en Nederlands Come and re-discover your divine self
Post-doctoral researcher, Faculty of Translation Studies, University College Ghent
Lieve Macken Faculty of Translation Studies Groot-Brittanniëlaan 45 B-9000, Ghent Belgium email: [email protected] url: lt3.hogent.be/en/people/lieve-macken/ Born: June 17, 1968 Belgium Nationality:
ABN AMRO Bank N.V. The Royal Bank of Scotland N.V. ABN AMRO Holding N.V. RBS Holdings N.V. ABN AMRO Bank N.V.
Op 6 februari 2010 is de naam ABN AMRO Bank N.V. (geregistreerd bij de Kamer van Koophandel onder nummer 33002587) gewijzigd in The Royal Bank of Scotland N.V. Op 1 april 2010 is de naam van ABN AMRO Holding
The University of Amsterdam s Question Answering System at QA@CLEF 2007
The University of Amsterdam s Question Answering System at QA@CLEF 2007 Valentin Jijkoun, Katja Hofmann, David Ahn, Mahboob Alam Khalid, Joris van Rantwijk, Maarten de Rijke, and Erik Tjong Kim Sang ISLA,
Corporate Security & Identity
ir. Yvan De Mesmaeker Secretary general ir. Yvan De Mesmaeker Secretary general of the European Corporate Security Association - ECSA Education: MSc in Engineering Professional responsibilities: Secretary
SAND: Relation between the Database and Printed Maps
SAND: Relation between the Database and Printed Maps Erik Tjong Kim Sang Meertens Institute [email protected] May 16, 2014 1 Introduction SAND, the Syntactic Atlas of the Dutch Dialects,
University of Marburg, RC Deutscher Sprachatlas University of Leuven, RU Quantitative Lexicology and Variational Linguistics
Construction Grammar meets Semantic Vector Spaces: A radically data-driven approach to semantic classification of slot fillers Natalia Levshina Kris Heylen University of Marburg, RC Deutscher Sprachatlas
Cambridge International Examinations Cambridge International General Certificate of Secondary Education
Cambridge International Examinations Cambridge International General Certificate of Secondary Education DUTCH 0515/01 Paper 1 Listening For Examination from 2015 SPECIMEN MARK SCHEME Approx. 45 minutes
Master in Digital Humanities
Master in Digital Humanities Faculty of Arts Faculty of Psychology and Educational Sciences Faculty of Social Sciences Faculty of Sciences Department of Computer Science KU Leuven. Inspiring the outstanding.
NAAR NEDERLAND HANDLEIDING
NAAR NEDERLAND HANDLEIDING www.naarnederland.nl 1. Introduction As of 15 March 2006, certain foreign nationals wishing to settle in the Netherlands for a prolonged period who require a provisional residence
Metadata for Corpora PATCOR and Domotica-2
July 2013 Technical Report: KUL/ESAT/PSI/1303 Metadata for Corpora PATCOR and Domotica-2 Tessema N., Ons B., van de Loo J., Gemmeke J.F., De Pauw G., Daelemans W., Van hamme H. Katholieke Universiteit
Comparing Ontology-based and Corpusbased Domain Annotations in WordNet.
Comparing Ontology-based and Corpusbased Domain Annotations in WordNet. A paper by: Bernardo Magnini Carlo Strapparava Giovanni Pezzulo Alfio Glozzo Presented by: rabee ali alshemali Motive. Domain information
Clustering Connectionist and Statistical Language Processing
Clustering Connectionist and Statistical Language Processing Frank Keller [email protected] Computerlinguistik Universität des Saarlandes Clustering p.1/21 Overview clustering vs. classification supervised
Example-based Translation without Parallel Corpora: First experiments on a prototype
-based Translation without Parallel Corpora: First experiments on a prototype Vincent Vandeghinste, Peter Dirix and Ineke Schuurman Centre for Computational Linguistics Katholieke Universiteit Leuven Maria
CLARIN project DiscAn :
CLARIN project DiscAn : Towards a Discourse Annotation system for Dutch language corpora Ted Sanders Kirsten Vis Utrecht Institute of Linguistics Utrecht University Daan Broeder TLA Max-Planck Institute
Dialect Corpora Taken Further: The DynaSAND corpus and its application in newer tools
PACLIC 24 Proceedings 759 Dialect Corpora Taken Further: The DynaSAND corpus and its application in newer tools Jan Pieter Kunst a and Franca Wesseling b a Meertens Institute, Royal Netherlands Academy
Sentence Semantics. General Linguistics Jennifer Spenader, February 2006 (Most slides: Petra Hendriks)
Sentence Semantics General Linguistics Jennifer Spenader, February 2006 (Most slides: Petra Hendriks) Data to be analyzed (1) Maria slaapt. (2) Jan slaapt. (3) Maria slaapt en Jan slaapt. (4) Iedereen
Jolien Grandia MA MSc
Jolien Grandia MA MSc Curriculum Vitae Jolien Grandia MA MSC Address: E- mail: Faculty of Social Science Department of public Administration P.O. Box 1738, room T17-45 3000 DR Rotterdam The Netherlands
Jolien Grandia MA MSc
Jolien Grandia MA MSc Curriculum Vitae Jolien Grandia Address: E-mail: Website: Erasmus University Rotterdam Burgemeester Oudlaan 50 3062 PA Rotterdam The Netherlands [email protected] www.joliengrandia.com
Asking what. a person looks like some persons look like
1 Asking what people are wearing: What are you wearing? = Wat heb jij aan? I m / I am wearing jeans (or a dress or ) = Ik heb jeans aan. What is he / she wearing? = Wat heeft hij / zij aan? He / she s
Hybrid Strategies. for better products and shorter time-to-market
Hybrid Strategies for better products and shorter time-to-market Background Manufacturer of language technology software & services Spin-off of the research center of Germany/Heidelberg Founded in 1999,
The polysemy of lexeme formation rules
The polysemy of lexeme formation rules Delphine Tribout & Olivier Bonami Workshop Semantics of derivational morphology: Empirical evidence and theoretical modeling July 01, 2014 Introduction We address
Your boldest wishes concerning online corpora: OpenSoNaR and you
1 Your boldest wishes concerning online corpora: OpenSoNaR and you Martin Reynaert TiCC, Tilburg University and CLST, Radboud Universiteit Nijmegen TiCC Colloquium, Tilburg University. October 16th, 2013
Timeline (1) Text Mining 2004-2005 Master TKI. Timeline (2) Timeline (3) Overview. What is Text Mining?
Text Mining 2004-2005 Master TKI Antal van den Bosch en Walter Daelemans http://ilk.uvt.nl/~antalb/textmining/ Dinsdag, 10.45-12.30, SZ33 Timeline (1) [1 februari 2005] Introductie (WD) [15 februari 2005]
Curriculum Vitae - Maike van Damme
Curriculum Vitae - Maike van Damme Personal data Maike Catharina Dimphena van Damme Assistant professor IMPALLA Research fellow CEPS/INSTEAD/ KU Leuven 3 Avenue de la Fonte, L-4364 Esch-sur-Alzette, Luxembourg
Pascale Simons - Curriculum Vitae
Pascale Simons - Curriculum Vitae Personal details Name Pascale Anna Maria Simons Title MSc Address Henri Dunantstraat 131 3620 Lanaken Belgium Phone nr: ++31-6-15015071 E-mail: [email protected]
Corpus and Discourse. The Web As Corpus. Theory and Practice MARISTELLA GATTO LONDON NEW DELHI NEW YORK SYDNEY
Corpus and Discourse The Web As Corpus Theory and Practice MARISTELLA GATTO B L O O M S B U R Y LONDON NEW DELHI NEW YORK SYDNEY Contents List of Figures xiii List of Tables xvii Preface xix Acknowledgements
How To Kill A Black Person
Do big data hide or reveal stories? Recording history in the NewsReader project Funded by the EU - FP7 Work Programme Call FP7-ICT-2011-8 Research theme Information and Communication Technologies, challenge
Nominative-Dative Inversion and the Decline of Dutch
Nominative-Dative Inversion and the Decline of Dutch Amsterdam Center for Language and Communication, University of Amsterdam (E-mail: [email protected]) 1. Introduction Recently, linguistics was in the centre
Sum of all paintings opening slide Introduce myself. Nlwp, Commons, Wikidata, GLAMwiki, bots, Wiki Loves Monuments, uploads, Based on Wikimania 2015
Sum of all paintings opening slide Introduce myself. Nlwp, Commons, Wikidata, GLAMwiki, bots, Wiki Loves Monuments, uploads, Based on Wikimania 2015 presentation. Not a lot of overlap in people! 1 Galleries,
Ling 201 Syntax 1. Jirka Hana April 10, 2006
Overview of topics What is Syntax? Word Classes What to remember and understand: Ling 201 Syntax 1 Jirka Hana April 10, 2006 Syntax, difference between syntax and semantics, open/closed class words, all
De veranderende samenleving. De smartphone revolutie
De veranderende samenleving De smartphone revolutie GfK Retail and Technology Wie zijn wij? GfK Retail and Technology The Netherlands 21 November 2011 Consumenten Electronica, Telecom, IT, Klein- & Groot
Voorbeeld. Preview ISO 14518 INTERNATIONAL STANDARD. Cranes Requirements for test loads
INTERNATIONAL STANDARD ISO 14518 First edition 2005-02-01 Cranes Requirements for test loads Dit document mag slechts op een stand-alone PC worden geinstalleerd. Gebruik op een netwerk is alleen. toestaan
Verbondstraat 52, B- 2000 Antwerp, Belgium Telephone + 32 3 290 48 37
CURRICULUM VITAE Personal information Name Ann Laenen Address Verbondstraat 52, B- 2000 Antwerp, Belgium Telephone + 32 3 290 48 37 E-mail [email protected] Nationality Belgian Date of birth 21. 06.1966 Telephone
Taal & Tongval 2015. Borrowing. Pragmatic and Variational Linguistic Approaches. Gent, KANTL, 27 November
Taal & Tongval 2015 Borrowing Pragmatic and Variational Linguistic Approaches Gent, KANTL, 27 November Organizing Committee Reinhild Vandekerckhove Eline Zenner Gunther De Vogelaer Dirk Geeraerts Ben
Automatic assignment of Wikipedia encyclopedic entries to WordNet synsets
Automatic assignment of Wikipedia encyclopedic entries to WordNet synsets Maria Ruiz-Casado, Enrique Alfonseca and Pablo Castells Computer Science Dep., Universidad Autonoma de Madrid, 28049 Madrid, Spain
TAPAS. Arnt Offringa Director R&D Fokker Aerostructures Ypenburg, 12 juni 2013
TAPAS Arnt Offringa Director R&D Fokker Aerostructures Ypenburg, 12 juni 2013 Wat hebben zij gemeen? 2 Product of the Netherlands 3 focus op technologieën met groeipotentieel 4 Low weight producten door
Market Intelligence & Research Services. CRM Trends Overview. MarketCap International BV Januari 2011
Market Intelligence & Research Services CRM Trends Overview MarketCap International BV Januari 2011 Index 1. CRM Trends generiek 2. CRM & IT 3. CRM in Nederland 2011 2 Index 1. CRM Trends generiek 2. CRM
Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives
Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives Ramona Enache and Adam Slaski Department of Computer Science and Engineering Chalmers University of Technology and
Lexical Competition: Round in English and Dutch
Lexical Competition: Round in English and Dutch Joost Zwarts * Abstract This paper studies the semantic division of labour between three Dutch words, om, rond and rondom, all three corresponding to the
Kansen in KP7 NMP. Aansluitend op de HTSM Roadmap Nanotechnologie. 11 juni 2012. Melvin A. Kasanrokijat
Kansen in KP7 NMP Aansluitend op de HTSM Roadmap Nanotechnologie 11 juni 2012 Melvin A. Kasanrokijat Mogelijkheden in KP7 - Cooperation Groot programma met 10 verschillende thema s NMP, ICT, Health, Energy,
Curriculum Vitae. Sophie Op de Beeck
Curriculum Vitae Sophie Op de Beeck PERSONALIA Name: Op de Beeck Surname: Sophie Professional address: KU Leuven Public Management Institute Parkstraat 45 - bus 3609-3000 Leuven - Belgium Tel.: +32 (0)
Tooway. consumenten prijzen KASAT. Tooway Nederland Rian BV Bergstraat 25 5581BL Waalre 040-2213656 [email protected]. Prijzen zijn excl 21% BTW
consumenten prijzen KASAT Tooway Prijzen zijn excl 2% BTW Tooway Nederland Rian BV Bergstraat 25 558BL Waalre 040-223656 [email protected] l www.skylogic.com PRICELIST HARDWARE KU MATERIAL: SET COMPLET
Natural answer presentation. through revision. of syntactic patterns
University of Twente Faculty: Electrical Engineering, Mathematics & Computer Science Department: Computer Science Group: Language, Knowledge & Interaction Natural answer presentation through revision of
Students Conceptions of Constructivist Learning. Sofie Loyens
Students Conceptions of Constructivist Learning Sofie Loyens ISBN 978-90-8559-275-4 Copyright 2007 S.M.M. Loyens All rights reserved. No part of this thesis may be reproduced or transmitted in any form,
Brill s rule-based PoS tagger
Beáta Megyesi Department of Linguistics University of Stockholm Extract from D-level thesis (section 3) Brill s rule-based PoS tagger Beáta Megyesi Eric Brill introduced a PoS tagger in 1992 that was based
MULTIFUNCTIONAL DICTIONARIES
In: A. Zampolli, A. Capelli (eds., 1984): The possibilities and limits of the computer in producing and publishing dictionaries. Linguistica Computationale III, Pisa: Giardini, 279-288 MULTIFUNCTIONAL
