Applying quantitative methods to dialect Dutch verb clusters
Jeroen van Craenenbroeck
KU Leuven/CRISSP

1 Introduction

Verb cluster ordering is a well-known area of microparametric variation within Germanic (Barbiers & Bennis, 2010; Wurmbrand, 2005). For example, out of the six theoretically possible orderings in the three-verb cluster in (1), four are attested in Dutch dialects (Barbiers et al., 2008):

(1) a. Ik vind dat iedereen moet kunnen zwemmen.
       I find that everyone must can swim
       'I think everyone should be able to swim.'
    b. Ik vind dat iedereen moet zwemmen kunnen.
    c. Ik vind dat iedereen zwemmen moet kunnen.
    d. Ik vind dat iedereen zwemmen kunnen moet.
    e. *Ik vind dat iedereen kunnen zwemmen moet.
    f. *Ik vind dat iedereen kunnen moet zwemmen.
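As a small illustration of the count in (1), the six logically possible orders can be enumerated mechanically. The sketch below assumes a numbering that is not part of the text above: 1 for the highest verb (moet), 2 for the verb it selects (kunnen), and 3 for the main verb (zwemmen). The attested set simply encodes (1a-d).

```python
from itertools import permutations

# Assumed numbering (not in the original text): 1 selects 2, 2 selects 3.
VERBS = {1: "moet", 2: "kunnen", 3: "zwemmen"}

# Orders (1a)-(1d), which the text reports as attested in Dutch dialects.
ATTESTED = {(1, 2, 3), (1, 3, 2), (3, 1, 2), (3, 2, 1)}


def cluster_orders():
    """Return every logically possible order of the three-verb cluster."""
    return list(permutations(sorted(VERBS)))


for order in cluster_orders():
    mark = "" if order in ATTESTED else "*"  # star marks unattested orders
    label = "-".join(str(i) for i in order)
    words = " ".join(VERBS[i] for i in order)
    print(f"{mark}{label}: {words}")
```

Under this numbering the two unattested orders, (1e) and (1f), are 2-3-1 and 2-1-3.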
In this talk I show that a quantitative analysis of these data can lead to new insights into the theory of verb clusters.

2 Methodology

I examine the raw data from 8 maps in the Syntactic Atlas of the Dutch Dialects (Barbiers et al., 2008): 4 containing two-verb clusters and 4 containing three-verb clusters, for a total of 28 possible cluster orderings. Based on Spruit's (2008) version of the Hamming distance algorithm given in (2), I measure the differences in verb cluster ordering between 185 dialects of Dutch.

(2) Hamming distance algorithm (Spruit, 2008, p. 36)
    For each pair of dialects A and B, for each variant of all syntactic features, if it does occur in dialect A, but does not occur in dialect B, or if it does not occur in dialect A, but does occur in dialect B, increment the distance between dialect A and B by 1.

The result is a symmetric matrix, a small portion of which is shown in figure 1, that lists, for each pair of dialects, the difference between them with respect to the 28 cluster orderings under investigation. This matrix was then analyzed using Multidimensional Scaling (implemented in R with cmdscale), which reduced the 185-dimensional variational space to a 2-dimensional one, in which each dialect received two coordinates representing its relative similarity to the other 184 dialects. The outcome of this procedure is shown in figure 2.
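The distance computation in (2) can be sketched as follows. This is a minimal illustration, not the code used for the study: it assumes each dialect is represented as the set of cluster orderings attested in it, and the dialect names and data are invented.

```python
def hamming_distance(a, b):
    """Count the variants that occur in exactly one of the two dialects."""
    return len(a ^ b)  # symmetric difference of the two variant sets


def distance_matrix(dialects):
    """Return the sorted dialect names and the symmetric distance matrix."""
    names = sorted(dialects)
    matrix = [[hamming_distance(dialects[p], dialects[q]) for q in names]
              for p in names]
    return names, matrix


# Hypothetical toy data: three invented dialects and their attested orders.
toy = {
    "A": {"1-2-3", "1-3-2"},
    "B": {"1-2-3", "3-1-2"},
    "C": {"3-2-1"},
}

names, dist = distance_matrix(toy)
for name, row in zip(names, dist):
    print(name, row)
```

Using sets makes the two conditions in (2) a single symmetric difference: a variant contributes 1 exactly when it is attested in one dialect but not the other.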
Figure 1: Cluster ordering differences between Dutch dialects

Figure 2: Two-dimensional representation of verb cluster ordering differences
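The scaling step behind figure 2 was carried out with R's cmdscale, which performs classical (Torgerson) multidimensional scaling. The Python sketch below illustrates the same idea under stated assumptions: it double-centres the squared distances and extracts the two leading eigenvectors by power iteration, a simple stand-in for a proper eigen-solver. The function names and the test matrix are illustrative only, and the power iteration assumes that the leading eigenvalues are positive and well separated.

```python
import math
import random


def double_center(dist):
    """Return B = -1/2 * J * D^2 * J for a symmetric distance matrix D."""
    n = len(dist)
    sq = [[d * d for d in row] for row in dist]
    row_mean = [sum(row) / n for row in sq]  # equals column means (symmetry)
    grand_mean = sum(row_mean) / n
    return [[-0.5 * (sq[i][j] - row_mean[i] - row_mean[j] + grand_mean)
             for j in range(n)] for i in range(n)]


def top_eigenpair(mat, iters=1000, seed=0):
    """Approximate the dominant eigenvalue and unit eigenvector."""
    n = len(mat)
    rng = random.Random(seed)
    vec = [rng.random() for _ in range(n)]  # fixed-seed random start vector
    for _ in range(iters):
        nxt = [sum(m * v for m, v in zip(row, vec)) for row in mat]
        norm = math.sqrt(sum(x * x for x in nxt))
        if norm == 0.0:  # matrix maps the vector to zero: nothing to extract
            return 0.0, [0.0] * n
        vec = [x / norm for x in nxt]
    # The Rayleigh quotient of the unit vector estimates the eigenvalue.
    val = sum(v * sum(m * w for m, w in zip(row, vec))
              for v, row in zip(vec, mat))
    return val, vec


def classical_mds(dist, dims=2):
    """Return one coordinate list of length `dims` per row of `dist`."""
    mat = double_center(dist)
    n = len(mat)
    coords = [[] for _ in range(n)]
    for _ in range(dims):
        val, vec = top_eigenpair(mat)
        scale = math.sqrt(max(val, 0.0))  # negative eigenvalues are clipped
        for i in range(n):
            coords[i].append(scale * vec[i])
        # Deflate: remove the component just found before the next pass.
        mat = [[mat[i][j] - val * vec[i] * vec[j] for j in range(n)]
               for i in range(n)]
    return coords


# Hypothetical check: distances between the corners of a 4-by-3 rectangle.
rect = [[0, 4, 5, 3],
        [4, 0, 3, 5],
        [5, 3, 0, 4],
        [3, 5, 4, 0]]
coords = classical_mds(rect)
```

For distances that are exactly Euclidean in two dimensions, as in the rectangle check, the recovered coordinates reproduce the input distances up to rotation and reflection. Hamming distances need not be Euclidean, so for real dialect data the two-dimensional configuration is only an approximation.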
3 Interpretation

Three clusters can be discerned in figure 2: dialects with an x-coordinate greater than 5, dialects with a y-coordinate greater than 2, and dialects in the rectangle with corners (-6,-5) and (5,2). Moreover, dialects with a y-coordinate below -5 do not seem to pattern with any other dialects. Interestingly, if these groups are represented on a map of the Dutch language area, the result is surprisingly homogeneous geographically (see figure 3), suggesting that the patterns uncovered via this quantitative methodology correspond to an underlying linguistic reality. Preliminary investigations suggest that in the first pattern the main verb is placed at the left edge of the cluster; in the second pattern a participle is placed to the left of its selecting verb, but an infinitive to the right; and in the third pattern infinitives are placed to the right of their selecting verb, but participles either to the left or to the right.

Figure 3: Verb cluster ordering patterns in Dutch dialects
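The grouping described in this section can be written down as a simple rule over the two coordinates. The sketch below is only one possible reading: the thresholds come from the text, but the order in which the conditions are checked, the treatment of boundary values, and the "unclassified" label for points outside every stated region are assumptions.

```python
def assign_group(x, y):
    """Map a dialect's two MDS coordinates to one of the groups in the text."""
    if y < -5:
        return "outlier"    # does not pattern with any other dialect
    if x > 5:
        return "pattern 1"  # main verb at the left edge of the cluster
    if y > 2:
        return "pattern 2"  # participle left, infinitive right
    if -6 <= x <= 5:
        return "pattern 3"  # infinitive right, participle on either side
    return "unclassified"   # assumption: no group is stated for this region


# Hypothetical coordinates, chosen only to exercise each branch.
for point in [(7, 0), (0, 3), (0, 0), (0, -6), (-7, 0)]:
    print(point, assign_group(*point))
```

A point that satisfies more than one condition, for example x > 5 and y > 2, is assigned by whichever test comes first here. The text does not say how such cases were resolved.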
4 Conclusion

This study shows that a purely quantitative methodology can shed new light on the theoretical analysis of verb cluster ordering in Germanic.

References

Barbiers, S., Auwera, J. v. d., Bennis, H., Boef, E., Vogelaer, G. D., & Ham, M. v. d. (2008). Syntactische atlas van de Nederlandse dialecten. Deel II. Amsterdam: Amsterdam University Press.

Barbiers, S., & Bennis, H. (2010). De plaats van het werkwoord in zuid en noord. In J. D. Caluwe & J. V. Keymeulen (Eds.), Voor Magda. Artikelen voor Magda Devos bij haar afscheid van de Universiteit Gent (pp ). Gent: Academia.

Spruit, M. R. (2008). Quantitative perspectives on syntactic variation in Dutch dialects. Unpublished doctoral dissertation, Universiteit van Amsterdam.

Wurmbrand, S. (2005). Verb clusters, verb raising, and restructuring. In M. Everaert & H. v. Riemsdijk (Eds.), The Blackwell Companion to Syntax (Vol. V, pp ). Oxford: Blackwell.
