Curation Report KEMPENSCH TAALEIGEN
|
|
|
- Clyde Shelton
- 9 years ago
- Views:
Transcription
1 Curation Report KEMPENSCH TAALEIGEN BERGEIJKS DIALECTWOORDENBOEK CLARIN NL Data Curation Service Version 1, 8 oktober 2013 Henk van den Heuvel CLST, Radboud University Nijmegen
2 1. Introduction There are various small local dialect dictionaries for the province of Noord Brabant in the Netherlands. One of these dictionaries is: Panken, P.N. (1850) Kempensch taaleigen. Bergeijk: Johan Biemans [red. 2010]. This dictionary contains dialect entries for the village Bergeijk and surroundings in Noord Brabant and is added to the curated version as a PDF file. In this report we report upon the curation of this dictionary. This dictionary was offered for curation by prof dr Jos Swanenberg. The entries were manually enriched with a Dutch keyword, before they were provide to the DCS. Each record contains the following information: Field name dialectwoord trefwoord English dialectword Dutch keyword begrip grammaticale informatie voorbeeldzinnen Sense Grammatical information Example sentences Further information is known and added to the curated version: Kloeke = K279p Area = Noord Brabant / Dutch Brabant Place = Bergeijk Sourcebook = Panken, P.N. (1850) Kempensch taaleigen. Bergeijk: Johan Biemans [red. 2010] 2. Data The dictionary was provided in as text dump of SQL. The fields mentioned above were split and
3 converted into a CSV file (tab separated) in UTF8 encoding. 3. Metadata Parts of the Limburgian and Brabant dialect dictionaries (WLD and WBD) were digitized in the 1 CLARIN NL COAVA project. In the COAVA project a CMDI profile was developed by Folkert de Vriend for WBD and WLD. This profile was extended by the DCS to a more general profile for Dutch Dialect Dictionaries and published in the as WND (Woordenbank van de Nederlandse Dialecten). This profile was used to generate the CMDI metadatafile for this dictionary. 4. Restructuring the database The TAB separated files were used as starting point for converting the data into LMF format. 5. Converting formats 2 The TAB separated files were converted to an LMF format. The LMF model for dialect dictionary data was developed by the DCS in close cooperation with Menzo Windhouwer. During this process dialectologists were consulted as to the proper inclusion and naming of lexical features in the model. The model consists of three main classes for a Lexical Entry : Sense, Form,. is a new class in the model. Keyword (trefwoord in Dutch) is the only mandatory feature for a lexical entry in the model. Next, the data of the dictionary were fitted into the model as shown in the table below. Kempensch Taaleigen trefwoord LMF Form Keyword= 1 Refer to 2 LMF: Lexical Markup Framework:
4 dialectwoord Begrip grammaticale informatie voorbeeldzinnen boek Form Representation Dialectform= Sense Meaning= Form Representation GrammaticalInfo= Context Example= Definition sourcebook=panken, P.N. (1850) Kempensch taaleigen. Bergeijk: Johan Biemans [red. 2010] bron place=bergeijk area=noord Brabant / Dutch Brabant kloeke kloeke=k279p A corresponding LMF file was created including the LMF categories in the table above. 6. Documentation Provided in this Curation Report. Relevant information about the dictionary and its design can be found in the book:panken, P.N. (1850) Kempensch taaleigen. Bergeijk: Johan Biemans [red. 2010] (in Dutch) 7. Persistent identifiers
5 Persistent identifiers were attributed by the CLARIN Data Centre (Meertens Institute). 8. Transfer data to CLARIN data centre The curated dictionary consisting of the lmf file, the dictionary/book as PDF file, this curation report and a cmdi metadata file are stored at the Meertens Institute as CLARIN data centre. Metadata harvesting and accessibility are taken care of of by Meertens.
A Unified Structure for Dutch Dialect Dictionary Data
A Unified Structure for Dutch Dialect Dictionary Data Folkert de Vriend 1, Lou Boves 1,2, Henk van den Heuvel 1, Roeland van Hout 2, Joep Kruijsen 2, Jos Swanenberg 2 1 Centre for Language and Speech Technology
How To Develop A Project For The Netherlands National Library
Project name: CLARIN NL Project number: 184.021.003 Reporting Period: Jan 1, 2010 t/m December 31, 2010 Report Submitted by (name address data): Prof. Dr. Jan Odijk Professor of Language and Speech Technology
The Syntactic Atlas of the Dutch Dialects
The Syntactic Atlas of the Dutch Dialects A corpus of elicited speech as an on-line Dynamic Atlas Sjef Barbiers & Jan Pieter Kunst Meertens Institute (KNAW) 1 Coordination Hans Bennis (Meertens Institute)
Component MetaData Infrastructure
It s fun to play with the Component MetaData Infrastructure Using component metadata Dieter Van Uytvanck Max Planck Institute for Psycholinguistics [email protected] Overview Traditional metadata
SAND: Relation between the Database and Printed Maps
SAND: Relation between the Database and Printed Maps Erik Tjong Kim Sang Meertens Institute [email protected] May 16, 2014 1 Introduction SAND, the Syntactic Atlas of the Dutch Dialects,
How To Create A Clarin Metadata Infrastructure
Creating & Testing CLARIN Metadata Components Folkert de Vriend (1), Daan Broeder (2), Griet Depoorter (3), Laura van Eerten (3), Dieter van Uytvanck (2) 1) Meertens Institute Joan Muyskenweg 25, Amsterdam,
LEXUS: a web based lexicon tool
LEXUS: a web based lexicon tool Jacquelijn Ringersma Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands Content Max Planck Institute Archive of linguistic resources Tool support (archiving
Curation report - Dutch Bilingual Database
Curation report - Dutch Bilingual Database CLARIN-NL Data Curation Service November 2013 / February 2014 Maaske Treurniet, Henk van den Heuvel, Vanja de Lint, Eric Sanders CLST, Radboud University Nijmegen
Sustainable Solutions for Endangered Languages Data: The Language Archive
Charting Vanishing Voices: A Collaborative Workshop to Map Endangered Oral Cultures World Oral Literature Project 2012 Workshop CRASSH, Cambridge Sustainable Solutions for Endangered Languages Data: The
http://www.guido.be/intranet/enqueteoverview/tabid/152/ctl/eresults...
1 van 70 20/03/2014 11:55 EnqueteDescription 2 van 70 20/03/2014 11:55 3 van 70 20/03/2014 11:55 4 van 70 20/03/2014 11:55 5 van 70 20/03/2014 11:55 6 van 70 20/03/2014 11:55 7 van 70 20/03/2014 11:55
Your boldest wishes concerning online corpora: OpenSoNaR and you
1 Your boldest wishes concerning online corpora: OpenSoNaR and you Martin Reynaert TiCC, Tilburg University and CLST, Radboud Universiteit Nijmegen TiCC Colloquium, Tilburg University. October 16th, 2013
CLARIN-NL Third Call: Closed Call
CLARIN-NL Third Call: Closed Call CLARIN-NL launches in its third call a Closed Call for project proposals. This called is only open for researchers who have been explicitly invited to submit a project
Introduction to the Digital Literacy Instructor
Introduction to the Digital Literacy Instructor Helmer Strik Department of Linguistics Centre for Language and Speech Technology (CLST), The Netherlands Newcastle meeting millennium bridge Target group
Dialect Corpora Taken Further: The DynaSAND corpus and its application in newer tools
PACLIC 24 Proceedings 759 Dialect Corpora Taken Further: The DynaSAND corpus and its application in newer tools Jan Pieter Kunst a and Franca Wesseling b a Meertens Institute, Royal Netherlands Academy
The Language Archive at the Max Planck Institute for Psycholinguistics. Alexander König (with thanks to J. Ringersma)
The Language Archive at the Max Planck Institute for Psycholinguistics Alexander König (with thanks to J. Ringersma) Fourth SLCN Workshop, Berlin, December 2010 Content 1.The Language Archive Why Archiving?
Applying quantitative methods to dialect Dutch verb clusters
Applying quantitative methods to dialect Dutch verb clusters Jeroen van Craenenbroeck KU Leuven/CRISSP [email protected] 1 Introduction Verb cluster ordering is a well-known area of microparametric
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 INTELLIGENT MULTIDIMENSIONAL DATABASE INTERFACE Mona Gharib Mohamed Reda Zahraa E. Mohamed Faculty of Science,
Search and Information Retrieval
Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search
DC, MODS and CERIF-XML
DC, MODS and CERIF-XML A Tale of Two Cultures Ed Simons Radboud University Nijmegen, NL. Some personal data Ed Simons Workplace: Information Centre (UCI) of Radboud University UCI takes care of all IT-services
System Requirements for Archiving Electronic Records PROS 99/007 Specification 1. Public Record Office Victoria
System Requirements for Archiving Electronic Records PROS 99/007 Specification 1 Public Record Office Victoria Version 1.0 April 2000 PROS 99/007 Specification 1: System Requirements for Archiving Electronic
OpenDocument Format. The future of ODF. Jos van den Oever Logius / KOOP Ministery for the Interior The Netherlands
OpenDocument Format The future of ODF Jos van den Oever Logius / KOOP Ministery for the Interior The Netherlands Jos van den Oever Ministery of the Interior The Netherlands What is the point of ODF? application-independent
Next Generation Sequencing; Technologies, applications and data analysis
; Technologies, applications and data analysis Course 2542 Dr. Martie C.M. Verschuren Research group Analysis techniques in Life Science, Breda Prof. dr. Johan T. den Dunnen Leiden Genome Technology Center,
FoLiA: Format for Linguistic Annotation
Maarten van Gompel Radboud University Nijmegen 20-01-2012 Introduction Introduction What is FoLiA? Generalised XML-based format for a wide variety of linguistic annotation Characteristics Generalised paradigm
Next Generation Sequencing; Technologies, applications and data analysis
; Technologies, applications and data analysis Course 2542 Dr. Martie C.M. Verschuren Research group Analysis techniques in Life Science, Breda Prof. dr. Johan T. den Dunnen Leiden Genome Technology Center,
Die Vielfalt vereinen: Die CLARIN-Eingangsformate CMDI und TCF
Die Vielfalt vereinen: Die CLARIN-Eingangsformate CMDI und TCF Susanne Haaf & Bryan Jurish Deutsches Textarchiv 1. The Metadata Format CMDI Metadata? Metadata Format? and more Metadata? Metadata Format?
Project notes of CLARIN project DiscAn: Towards a Discourse Annotation system for Dutch language corpora
Project notes of CLARIN project DiscAn: Towards a Discourse Annotation system for Dutch language corpora Ted Sanders University Utrecht Utrecht Institute of Linguistics Trans 10 NL-3512 JK Utrecht [email protected]
Using Dataverse Virtual Archive Technology for Research Data Management. Jonathan Crabtree Thu-Mai Christian Amanda Gooch
Using Dataverse Virtual Archive Technology for Research Data Management Jonathan Crabtree Thu-Mai Christian Amanda Gooch H. W. Odum Institute Archive Services The Howard W. Odum Institute was founded in
Mobility Tool+ Guide for Beneficiaries of the Erasmus+ programme
EUROPEAN COMMISSION DIRECTORATE-GENERAL FOR EDUCATION AND CULTURE Education and vocational training; Coordination of Erasmus+ Coordination of National Agencies Erasmus+ Mobility Tool+ Guide for Beneficiaries
Mobility Tool+ Guide for Beneficiaries of the Erasmus+ programme
EUROPEAN COMMISSION DIRECTORATE-GENERAL FOR EDUCATION AND CULTURE Education and vocational training; Coordination of Erasmus+ Coordination of National Agencies Erasmus+ Mobility Tool+ Guide for Beneficiaries
Next Generation Sequencing; Technologies, applications and data analysis
; Technologies, applications and data analysis Course 2542 Dr. Martie C.M. Verschuren Avans Hogeschool Research group Analysis techniques in Life Science, Breda Prof. dr. Johan T. den Dunnen Leiden Genome
Assessment of low-frequency noise due to windturbines in relation to low-frequency background
Assessment of low-frequency noise due to windturbines in relation to low-frequency background noise Eugène de Beer ([email protected]) Peutz bv, Paletsingel 2, 2718 NT, Zoetermeer, The Netherlands. Summary
4.16 National CARARE Workshop in the Netherlands
4.16 National CARARE Workshop in the Netherlands Organisation of the Workshop The Dutch heritage in a European Perspective workshop was organised jointly by the three partners from the Netherlands: Hella
Nevada NSF EPSCoR Track 1 Data Management Plan
Nevada NSF EPSCoR Track 1 Data Management Plan August 1, 2011 INTRODUCTION Our data management plan is driven by the overall project goals and aims to ensure that the following are achieved: Assure that
Input by KPS on Call for advice by EIOPA For the review of IORP II
Input by KPS on Call for advice by EIOPA For the review of IORP II KPS study group International pensions October 11, 2011 General remark relating to Call for advice from EIOPA for review of IORP II Page
Breeze ediscovery Suite 3.5.0
Introductions to Breeze Workflows: (1) Converting Native Files to TIF with Bates Stamp and (2) Deduplication: Eliminating Duplicate Native Files in Data Sets Convert Native Files to TIF with Bates Stamp
The Chat Box Revelation On the chat language of Flemish adolescents and young adults
!"#$%&'(*+,&(-,.,+/$"#('0 1**234567875549:#,(-; &81**2456787
De Nederlandse topsporter en het antidopingbeleid
De Nederlandse topsporter en het antidopingbeleid De Nederlandse topsporter en het antidopingbeleid Oktober 2010 Irene Eijs (Mindshare Research), Arno Havenga (NL Sporter), Olivier de Hon (Dopingautoriteit)
Executive summary. Executive summary 8
Executive summary Q fever is a zoonosis an infectious disease that can be transmitted from animals to humans caused by the bacterium Coxiella burnetii (C. burnetii). Until 2006, Q fever was a rare disease
CLARIN-NL CALL 3 Proposal Verrijkt Koninkrijk
1. Project Title & Acronym and Abstract Title: Verrijkt Koninkrijk (Enriched Kingdom) Acronym: VK Abstract: Dr Loe de Jong s Het Koninkrijk der Nederlanden in de Tweede Wereldoorlog remains the most appealing
Natural Language to Relational Query by Using Parsing Compiler
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,
NWO-DANS Data Contracts
NWO-DANS Data Contracts Information about the mandatory set of data to be made available from studies carried out with support from the NWO programs for Humanities and Social Sciences >>> NWO-DANS Data
[email protected] 2005 2009 Tulsa Community College (Associate of Arts -Psychology)
Curriculum Vitae Personal Information Surname: Given Name: Trujillo James Paul Address: Paulinastraat 62 Postal code, city and country: Email: 2595GK, Den Haag, NL [email protected] Date of birth:
Tibiscus University, Timişoara
PDF/A standard for long term archiving Ramona Vasilescu Tibiscus University, Timişoara ABSTRACT. PDF/A is defined by ISO 19005-1 as a file format based on PDF format. The standard provides a mechanism
Turning Emergency Plans into Executable
Turning Emergency Plans into Executable Artifacts José H. Canós-Cerdá, Juan Sánchez-Díaz, Vicent Orts, Mª Carmen Penadés ISSI-DSIC Universitat Politècnica de València, Spain {jhcanos jsanchez mpenades}@dsic.upv.es
Baan IV Tools. Baan IV Database Administration (DBA) on Windows NT
Baan IV Tools Baan IV Database Administration (DBA) on Windows NT A publication of: Baan Development B.V. P.O.Box 143 3770 AC Barneveld The Netherlands Printed in the Netherlands Baan Development B.V.
RHS VISUAL ICD 9 to ICD 10 Conversion
RHS VISUAL ICD 9 to ICD 10 Conversion ICD 10 is mandatory as of 10/1/2015 There are two main areas that help you convert your residents ICD 9 codes to ICD 10: 1. The ICD 10 DB (Dashboard) button on the
How to translate VisualPlace
Translation tips 1 How to translate VisualPlace The international language support in VisualPlace is based on the Rosette library. There are three sections in this guide. It starts with instructions for
Police Academy of the Netherlands
Police Academy of the Netherlands Opportunities & Threats for Police Universities Drs Harry Peeters Photo: Thea van den Heuvel FEATURES Police Academy of the Netherlands RENEWAL OF POLICE EDUCATION & RESEARCH
Next Generation Sequencing; Technologies, applications and data analysis
; Technologies, applications and data analysis Course 2542 Dr. Martie C.M. Verschuren Avans Hogeschool Research group Analysis techniques in Life Science, Breda Prof. dr. Johan T. den Dunnen Leiden Genome
WHERE IS THE DUTCH OER LIBRARIAN?
2015 Open and Online Education Trend Report 44 WHERE IS THE DUTCH OER LIBRARIAN? by Hilde van Wijngaarden and Frederike Vernimmen A growing portion of teaching materials are available online. How is this
M.A.Lips. Copyright 2014, M.A.Lips, Amsterdam, the Netherlands
Roux-en-Y Gastric Bypass and Calorie Restriction: Differences and Similarities of Endocrine and Metabolic Effects in Obesity and Type 2 Diabetes Mellitus Mirjam Anne Lips Roux-en-Y Gastric Bypass and Calorie
Zeynep Azar. English Teacher, Açı Private Primary School, Istanbul, Turkey Azar, E.Z.
Zeynep Azar Date/Place of birth : 13 November 1988, Bursa, Turkey Nationality : Turkish Address : Bisschop Zwijsenstraat 103-01 Zipcode, Residence : 5021KB, Tilburg, Netherlands Phone number : +31 (0)
Buurten van gemeente Groningen
Page 1 of 7 Buurten van gemeente Groningen Shapefile Tags Buurtindeling Groningen Summary buurtindeling van de gemeente Groningen, buurten zijn samengesteld uit subbuurten Description Het bestand buurtindeling.shp
National Background: Netherlands 2015
Date of issue: March 2015 National Background: Netherlands 2015 André Bouwman (Universitaire Bibliotheken Leiden) This report has been compiled with the kind assistance of Ad Leerintveld (Koninklijke Bibliotheek)
The University of Amsterdam s Question Answering System at QA@CLEF 2007
The University of Amsterdam s Question Answering System at QA@CLEF 2007 Valentin Jijkoun, Katja Hofmann, David Ahn, Mahboob Alam Khalid, Joris van Rantwijk, Maarten de Rijke, and Erik Tjong Kim Sang ISLA,
Microinvest Warehouse Pro Light Restaurant is designed to work in tandem with Microinvest Warehouse Pro which provides all back office functions.
Important to know! Microinvest Warehouse Pro Light Restaurant is designed to work in tandem with Microinvest Warehouse Pro which provides all back office functions. When you start up the restaurant module
Acquiring grammatical gender in northern and southern Dutch. Jan Klom, Gunther De Vogelaer
Acquiring grammatical gender in northern and southern Acquring grammatical gender in southern and northern 2 Research questions How does variation relate to change? (transmission in Labov 2007 variation
The Victim and Compensation: the Dutch approach Alex Sas Victim Support NL
The Victim and Compensation: the Dutch approach Alex Sas Victim Support NL What is this about? Compensation of the victim in the criminal procedure in the Netherlands. EU Directive Have Member States an
SQL*Plus s Forgotten History
SQL*Plus s Forgotten History Harald van Breederode Senior Principal DBA Instructor Oracle University NL You use SQL*Plus on a daily basis to perform various DBA activities but wonder why the SQL*Plus command
There are various ways to find data using the Hennepin County GIS Open Data site:
Finding Data There are various ways to find data using the Hennepin County GIS Open Data site: Type in a subject or keyword in the search bar at the top of the page and press the Enter key or click the
Technical Report. The KNIME Text Processing Feature:
Technical Report The KNIME Text Processing Feature: An Introduction Dr. Killian Thiel Dr. Michael Berthold [email protected] [email protected] Copyright 2012 by KNIME.com AG
REDCap General Security Overview
REDCap General Security Overview Introduction REDCap is a web application for building and managing online surveys and databases, and thus proper security practices must instituted on the network and server(s)
MTEC Legislation Programme : 23 November - 4 December, The Hague
MTEC Legislation Programme : 23 November - 4 December, The Hague Monday 23/11 Week 1 09.15 10.00 Official Opening: introduction to the course and introduction of the participants Edward Vriends (Ministry
Province of North Brabant: Enhancing Efficiency by Integrating Geographic Information into SAP ERP
2014 SAP SE or an SAP affiliate company. All rights reserved. Province of North Brabant: Enhancing Efficiency by Integrating Geographic Information into SAP ERP Organization Province of North Brabant Location
ESISS Security Scanner
ESISS Security Scanner How to use the ESISS Automated Security Scanner January 2013 v1.1 Table of Contents The ESISS Automated Security Scanner... 3 Using The ESISS Security Scanner... 4 1. Logging On...
Electronic Remittance Advice (ERA) Processor
Electronic Remittance Advice (ERA) Processor Users Guide Brief description of the Electronic Remittance Advice (835) or Electronic EOB A Remittance Advice (RA) is a notice of payments and adjustments sent
How To Plan An Ambulance In The Netherlands
RBS Care The intelligence system for the dispatch centre Version: 1.0 First International Workshop on Emergency Services Planning Amsterdam, June 25-27, 2014 AGENDA Welcome The company: Witte Kruis The
TUTORIAL: Reporting Gold-Vision 6
Reporting Using SQL reporting Services Tutorial Objectives: Introduction to Gold-Vision Reporting Standard Reports Searching for a Report Running a Standard Report Viewing a Report Exporting Data Example
