German Language Processing Thesis
|
|
|
- Rachel Richards
- 5 years ago
- Views:
Transcription
1 Yannick Versley Institut für Computerlinguistik Im Neuenheimer Feld Heidelberg Telephone: WWW: Yannick Versley Diplom-Informatiker, Dr. Phil. Computerlinguistik General Information Date of Birth September 14th, 1979 Place of Birth Hamburg, Germany Citizenship German Languages German (native), English (near-native), French (fluent), Italian (basic), Spanish (very basic) Research Interests Use of common sense knowledge in the context of the microand macrostructure of discourse; Methods for Lexical Acquisition; Machine Learning methods for structured data; Natural Language Processing techniques for German Education University University of Tübingen, Seminar für Sprachwissenschaft. PhD in Computational Linguistics Thesis title: Resolving Coreferent Bridging in German Newspaper text Grade: Magna cum laude; Thesis Advisor: Prof. Erhard Hinrichs University of Hamburg, Department of Computer Science Degree obtained: Informatik-Diplom Thesis title: Tagging kausaler Relationen Grade: 1.4 (sehr gut); Thesis Advisor: Prof. Christopher Habel School Gymnasium Osterbek, Hamburg Abitur; Grade: 1.4 (sehr gut) Lycée Français de Hambourg, Hamburg Work Experience 2013-current University of Heidelberg, Institute for Computational Linguistics Visiting professor ( Professurvertretung ) University of Tübingen, Collaborative Research Center 833 Research associate in the project A3: Desambiguierung von Diskurskonnektoren mit korpusinduzierten semantischen Relationen 2009 University of Trento, Center for Mind/Brain Sciences (CiMeC) Research fellow in the project LiveMemories University of Tübingen, Collaborative Research Center 441
2 Work Experience (continued) Research associate in the project A1: Representation and Automatic Acquisition of Linguistic Data Part time / student employment 2007 Johns Hopkins Summer Workshop, Project Encyclopedic and Lexical Knowledge for Entity Disambiguation Graduate Research Team Member University of Hamburg, Knowledge and Language Processing group: Student research assistant 2001 Internship at Mummert+Partner, Hamburg Customer-specific ABAP programming (SAP R/3) Bitsdontbyte GbR, Hamburg Lotus Notes programming in LotusScript and Java; Java Servlets; Apple WebObjects University of Hamburg Tutor Praktische Informatik I (FB Inf.), Java-Programmierung (RRZ) Hamburger Bildungsserver, Hamburg Linux installation in schools 1997 Internship at Ergole Informatique, Grenoble GUI programming for Windows using C Teaching 2015 Mathematical foundations for CL Structured Inference for NLP applications (Hauptseminar) 2014 Mathematical foundations for CL Introduction to Computational Linguistics Multimodal Semantics (Hauptseminar) NLP methods for Digital Humanities (Proseminar) Software project (SoSe, WiSe) 2013 Introduction to Computational Linguistics Statistical Parsing (Hauptseminar) Computational Linguistics in Context (Proseminar) 2009 Anaphora Resolution (with Prof. Massimo Poesio, Kepa Rodriguez) Kurs bei der 5th DGfS Fall School, September 2009, Universität Konstanz. Teaching Assistant / Tutor Praktische Informatik I (Prof. Wolfang Menzel, Prof. Leonie Dreschler Fischer) 2001 Java-Programmierung (Bernd Eggink) Regionales Rechenzentrum (RRZ), Universität Hamburg
3 Administration Studienreformausschuss (SRA; studentisches Mitglied) Prüfungsausschuss (PA; studentisches Mitglied) Publications Journal Articles Yannick Versley (2013): A graph-based approach for implicit discourse relations. CLIN Journal 3: Yannick Versley and Anna Gastel (2013): Linguistic Tests for Discourse Relations. Stefanie Dipper, Bonnie Webber and Heike Zinsmeister (eds.): Dialogue and Discourse 4(2). Special Issue on Beyond Semantics: The Challenges of Annotating Pragmatic and Discourse Phenomena. Contributions: [YV] General conception of the paper, writing; [AG] writing; example selection Heike Telljohann, Yannick Versley, Kathrin Beck, Erhard Hinrichs and Thomas Zastrow (2013): STTS als Part-of-Speech-Tagset in Tübinger Baumbanken (in German). Journal for Language Technology and Computational Linguistics 28(1):1 16. Contributions: [HT] Details on treebanks and treebank annotation schemes, writing; [YV] Experimental part of the paper, general conception; writing; [KB, EH] Comments on details of annotation schemes. Yannick Versley (2008): Vagueness and Referential Ambiguity in a Large-scale Annotated Corpus. Massimo Poesio and Ron Artstein (eds.): Ambiguity and Semantic Judgement. Special Issue of the Journal on Research in Language and Computation. Conference papers Michael Haas and Yannick Versley (to appear): Subsentential Sentiment on a Shoestring: A Crosslingual Analysis of Compositional Classification. Accepted for: 2015 Conference of the North American Chapter of the Association for Computational Linguistics Human Language Technologies (NAACL HLT 2015). Contributions: [MH] Implementation and experimental work; [YV] General conception for the conference paper, research supervision, writing. Yannick Versley (2013): Graph-based Classification of Explicit and Implicit Discourse Relations. International Conference on Computational Semantics (IWCS 2013), Potsdam, Germany. Yannick Versley (2011): Multilabel tagging of discourse relations in ambiguous temporal connectives. Proceedings of Recent Advances in Natural Language Processing (RANLP 2011). Samuel Broscheid, Simone Ponzetto, Yannick Versley and Massimo Poesio (2010): Extending BART to provide a coreference resolution system for German. Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010) Contributions: [SB] Implementation of coreference features for German, experiments; [SP] supervision of SB, writing; [YV] Data conversions, German preprocessing and mention extraction, writing; [MP] general ideas and comments
4 Publications (continued) Massimo Poesio, Olga Uryupina and Yannick Versley (2010): Creating a Coreference Resolution System for Italian. Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010) Contributions: [MP] supervision of work, general ideas, writing; [OU] Implementation of coreference features for Italian, experiments, writing; [YV] Preprocessing for Italian; Italian-specific adaptations for the BART framework. Yannick Versley, Kathrin Beck, Erhard Hinrichs and Heike Telljohann (2010): A Syntax-first approach to High-quality Morphological Analysis and Lemma Disambiguation for the TüBa-D/Z Treebank. Proceedings of the 9th Conference on Treebanks and Linguistic Theories (TLT9). Contributions: [YV] Lemmatizer implementation and experiments, general conception writing; [KB, EH, HT] annotation guidelines for closed-class lemmas, general description on the treebank, supervision of the lemma annotation of the gold standard used. Yannick Versley and Ines Rehbein (2009): Scalable Discriminative Parsing for German. International Conference on Parsing Technology (IWPT 09). Contributions: [YV] Parser implementation, experiments, paper conception, writing; [IR] General discussion, insights on the Tiger annotation scheme. Yannick Versley, Alessandro Moschitti, Massimo Poesio and Xiaofeng Yang (2008): Coreference Systems based on Kernel Methods. Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008). [YV] Integration of Kernel-based learning into BART, expletive kernel, experiments, general conception, writing; [AM] word sequence kernel, writing; [XY] binding kernel; [MP] general discussion, general conception, comments on paper. Yannick Versley (2007): Antecedent Selection Techniques for High-Recall Coreference Resolution. Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP- CoNLL). Yannick Versley (2007): Using the Web to Resolve Coreferent Bridging in German Newspaper Text. Proceedings der GLDV-Frühjahrestagung Workshop Papers Yannick Versley (2014): Experiments with Easy-first nonprojective constituent parsing. Proceedings of the First Joint Workshop on Statistical Parsing of Morphologically Rich Languages and Syntactic Analysis of Non-Canonical Languages. Yannick Versley (2013): SFS-TUE: Compound Paraphrasing with a Language Model and Discriminative Reranking. Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval 2013), Atlanta, US. Yannick Versley (2012): Supervised Learning of German Qualia Relations. ACL 2012 Joint Workshop on Statistical Parsing and Semantic Processing of Morphologically Rich Languages (SP-Sem-MRL 2012) Yannick Versley and Yana Panchenko (2012): Not Just Bigger: Towards Better- Quality Web Corpora. Proceedings of the 7th Web as Corpus Workshop at WWW2012 (WAC7). Yannick Versley (2011): Towards finer-grained tagging of discourse connectives. AG Beyond Semantics, Deutsche Gesellschaft für Sprachwissenschaft (DGfS 2011). Yannick Versley (2010): Discovery of Ambiguous and Unambiguous Discourse Connectives via Annotation Projection. Workshop on the Annotation and Exploitation of Parallel Corpora (AEPC).
5 Publications (continued) Marta Recasens, Lluís Màrquez, Emili Sapena, M. Antònia Martí, Mariona Taulé, Véronique Hoste, Massimo Poesio, and Yannick Versley (2010): SemEval-2010 Task 1: Coreference Resolution in Multiple Languages. In Proceedings of the ACL Workshop on Semantic Evaluations (SemEval-2010). Reut Tsarfaty, Djamé Seddah, Yoav Goldberg, Sandra Kuebler, Yannick Versley, Marie Candito, Jennifer Foster, Ines Rehbein and Lamia Tounsi (2010): Statistical Parsing of Morphologically Rich Languages (SPMRL): What, How and Whither. Proceedings of the NAACL HLT 2010 First Workshop on Statistical Parsing of Morphologically-Rich Languages. Yannick Versley (2008): Decorrelation and Shallow Semantic Patterns for Distributional Clustering of Nouns and Verbs. Stefan Evert and Marco Baroni (eds.), Proceedings of the ESSLLI 08 Workshop on Distributional Lexical Semantics. Yannick Versley, Simone Paolo Ponzetto, Massimo Poesio, Vladimir Eidelman, Alan Jern, Jason Smith, Xiaofeng Yang, Alessandro Moschitti (2008): BART: A Modular Toolkit for Coreference Resolution. In Companion Volume of the Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL 2008). Yannick Versley, Holger Wunsch and Heike Zinsmeister (2007): A Pilot Study on Computer-aided Coreference Annotation. Constantin Orasan and Sandra Kübler (eds.) Proceedings of the International Workshop on Computer Aided Language Processing (CALP) Yannick Versley and Heike Zinsmeister (2006): From Dependency Parsing to Deep(er) Semantics. Proceedings of the Fifth International Workshop on Treebanks and Linguistic Theories (TLT 2006). Yannick Versley (2006): A Constraint-based Approach to Noun Phrase Coreference Resolution in German Newspaper Text. Konferenz zur Verarbeitung Natürlicher Sprache (KONVENS 2006). Yannick Versley (2006): Disagreement Dissected: Vagueness as a Source of Ambiguity in Nominal (Co-)Reference. Ron Artstein and Massimo Poesio (eds.), Proceedings of the ESSLLI 2006 Workshop on Ambiguity in Anaphora Yannick Versley (2005): Parser Evaluation across Text Types. Proceedings of the Fourth Workshop on Treebanks and Linguistic Theories (TLT 2005). Schilder, F., Versley, Y., and Habel, Ch. (2004) Extracting spatial information: grounding, classifying and linking spatial expressions. Ross Purves and Christopher B. Jones (eds.), SIGIR Workshop on Geographic Information Retrieval. Edited Volumes Yoav Goldberg, Yuval Marton, Yannick Versley, Özlem Cetinoǧlu, Ines Rehbein, Joel Tetrault, Sandra Kübler, Djamé Seddah and Reut Tsarfaty (2014): Proceedings of the First Joint Workshop on Statistical Parsing of Morphologically Rich Languages and Syntactic Analysis of Non-Canonical Languages (SPMRL-SANCL 2014). Yoav Goldberg, Yuval Marton, Ines Rehbein, Yannick Versley, Sandra Kübler, Djamé Seddah and Reut Tsarfaty (2013): Proceedings of the Fourth Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2013). Yves Peirsman, Yannick Versley and Tim Van de Cruys (2009): Proceedings of the CogSci 2009 Workshop on Distributional Semantics beyond Concrete Concepts (DisCo 2009).
6 Publications (continued) Massimo Poesio, Roland Stuckardt and Yannick Versley (in preparation): Anaphora Resolution. Book in preparation, to be published by Springer. Sam Featherston and Yannick Versley (in preparation): Firm Foundations: Quantitative Studies of Sentence Grammar and Grammatical Change in Germanic. Book in preparation, to be published by De Gruyter in the Trends in Linguistics. Studies and Monographs (TiLSM) series. Theses Yannick Versley (2010) Resolving Coreferent Bridging in German Newspaper Text. PhD Thesis, Seminar für Sprachwissenschaft, Universität Tübingen. Yannick Versley (2004) Tagging kausaler Relationen (in German). Diploma Thesis.
Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg
Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg March 1, 2007 The catalogue is organized into sections of (1) obligatory modules ( Basismodule ) that
WebLicht: Web-based LRT services for German
WebLicht: Web-based LRT services for German Erhard Hinrichs, Marie Hinrichs, Thomas Zastrow Seminar für Sprachwissenschaft, University of Tübingen [email protected] Abstract This software
Comprendium Translator System Overview
Comprendium System Overview May 2004 Table of Contents 1. INTRODUCTION...3 2. WHAT IS MACHINE TRANSLATION?...3 3. THE COMPRENDIUM MACHINE TRANSLATION TECHNOLOGY...4 3.1 THE BEST MT TECHNOLOGY IN THE MARKET...4
Chapter 8. Final Results on Dutch Senseval-2 Test Data
Chapter 8 Final Results on Dutch Senseval-2 Test Data The general idea of testing is to assess how well a given model works and that can only be done properly on data that has not been seen before. Supervised
CURRICULUM VITAE SILKE BRANDT
CURRICULUM VITAE SILKE BRANDT CONTACT Silke Brandt, PhD English Department Nadelberg 6 CH-4051 Basel Switzerland [email protected] POSITIONS 2011-present Postdoctoral researcher English Department
Search and Data Mining: Techniques. Text Mining Anya Yarygina Boris Novikov
Search and Data Mining: Techniques Text Mining Anya Yarygina Boris Novikov Introduction Generally used to denote any system that analyzes large quantities of natural language text and detects lexical or
Research Portfolio. Beáta B. Megyesi January 8, 2007
Research Portfolio Beáta B. Megyesi January 8, 2007 Research Activities Research activities focus on mainly four areas: Natural language processing During the last ten years, since I started my academic
Generating SQL Queries Using Natural Language Syntactic Dependencies and Metadata
Generating SQL Queries Using Natural Language Syntactic Dependencies and Metadata Alessandra Giordani and Alessandro Moschitti Department of Computer Science and Engineering University of Trento Via Sommarive
Structure of the talk. The semantics of event nominalisation. Event nominalisations and verbal arguments 2
Structure of the talk Sebastian Bücking 1 and Markus Egg 2 1 Universität Tübingen [email protected] 2 Rijksuniversiteit Groningen [email protected] 12 December 2008 two challenges for a
Semantic Mapping Between Natural Language Questions and SQL Queries via Syntactic Pairing
Semantic Mapping Between Natural Language Questions and SQL Queries via Syntactic Pairing Alessandra Giordani and Alessandro Moschitti Department of Computer Science and Engineering University of Trento
Clustering Connectionist and Statistical Language Processing
Clustering Connectionist and Statistical Language Processing Frank Keller [email protected] Computerlinguistik Universität des Saarlandes Clustering p.1/21 Overview clustering vs. classification supervised
Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words
, pp.290-295 http://dx.doi.org/10.14257/astl.2015.111.55 Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words Irfan
Sentiment analysis for news articles
Prashant Raina Sentiment analysis for news articles Wide range of applications in business and public policy Especially relevant given the popularity of online media Previous work Machine learning based
Empirical Machine Translation and its Evaluation
Empirical Machine Translation and its Evaluation EAMT Best Thesis Award 2008 Jesús Giménez (Advisor, Lluís Màrquez) Universitat Politècnica de Catalunya May 28, 2010 Empirical Machine Translation Empirical
Interactive Dynamic Information Extraction
Interactive Dynamic Information Extraction Kathrin Eichler, Holmer Hemsen, Markus Löckelt, Günter Neumann, and Norbert Reithinger Deutsches Forschungszentrum für Künstliche Intelligenz - DFKI, 66123 Saarbrücken
An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines)
An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines) James Clarke, Vivek Srikumar, Mark Sammons, Dan Roth Department of Computer Science, University of Illinois, Urbana-Champaign.
Automatic Detection and Correction of Errors in Dependency Treebanks
Automatic Detection and Correction of Errors in Dependency Treebanks Alexander Volokh DFKI Stuhlsatzenhausweg 3 66123 Saarbrücken, Germany [email protected] Günter Neumann DFKI Stuhlsatzenhausweg
Veronika VINCZE, PhD. PERSONAL DATA Date of birth: 1 July 1981 Nationality: Hungarian
Veronika VINCZE, PhD CONTACT INFORMATION Hungarian Academy of Sciences Research Group on Artificial Intelligence Tisza Lajos krt. 103., 6720 Szeged, Hungary Phone: +36 62 54 41 40 Mobile: +36 70 22 99
Natural Language to Relational Query by Using Parsing Compiler
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,
NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR
NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR Arati K. Deshpande 1 and Prakash. R. Devale 2 1 Student and 2 Professor & Head, Department of Information Technology, Bharati
Special Topics in Computer Science
Special Topics in Computer Science NLP in a Nutshell CS492B Spring Semester 2009 Jong C. Park Computer Science Department Korea Advanced Institute of Science and Technology INTRODUCTION Jong C. Park, CS
ALEXANDER KOLLER July 2015
ALEXANDER KOLLER July 2015 Focus Area Cognitive Sciences [email protected] University of Potsdam http://www.ling.uni-potsdam.de/ koller/ Karl-Liebknecht-Str. 24-25 phone: +49 331 977 2692 14476
Open Domain Information Extraction. Günter Neumann, DFKI, 2012
Open Domain Information Extraction Günter Neumann, DFKI, 2012 Improving TextRunner Wu and Weld (2010) Open Information Extraction using Wikipedia, ACL 2010 Fader et al. (2011) Identifying Relations for
Motivation. Korpus-Abfrage: Werkzeuge und Sprachen. Overview. Languages of Corpus Query. SARA Query Possibilities 1
Korpus-Abfrage: Werkzeuge und Sprachen Gastreferat zur Vorlesung Korpuslinguistik mit und für Computerlinguistik Charlotte Merz 3. Dezember 2002 Motivation Lizentiatsarbeit: A Corpus Query Tool for Automatically
TOOL OF THE INTELLIGENCE ECONOMIC: RECOGNITION FUNCTION OF REVIEWS CRITICS. Extraction and linguistic analysis of sentiments
TOOL OF THE INTELLIGENCE ECONOMIC: RECOGNITION FUNCTION OF REVIEWS CRITICS. Extraction and linguistic analysis of sentiments Grzegorz Dziczkowski, Katarzyna Wegrzyn-Wolska Ecole Superieur d Ingenieurs
Shallow Parsing with Apache UIMA
Shallow Parsing with Apache UIMA Graham Wilcock University of Helsinki Finland [email protected] Abstract Apache UIMA (Unstructured Information Management Architecture) is a framework for linguistic
Mahesh Srinivasan. Assistant Professor of Psychology and Cognitive Science University of California, Berkeley
Department of Psychology University of California, Berkeley Tolman Hall, Rm. 3315 Berkeley, CA 94720 Phone: (650) 823-9488; Email: [email protected] http://ladlab.ucsd.edu/srinivasan.html Education
Parsing Software Requirements with an Ontology-based Semantic Role Labeler
Parsing Software Requirements with an Ontology-based Semantic Role Labeler Michael Roth University of Edinburgh [email protected] Ewan Klein University of Edinburgh [email protected] Abstract Software
CINTIL-PropBank. CINTIL-PropBank Sub-corpus id Sentences Tokens Domain Sentences for regression atsts 779 5,654 Test
CINTIL-PropBank I. Basic Information 1.1. Corpus information The CINTIL-PropBank (Branco et al., 2012) is a set of sentences annotated with their constituency structure and semantic role tags, composed
English Descriptive Grammar
English Descriptive Grammar 2015/2016 Code: 103410 ECTS Credits: 6 Degree Type Year Semester 2500245 English Studies FB 1 1 2501902 English and Catalan FB 1 1 2501907 English and Classics FB 1 1 2501910
Hybrid Strategies. for better products and shorter time-to-market
Hybrid Strategies for better products and shorter time-to-market Background Manufacturer of language technology software & services Spin-off of the research center of Germany/Heidelberg Founded in 1999,
Less Grammar, More Features
Less Grammar, More Features David Hall Greg Durrett Dan Klein Computer Science Division University of California, Berkeley {dlwh,gdurrett,klein}@cs.berkeley.edu Abstract We present a parser that relies
RRSS - Rating Reviews Support System purpose built for movies recommendation
RRSS - Rating Reviews Support System purpose built for movies recommendation Grzegorz Dziczkowski 1,2 and Katarzyna Wegrzyn-Wolska 1 1 Ecole Superieur d Ingenieurs en Informatique et Genie des Telecommunicatiom
Research Assistant in the Research Group: Diversity and Inclusion, Faculty of Human Sciences, University of Potsdam.
Sabrina Gerth Research Group: Diversity and Inclusion Human Sciences Faculty University of Potsdam Karl-Liebknecht-Str. 24-25 D-14476 Potsdam / Golm phone: ++49 (0)331-977-2758 email: [email protected]
SWIFT Aligner, A Multifunctional Tool for Parallel Corpora: Visualization, Word Alignment, and (Morpho)-Syntactic Cross-Language Transfer
SWIFT Aligner, A Multifunctional Tool for Parallel Corpora: Visualization, Word Alignment, and (Morpho)-Syntactic Cross-Language Transfer Timur Gilmanov, Olga Scrivner, Sandra Kübler Indiana University
Zeynep Azar. English Teacher, Açı Private Primary School, Istanbul, Turkey Azar, E.Z.
Zeynep Azar Date/Place of birth : 13 November 1988, Bursa, Turkey Nationality : Turkish Address : Bisschop Zwijsenstraat 103-01 Zipcode, Residence : 5021KB, Tilburg, Netherlands Phone number : +31 (0)
Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives
Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives Ramona Enache and Adam Slaski Department of Computer Science and Engineering Chalmers University of Technology and
Ming-Wei Chang. Machine learning and its applications to natural language processing, information retrieval and data mining.
Ming-Wei Chang 201 N Goodwin Ave, Department of Computer Science University of Illinois at Urbana-Champaign, Urbana, IL 61801 +1 (917) 345-6125 [email protected] http://flake.cs.uiuc.edu/~mchang21 Research
Stefan Engelberg (IDS Mannheim), Workshop Corpora in Lexical Research, Bucharest, Nov. 2008 [Folie 1]
Content 1. Empirical linguistics 2. Text corpora and corpus linguistics 3. Concordances 4. Application I: The German progressive 5. Part-of-speech tagging 6. Fequency analysis 7. Application II: Compounds
Automatic Pronominal Anaphora Resolution. in English Texts
Automatic Pronominal Anaphora Resolution in English Texts Tyne Liang and Dian-Song Wu Department of Computer and Information Science National Chiao Tung University Hsinchu, Taiwan Email: [email protected];
Automatic Pronominal Anaphora Resolution in English Texts
Computational Linguistics and Chinese Language Processing Vol. 9, No.1, February 2004, pp. 21-40 21 The Association for Computational Linguistics and Chinese Language Processing Automatic Pronominal Anaphora
CS 6740 / INFO 6300. Ad-hoc IR. Graduate-level introduction to technologies for the computational treatment of information in humanlanguage
CS 6740 / INFO 6300 Advanced d Language Technologies Graduate-level introduction to technologies for the computational treatment of information in humanlanguage form, covering natural-language processing
Phase 2 of the D4 Project. Helmut Schmid and Sabine Schulte im Walde
Statistical Verb-Clustering Model soft clustering: Verbs may belong to several clusters trained on verb-argument tuples clusters together verbs with similar subcategorization and selectional restriction
Customer Intentions Analysis of Twitter Based on Semantic Patterns
Customer Intentions Analysis of Twitter Based on Semantic Patterns Mohamed Hamroun [email protected] Mohamed Salah Gouider [email protected] Lamjed Ben Said [email protected] ABSTRACT
A Framework-based Online Question Answering System. Oliver Scheuer, Dan Shen, Dietrich Klakow
A Framework-based Online Question Answering System Oliver Scheuer, Dan Shen, Dietrich Klakow Outline General Structure for Online QA System Problems in General Structure Framework-based Online QA system
Curriculum Vitae. PD Dr. Boris Hirsch
Curriculum Vitae PD Dr. Boris Hirsch Address Home: Paniersplatz 35 D-90403 Nuremberg Phone: +49(0)911 / 99 44 079 Mobile: +49(0)179 / 100 22 63 E-Mail: [email protected] Office: Friedrich Alexander University
Processing: current projects and research at the IXA Group
Natural Language Processing: current projects and research at the IXA Group IXA Research Group on NLP University of the Basque Country Xabier Artola Zubillaga Motivation A language that seeks to survive
A Mixed Trigrams Approach for Context Sensitive Spell Checking
A Mixed Trigrams Approach for Context Sensitive Spell Checking Davide Fossati and Barbara Di Eugenio Department of Computer Science University of Illinois at Chicago Chicago, IL, USA [email protected], [email protected]
Transition-Based Dependency Parsing with Long Distance Collocations
Transition-Based Dependency Parsing with Long Distance Collocations Chenxi Zhu, Xipeng Qiu (B), and Xuanjing Huang Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science,
Factored Translation Models
Factored Translation s Philipp Koehn and Hieu Hoang [email protected], [email protected] School of Informatics University of Edinburgh 2 Buccleuch Place, Edinburgh EH8 9LW Scotland, United Kingdom
Introduction. Philipp Koehn. 28 January 2016
Introduction Philipp Koehn 28 January 2016 Administrativa 1 Class web site: http://www.mt-class.org/jhu/ Tuesdays and Thursdays, 1:30-2:45, Hodson 313 Instructor: Philipp Koehn (with help from Matt Post)
EDUCATIONAL REGULATION OF THE MASTER S DEGREE COURSE IN COGNITIVE SCIENCE
EDUCATIONAL REGULATION OF THE MASTER S DEGREE COURSE IN COGNITIVE SCIENCE CONTENTS Title I - Establishment and start-up... 3 Art. 1 General information... 3 Art. 2 - Initiatives for quality assurance...
Linguistics to Structure Unstructured Information
Linguistics to Structure Unstructured Information Authors: Günter Neumann (DFKI), Gerhard Paaß (Fraunhofer IAIS), David van den Akker (Attensity Europe GmbH) Abstract The extraction of semantics of unstructured
Context Grammar and POS Tagging
Context Grammar and POS Tagging Shian-jung Dick Chen Don Loritz New Technology and Research New Technology and Research LexisNexis LexisNexis Ohio, 45342 Ohio, 45342 [email protected] [email protected]
Overview of MT techniques. Malek Boualem (FT)
Overview of MT techniques Malek Boualem (FT) This section presents an standard overview of general aspects related to machine translation with a description of different techniques: bilingual, transfer,
Julia Englert, PhD Student. Curriculum Vitae
Julia Englert, PhD Student Curriculum Vitae Name: Nationality: Julia Valerie Englert German Date of Birth: April 14 th 1987 E-Mail: [email protected] Phone 0049-681-302-68563 Office Address: Saarland
ANALEC: a New Tool for the Dynamic Annotation of Textual Data
ANALEC: a New Tool for the Dynamic Annotation of Textual Data Frédéric Landragin, Thierry Poibeau and Bernard Victorri LATTICE-CNRS École Normale Supérieure & Université Paris 3-Sorbonne Nouvelle 1 rue
POSBIOTM-NER: A Machine Learning Approach for. Bio-Named Entity Recognition
POSBIOTM-NER: A Machine Learning Approach for Bio-Named Entity Recognition Yu Song, Eunji Yi, Eunju Kim, Gary Geunbae Lee, Department of CSE, POSTECH, Pohang, Korea 790-784 Soo-Jun Park Bioinformatics
Symbiosis of Evolutionary Techniques and Statistical Natural Language Processing
1 Symbiosis of Evolutionary Techniques and Statistical Natural Language Processing Lourdes Araujo Dpto. Sistemas Informáticos y Programación, Univ. Complutense, Madrid 28040, SPAIN (email: [email protected])
The PALAVRAS parser and its Linguateca applications - a mutually productive relationship
The PALAVRAS parser and its Linguateca applications - a mutually productive relationship Eckhard Bick University of Southern Denmark [email protected] Outline Flow chart Linguateca Palavras History
Antonino Freno. Curriculum Vitae. Phone (office): Office: +33 (0)3 59 35 87 27. [email protected]; http://researchers.lille.inria.fr/~freno/.
Antonino Freno Curriculum Vitae Personal Information First name: Antonino Family name: Freno Date of birth: July 1, 1980 Place of birth: Reggio Calabria (RC) Italy Citizenship: Italian Phone (office):
Testing Data-Driven Learning Algorithms for PoS Tagging of Icelandic
Testing Data-Driven Learning Algorithms for PoS Tagging of Icelandic by Sigrún Helgadóttir Abstract This paper gives the results of an experiment concerned with training three different taggers on tagged
Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information
Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information Satoshi Sekine Computer Science Department New York University [email protected] Kapil Dalwani Computer Science Department
Developing a large semantically annotated corpus
Developing a large semantically annotated corpus Valerio Basile, Johan Bos, Kilian Evang, Noortje Venhuizen Center for Language and Cognition Groningen (CLCG) University of Groningen The Netherlands {v.basile,
DEPENDENCY PARSING JOAKIM NIVRE
DEPENDENCY PARSING JOAKIM NIVRE Contents 1. Dependency Trees 1 2. Arc-Factored Models 3 3. Online Learning 3 4. Eisner s Algorithm 4 5. Spanning Tree Parsing 6 References 7 A dependency parser analyzes
Modeling coherence in ESOL learner texts
University of Cambridge Computer Lab Building Educational Applications NAACL 2012 Outline 1 2 3 4 The Task: Automated Text Scoring (ATS) ATS systems Discourse coherence & cohesion The Task: Automated Text
Architecture of an Ontology-Based Domain- Specific Natural Language Question Answering System
Architecture of an Ontology-Based Domain- Specific Natural Language Question Answering System Athira P. M., Sreeja M. and P. C. Reghuraj Department of Computer Science and Engineering, Government Engineering
ANALYSIS OF LEXICO-SYNTACTIC PATTERNS FOR ANTONYM PAIR EXTRACTION FROM A TURKISH CORPUS
ANALYSIS OF LEXICO-SYNTACTIC PATTERNS FOR ANTONYM PAIR EXTRACTION FROM A TURKISH CORPUS Gürkan Şahin 1, Banu Diri 1 and Tuğba Yıldız 2 1 Faculty of Electrical-Electronic, Department of Computer Engineering
University of Münster, Institute of Political Science, Scharnhorststraße 100, 48151 Münster, Germany. Email: thomas.dietz@uni-muenster.
Professor Dr. Thomas Dietz University of Münster, Institute of Political Science, Scharnhorststraße 100, 48151 Münster, Germany Email: [email protected] CURRENT POSITION University of Münster,
Statistical Machine Translation
Statistical Machine Translation Some of the content of this lecture is taken from previous lectures and presentations given by Philipp Koehn and Andy Way. Dr. Jennifer Foster National Centre for Language
Transformation of Free-text Electronic Health Records for Efficient Information Retrieval and Support of Knowledge Discovery
Transformation of Free-text Electronic Health Records for Efficient Information Retrieval and Support of Knowledge Discovery Jan Paralic, Peter Smatana Technical University of Kosice, Slovakia Center for
Customizing an English-Korean Machine Translation System for Patent Translation *
Customizing an English-Korean Machine Translation System for Patent Translation * Sung-Kwon Choi, Young-Gil Kim Natural Language Processing Team, Electronics and Telecommunications Research Institute,
Text Mining - Scope and Applications
Journal of Computer Science and Applications. ISSN 2231-1270 Volume 5, Number 2 (2013), pp. 51-55 International Research Publication House http://www.irphouse.com Text Mining - Scope and Applications Miss
Protein-protein Interaction Passage Extraction Using the Interaction Pattern Kernel Approach for the BioCreative 2015 BioC Track
Protein-protein Interaction Passage Extraction Using the Interaction Pattern Kernel Approach for the BioCreative 2015 BioC Track Yung-Chun Chang 1,2, Yu-Chen Su 3, Chun-Han Chu 1, Chien Chin Chen 2 and
Genre distinctions and discourse modes: Text types differ in their situation type distributions
Genre distinctions and discourse modes: Text types differ in their situation type distributions Alexis Palmer and Annemarie Friedrich Department of Computational Linguistics Saarland University, Saarbrücken,
Curriculum Vitae. CV P. Khader 1 of 5. Patrick H. Khader PD Dr. rer. nat., Dipl.-Psych. Born: April 19, 1976 Citizenship: German
CV P. Khader 1 of 5 Curriculum Vitae Name: Patrick H. Khader PD Dr. rer. nat., Dipl.-Psych. Born: April 19, 1976 Citizenship: German Work Address: Ludwig Maximilian University of Munich Department of Psychology
CS4025: Pragmatics. Resolving referring Expressions Interpreting intention in dialogue Conversational Implicature
CS4025: Pragmatics Resolving referring Expressions Interpreting intention in dialogue Conversational Implicature For more info: J&M, chap 18,19 in 1 st ed; 21,24 in 2 nd Computing Science, University of
Analysis of EU PhD Education and Research. Prof. Dr. Hans G. Sonntag, MF Heidelberg
Analysis of EU PhD Education and Research Prof. Dr. Hans G. Sonntag, MF Heidelberg History of Doctoral Degrees Doctoral degrees as old as universities Universities had the permission to award doctoral
Curriculum Vitae Ruben Sipos
Curriculum Vitae Ruben Sipos Mailing Address: 349 Gates Hall Cornell University Ithaca, NY 14853 USA Mobile Phone: +1 607-229-0872 Date of Birth: 8 October 1985 E-mail: [email protected] Web: http://www.cs.cornell.edu/~rs/
Computer Assisted Language Learning (CALL): Room for CompLing? Scott, Stella, Stacia
Computer Assisted Language Learning (CALL): Room for CompLing? Scott, Stella, Stacia Outline I What is CALL? (scott) II Popular language learning sites (stella) Livemocha.com (stacia) III IV Specific sites
How To Complete The Danish Masters Program In Lct
European Masters Program in Language and Communication Technologies (LCT) Modules Handbook for Prospective Students European Masters Program in LCT - Modules Handbook Page ii Chapter 1 Study Program The
How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.
Svetlana Sokolova President and CEO of PROMT, PhD. How the Computer Translates Machine translation is a special field of computer application where almost everyone believes that he/she is a specialist.
