LINGUISTIC PROCESSING IN THE ATLAS PROJECT
|
|
|
- Myron Webb
- 10 years ago
- Views:
Transcription
1 LINGUISTIC PROCESSING IN THE ATLAS PROJECT Leonardo LESMO, Alessandro Mazzei, Daniele Radicioni Interaction Models Group Dipartimento di Informatica e Centro di Scienze Cognitive Università di Torino {lesmo,mazzei,radicion}@di.unito.it
2 Written representation of LIS (Linguaggio Italiano dei Segni: Italian Sign Language) Automatic translation from Italian to LIS Creation of linguistic corpora Definition of a virtual character with proper expressions Transmission/visualization on several media,,, ATLAS : Scientific challenges
3 ATLAS Translation Pipeline Italian Text NL to AWLIS Translator AWLIS Text AWLIS to AL Translator AL Text Input texts wri-en in Italian will be translated into an intermediate representa4on, called ATLAS Wri-en Italian Sign Language (AWLIS)
4 ATLAS Translation Pipeline Italian Text NL to AWLIS Translator AWLIS Text AWLIS to AL Translator AL Text Sentences expressed in AWLIS are then translated into a character's gestures Anima4on Language (AL) which describes the way the basic movements are produced and linked.
5 ATLAS Translation Pipeline AL Text 3D Rendering Engine Screen Motion capture Keyframing Procedural Animation Techniques AL sequences will be rendered through a 3D rendering engine. An automa4c anima4on blending system will ensure smooth transi4ons between signs of the generated LIS anima4ons
6 SL Translation Issues Translating from a spoken language to a SL is a complex undertaking: The translation must transform a symbolic representation into another symbolic representation But SLs incorporate gestures, facial expressions, head movements, body language and even the space around the speaker So, we need a way to express in symbolic form the LIS sentence, i.e. a written LIS
7 Linguistic Analysis ITALIAN SENTENCE Dictionary Chunking Rules MORPHOLOGICAL ANALYSIS AND POS TAGGING Sequence of lexical items CHUNKING Verbs + Par4al Chunks COORDINATION ANALYSIS Verbs + Final Chunks Morphological Analysis PARSER Subcategorization frames Word meaning Ontology ANALYSIS OF VERBAL DEPENDENTS Parse tree SEMANTIC ANNOTATION Annotated parse tree ONTOLOGICAL INTERPRETATION SEMANTIC INTERPRETER Ontological representa4on Ontology-to-frames mapping TRANSLATION IN ATLAS FRAMES LIS Dictionary CCG Generator CCG LIS Grammar AWLIS
8 Example Locali addensamenti potranno interessare il settore nord-orientale (local clouding could affect the North-eastern sector) Result of the Morphologic-lexical analysis+ POS Tagging): 1 Locali (LOCALE ADJ QUALIF ALLVAL PL) 2 addensamenti (ADDENSAMENTO NOUN COMMON M PL ADDENSARE TRANS) 3 potranno (POTERE VERB MOD IND FUT INTRANS 3 PL) 4 interessare (INTERESSARE VERB MAIN INFINITE PRES TRANS) 5 il (IL ART DEF M SING) 6 settore (SETTORE NOUN COMMON M SING) 7 nord-orientale (NORD-ORIENTALE ADJ QUALIF ALLVAL SING) 8. (#\. PUNCT)
9 Example (2) Result of syntactic analysis 1 Locali (LOCALE ADJ QUALIF ALLVAL PL) [2;ADJC+QUALIF-RMOD] 2 addensamenti (ADDENSAMENTO NOUN COMMON M PL ADDENSARE TRANS) [3;VERB-SUBJ] 3 potranno (POTERE VERB MOD IND FUT INTRANS 3 PL) [0;TOP-VERB] 4 interessare (INTERESSARE VERB MAIN INFINITE PRES TRANS) [3;VERB+MODAL-INDCOMPL] 4.10 t [2f] (ADDENSAMENTO NOUN COMMON M PL ADDENSARE TRANS) [4;VERB-SUBJ] 5 il (IL ART DEF M SING) [4;VERB-OBJ] 6 settore (SETTORE NOUN COMMON M SING) [5;DET+DEF-ARG] 7 nord-orientale (NORD-ORIENTALE ADJ QUALIF ALLVAL SING) [6;ADJC+QUALIF-RMOD] 8. (#\. PUNCT) [3;END] adjc+qualif- rmod locale verb- subj addensamento potere verb+modal- indcompl interessare verb- subj verb- obj t il det+def- arg se-ore adjc+qualif- rmod nord- orientale
10 Semantics: a fragment of the ontology en4ty descrip4on geogr- part- selec4on- criterium geographic- area meteo- status- situa4on 4me- interval evaluable- en4ty it- region- group sea it- geogr- area day posi4ve- eval- en4ty evening it- region posi4ve- day posi4ve- evening it- cardinal- region it- island- region it- western- region it- eastern- region it- southern- region it- northern- region it- central- region it- adria4c- central- region it- adria4c- region
11 Example (3) Semantics: an ontological expression meteo-status-situation has-subclass (and ( weather-event has-subclass clouds subclass-of weather-event domain-of &has-event-width range weather-event-width (eq local-phenomenon) ( weather-status-situation domain-of &has-weather-location range geographic-area has-subclass it-geogr-area has-instance it-northeastern-area arg-of &has-it-area7 relinstance &has-it-area-spec range it-area-spec (eq northeastern)))
12 Example (4) Semantics: First Order Predicate Logic e (possible (e) clouds (e) &has-event-width (e, local-phenomenon) &has-weather-location (e, it-northeastern-area)) Notes 1) Modal logics:?; currently reification 2) The sequence it-northeastern-area arg-of &has-it-area7 relinstance &has-it-area-spec range it-area-spec (eq northeastern) specifies that, in this context, northeastern is a linguistic description (specification) of the involved area
13 Generation 1. Starting from an ATLAS frame we compute a semantic tree constituting an intermediate semantic representation, by using IF-THEN rules 2. Starting from the semantic tree we produce the sequence of LIS glosses, by using a CCG for LIS
14 Example (4) Per domani quindi nubi in aumento al nord. (ev 1 t 1 l 1 x 1 ) (cloud-increase(ev1) time(ev 1, t 1 ) deictic-descr(t 1, t 2 ) id (t 2,tomorrow) location(ev 1, l 1 ) id(l 1, nord) agent(ev 1, x 1 ) cloud(x 1 )) <?xml version="1.0" encoding="utf-8"?> <xml> <lf> <satop nom="c1:meteo-statussituation"> <prop name="cloud-increase"/> <diamond mode="agent"> <nom name="c2:clouds"/> <prop name="cloud"/> </diamond> <diamond mode="location"> <nom name="n1:it-northern-region"/> <prop name="north"/> </diamond> <diamond mode="time"> <nom name="t1:deictic-daydescription"/> <prop name="tomorrow"/> </diamond> </satop> </lf> </xml>
15 <?xml version="1.0" encoding="utf-8"?> <xml> <lf> <satop nom="c1:meteo-statussituation"> <prop name="cloud-increase"/> <diamond mode="agent"> <nom name="c2:clouds"/> <prop name="cloud"/> </diamond> <diamond mode="location"> <nom name="n1:it-northern-region"/ > <prop name="north"/> </diamond> <diamond mode="time"> <nom name="t1:deictic-daydescription"/> <prop name="tomorrow"/> </diamond> </satop> </lf> </xml> OpenCCG LIS-CCG domani_n nord_u nuvola_u nuvolaaumentare_u...
16 Example (5) Generation: Semantic Tree <ACTOR> be T <LOCATION> group north <ATTRIBUTE> <ATTRIBUTE> local east
17 Example (6) Generation: AWLIS INVECE NORDORIENTALE L ZONA L NUVOLE invece nordorientale zona nuvole both sx dx dx std std std std
18 A note on the dictionary: The ATLAS Sign set The set of signs used within the ATLAS project includes: Signs associated with Lemmas included in the Radutzky dictionary Signs widely used within the various Italian deaf communities, but not included in the Radutzky dictionary Signs newly defined within the ATLAS project, previously not existing, and needed to represent peculiar concepts, mainly linked with meteorological events and situations
19 From AWLIS to AEWLIS: Information Tracks A sentence is represented, in AEWLIS, resorting to a set on concurrent synchronized Tracks. The following 4 Tracks (TRs) are used: Lemma: Used to store information items related to the Lemmas, including, among the others, the lemma s ID, Identifier, its Part of Speech, and the set of Modifiers for the Lemma.
20 Information Tracks (2) Sentence Structure: Used to store information items related to the Structure of the Sentence, including, among the others, proper pointer to the related syntax tree, the sentence s syntactic and semantic roles, and its speech acts.
21 Channel Modifiers: Information Tracks (3) Its usage is twofold: To record the actual values of the following sign parameters: Sign Spatial Location, for Relocatable signs Action performed by the not-signing hand, when the sign requires just one hand. To record the differences on the various Communication Channels between the actual values of the performed sign and the default ones stored in the ATLAS MultiMedia Archive (AMMA)
22 Information Tracks (4) Time Stamps: Used to store info related to the Time Stamps associated with the sentence Time Slices
23 Example (7) <ACTOR> group <ATTRIBUTE> local be T <LOCATION> north <ATTRIBUTE> east (lemma de00 nuvola) (lemma de01 zona) (lemma de02 nordorientale) (lemma p00 in) (istopic de01) (into de00 de01) (rel de00 de01) (right ndom) (left dom) (free dom) (free ndom) (is_relocatable de00) (is_relocatable de01) (not_relocatable de02) (default_position de ) (default_position de ) (default_position de ) (position de )
24 Example (6) invece nordorientale zona nuvole (lemma de00 nuvola) (lemma de01 zona) (lemma de02 nordorientale) (lemma p00 in) (istopic de01) (into de00 de01) (rel de00 de01) (right ndom) (left dom) (free dom) (free ndom) (is_relocatable de00) (is_relocatable de01) (not_relocatable de02) (default_position de ) (default_position de ) (default_position de ) (position de ) (!!handlocation both ) (!!handanimation both invece) (!!handlocation both ) (!!handanimation both nordorientale) (!!handlocation dom ) (!!handanimation dom zona) (!!handlocation both ) (!!handanimation both nuvola) (!removeentity (entity de00))
25 CONCLUSIONS Linguistic Analysis is a hard task It involves both grammatical knowledge and knowledge of the world (ontology) Though the problem is not solvable today in open-ended domains, good results can be obtained for restricted domains (as Weather Forecast) The adopted solutions can be extended to wider domains provided that the needed semantic knowledge is made available
A Virtual Interpreter for the Italian Sign Language
A Virtual Interpreter for the Italian Sign Language Vincenzo Lombardo Fabrizio Nunnari Rossana Damiano Università di Torino, Virtual Reality & Multi Media Park Current practical approach The ATLAS project
Annotation in Language Documentation
Annotation in Language Documentation Univ. Hamburg Workshop Annotation SEBASTIAN DRUDE 2015-10-29 Topics 1. Language Documentation 2. Data and Annotation (theory) 3. Types and interdependencies of Annotations
CINTIL-PropBank. CINTIL-PropBank Sub-corpus id Sentences Tokens Domain Sentences for regression atsts 779 5,654 Test
CINTIL-PropBank I. Basic Information 1.1. Corpus information The CINTIL-PropBank (Branco et al., 2012) is a set of sentences annotated with their constituency structure and semantic role tags, composed
COMPUTATIONAL DATA ANALYSIS FOR SYNTAX
COLING 82, J. Horeck~ (ed.j North-Holland Publishing Compa~y Academia, 1982 COMPUTATIONAL DATA ANALYSIS FOR SYNTAX Ludmila UhliFova - Zva Nebeska - Jan Kralik Czech Language Institute Czechoslovak Academy
Natural Language to Relational Query by Using Parsing Compiler
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,
Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg
Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg March 1, 2007 The catalogue is organized into sections of (1) obligatory modules ( Basismodule ) that
Phase 2 of the D4 Project. Helmut Schmid and Sabine Schulte im Walde
Statistical Verb-Clustering Model soft clustering: Verbs may belong to several clusters trained on verb-argument tuples clusters together verbs with similar subcategorization and selectional restriction
Special Topics in Computer Science
Special Topics in Computer Science NLP in a Nutshell CS492B Spring Semester 2009 Jong C. Park Computer Science Department Korea Advanced Institute of Science and Technology INTRODUCTION Jong C. Park, CS
A Mixed Trigrams Approach for Context Sensitive Spell Checking
A Mixed Trigrams Approach for Context Sensitive Spell Checking Davide Fossati and Barbara Di Eugenio Department of Computer Science University of Illinois at Chicago Chicago, IL, USA [email protected], [email protected]
INF5820 Natural Language Processing - NLP. H2009 Jan Tore Lønning [email protected]
INF5820 Natural Language Processing - NLP H2009 Jan Tore Lønning [email protected] Semantic Role Labeling INF5830 Lecture 13 Nov 4, 2009 Today Some words about semantics Thematic/semantic roles PropBank &
Sense-Tagging Verbs in English and Chinese. Hoa Trang Dang
Sense-Tagging Verbs in English and Chinese Hoa Trang Dang Department of Computer and Information Sciences University of Pennsylvania [email protected] October 30, 2003 Outline English sense-tagging
LINGSTAT: AN INTERACTIVE, MACHINE-AIDED TRANSLATION SYSTEM*
LINGSTAT: AN INTERACTIVE, MACHINE-AIDED TRANSLATION SYSTEM* Jonathan Yamron, James Baker, Paul Bamberg, Haakon Chevalier, Taiko Dietzel, John Elder, Frank Kampmann, Mark Mandel, Linda Manganaro, Todd Margolis,
What s in a Lexicon. The Lexicon. Lexicon vs. Dictionary. What kind of Information should a Lexicon contain?
What s in a Lexicon What kind of Information should a Lexicon contain? The Lexicon Miriam Butt November 2002 Semantic: information about lexical meaning and relations (thematic roles, selectional restrictions,
Timeline (1) Text Mining 2004-2005 Master TKI. Timeline (2) Timeline (3) Overview. What is Text Mining?
Text Mining 2004-2005 Master TKI Antal van den Bosch en Walter Daelemans http://ilk.uvt.nl/~antalb/textmining/ Dinsdag, 10.45-12.30, SZ33 Timeline (1) [1 februari 2005] Introductie (WD) [15 februari 2005]
AN OPEN KNOWLEDGE BASE FOR ITALIAN LANGUAGE IN A COLLABORATIVE PERSPECTIVE
AN OPEN KNOWLEDGE BASE FOR ITALIAN LANGUAGE IN A COLLABORATIVE PERSPECTIVE Chiari I, A. Gangemi, E. Jezek, A. Oltramari, G. Vetere, L. Vieu http://www.sensocomune.it/ Sapienza Università di Roma Université
Evalita 09 Parsing Task: constituency parsers and the Penn format for Italian
Evalita 09 Parsing Task: constituency parsers and the Penn format for Italian Cristina Bosco, Alessandro Mazzei, and Vincenzo Lombardo Dipartimento di Informatica, Università di Torino, Corso Svizzera
TRANSLATING POLISH TEXTS INTO SIGN LANGUAGE IN THE TGT SYSTEM
20th IASTED International Multi-Conference Applied Informatics AI 2002, Innsbruck, Austria 2002, pp. 282-287 TRANSLATING POLISH TEXTS INTO SIGN LANGUAGE IN THE TGT SYSTEM NINA SUSZCZAŃSKA, PRZEMYSŁAW SZMAL,
Parsing Technology and its role in Legacy Modernization. A Metaware White Paper
Parsing Technology and its role in Legacy Modernization A Metaware White Paper 1 INTRODUCTION In the two last decades there has been an explosion of interest in software tools that can automate key tasks
Elena Baralis, Silvia Chiusano Politecnico di Torino. Pag. 1. Query optimization. DBMS Architecture. Query optimizer. Query optimizer.
DBMS Architecture INSTRUCTION OPTIMIZER Database Management Systems MANAGEMENT OF ACCESS METHODS BUFFER MANAGER CONCURRENCY CONTROL RELIABILITY MANAGEMENT Index Files Data Files System Catalog BASE It
Hybrid Strategies. for better products and shorter time-to-market
Hybrid Strategies for better products and shorter time-to-market Background Manufacturer of language technology software & services Spin-off of the research center of Germany/Heidelberg Founded in 1999,
Application of Natural Language Interface to a Machine Translation Problem
Application of Natural Language Interface to a Machine Translation Problem Heidi M. Johnson Yukiko Sekine John S. White Martin Marietta Corporation Gil C. Kim Korean Advanced Institute of Science and Technology
Social Media Text and Predictive Analytics
Social Media Text and Predictive Analytics Invited Talk at Big Data Everywhere Cambridge, MA June 23, 2015 Dr. Subrata Das Machine Analytics, Inc., Cambridge, MA www.machineanalytics.com Outline Analytics
A Survey of ASL Tenses
A Survey of ASL Tenses Karen Alkoby DePaul University School of Computer Science Chicago, IL [email protected] Abstract This paper examines tenses in American Sign Language (ASL), which will be
GATE Mímir and cloud services. Multi-paradigm indexing and search tool Pay-as-you-go large-scale annotation
GATE Mímir and cloud services Multi-paradigm indexing and search tool Pay-as-you-go large-scale annotation GATE Mímir GATE Mímir is an indexing system for GATE documents. Mímir can index: Text: the original
Learning Translation Rules from Bilingual English Filipino Corpus
Proceedings of PACLIC 19, the 19 th Asia-Pacific Conference on Language, Information and Computation. Learning Translation s from Bilingual English Filipino Corpus Michelle Wendy Tan, Raymond Joseph Ang,
Customizing an English-Korean Machine Translation System for Patent Translation *
Customizing an English-Korean Machine Translation System for Patent Translation * Sung-Kwon Choi, Young-Gil Kim Natural Language Processing Team, Electronics and Telecommunications Research Institute,
How to make Ontologies self-building from Wiki-Texts
How to make Ontologies self-building from Wiki-Texts Bastian HAARMANN, Frederike GOTTSMANN, and Ulrich SCHADE Fraunhofer Institute for Communication, Information Processing & Ergonomics Neuenahrer Str.
Semantic Analysis of Natural Language Queries Using Domain Ontology for Information Access from Database
I.J. Intelligent Systems and Applications, 2013, 12, 81-90 Published Online November 2013 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijisa.2013.12.07 Semantic Analysis of Natural Language Queries
Using the BNC to create and develop educational materials and a website for learners of English
Using the BNC to create and develop educational materials and a website for learners of English Danny Minn a, Hiroshi Sano b, Marie Ino b and Takahiro Nakamura c a Kitakyushu University b Tokyo University
Processing: current projects and research at the IXA Group
Natural Language Processing: current projects and research at the IXA Group IXA Research Group on NLP University of the Basque Country Xabier Artola Zubillaga Motivation A language that seeks to survive
How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.
Svetlana Sokolova President and CEO of PROMT, PhD. How the Computer Translates Machine translation is a special field of computer application where almost everyone believes that he/she is a specialist.
M3039 MPEG 97/ January 1998
INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND ASSOCIATED AUDIO INFORMATION ISO/IEC JTC1/SC29/WG11 M3039
Paraphrasing controlled English texts
Paraphrasing controlled English texts Kaarel Kaljurand Institute of Computational Linguistics, University of Zurich [email protected] Abstract. We discuss paraphrasing controlled English texts, by defining
Motivation. Korpus-Abfrage: Werkzeuge und Sprachen. Overview. Languages of Corpus Query. SARA Query Possibilities 1
Korpus-Abfrage: Werkzeuge und Sprachen Gastreferat zur Vorlesung Korpuslinguistik mit und für Computerlinguistik Charlotte Merz 3. Dezember 2002 Motivation Lizentiatsarbeit: A Corpus Query Tool for Automatically
NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR
NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR Arati K. Deshpande 1 and Prakash. R. Devale 2 1 Student and 2 Professor & Head, Department of Information Technology, Bharati
CHARACTERISTICS FOR STUDENTS WITH: LIMITED ENGLISH PROFICIENCY (LEP)
CHARACTERISTICS FOR STUDENTS WITH: LIMITED ENGLISH PROFICIENCY (LEP) Research has shown that students acquire a second language in the same way that they acquire the first language. It is an exploratory
LANGUAGE! 4 th Edition, Levels A C, correlated to the South Carolina College and Career Readiness Standards, Grades 3 5
Page 1 of 57 Grade 3 Reading Literary Text Principles of Reading (P) Standard 1: Demonstrate understanding of the organization and basic features of print. Standard 2: Demonstrate understanding of spoken
Presented to The Federal Big Data Working Group Meetup On 07 June 2014 By Chuck Rehberg, CTO Semantic Insights a Division of Trigent Software
Semantic Research using Natural Language Processing at Scale; A continued look behind the scenes of Semantic Insights Research Assistant and Research Librarian Presented to The Federal Big Data Working
Using NLP and Ontologies for Notary Document Management Systems
Outline Using NLP and Ontologies for Notary Document Management Systems Flora Amato, Antonino Mazzeo, Antonio Penta and Antonio Picariello Dipartimento di Informatica e Sistemistica Universitá di Napoli
The PALAVRAS parser and its Linguateca applications - a mutually productive relationship
The PALAVRAS parser and its Linguateca applications - a mutually productive relationship Eckhard Bick University of Southern Denmark [email protected] Outline Flow chart Linguateca Palavras History
Schema documentation for types1.2.xsd
Generated with oxygen XML Editor Take care of the environment, print only if necessary! 8 february 2011 Table of Contents : ""...........................................................................................................
Computer Assisted Language Learning (CALL): Room for CompLing? Scott, Stella, Stacia
Computer Assisted Language Learning (CALL): Room for CompLing? Scott, Stella, Stacia Outline I What is CALL? (scott) II Popular language learning sites (stella) Livemocha.com (stacia) III IV Specific sites
Kybots, knowledge yielding robots German Rigau IXA group, UPV/EHU http://ixa.si.ehu.es
KYOTO () Intelligent Content and Semantics Knowledge Yielding Ontologies for Transition-Based Organization http://www.kyoto-project.eu/ Kybots, knowledge yielding robots German Rigau IXA group, UPV/EHU
Computer Standards & Interfaces
Computer Standards & Interfaces 35 (2013) 470 481 Contents lists available at SciVerse ScienceDirect Computer Standards & Interfaces journal homepage: www.elsevier.com/locate/csi How to make a natural
Domain Specific Sign Language Animation for Virtual Characters
Domain Specific Sign Language Animation for Virtual Characters.. Anonymous Keywords: Abstract: Animation, Sign Language, Avatar, News Generation, Automatic This paper describes the design, implementation
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 INTELLIGENT MULTIDIMENSIONAL DATABASE INTERFACE Mona Gharib Mohamed Reda Zahraa E. Mohamed Faculty of Science,
THE BACHELOR S DEGREE IN SPANISH
Academic regulations for THE BACHELOR S DEGREE IN SPANISH THE FACULTY OF HUMANITIES THE UNIVERSITY OF AARHUS 2007 1 Framework conditions Heading Title Prepared by Effective date Prescribed points Text
D2.4: Two trained semantic decoders for the Appointment Scheduling task
D2.4: Two trained semantic decoders for the Appointment Scheduling task James Henderson, François Mairesse, Lonneke van der Plas, Paola Merlo Distribution: Public CLASSiC Computational Learning in Adaptive
Package wordnet. January 6, 2016
Title WordNet Interface Version 0.1-11 Package wordnet January 6, 2016 An interface to WordNet using the Jawbone Java API to WordNet. WordNet () is a large lexical database
An Overview of Applied Linguistics
An Overview of Applied Linguistics Edited by: Norbert Schmitt Abeer Alharbi What is Linguistics? It is a scientific study of a language It s goal is To describe the varieties of languages and explain the
Outline of today s lecture
Outline of today s lecture Generative grammar Simple context free grammars Probabilistic CFGs Formalism power requirements Parsing Modelling syntactic structure of phrases and sentences. Why is it useful?
A chart generator for the Dutch Alpino grammar
June 10, 2009 Introduction Parsing: determining the grammatical structure of a sentence. Semantics: a parser can build a representation of meaning (semantics) as a side-effect of parsing a sentence. Generation:
Compiler I: Syntax Analysis Human Thought
Course map Compiler I: Syntax Analysis Human Thought Abstract design Chapters 9, 12 H.L. Language & Operating Sys. Compiler Chapters 10-11 Virtual Machine Software hierarchy Translator Chapters 7-8 Assembly
Survey Results: Requirements and Use Cases for Linguistic Linked Data
Survey Results: Requirements and Use Cases for Linguistic Linked Data 1 Introduction This survey was conducted by the FP7 Project LIDER (http://www.lider-project.eu/) as input into the W3C Community Group
Latin WordNet project
Latin WordNet project Stefano Minozzi Laboratorio di Informatica Umanistica Università degli Studi di Verona Latin WordNet project Laboratorio di Informatica Umanistica Università degli Studi di Verona
A System for Labeling Self-Repairs in Speech 1
A System for Labeling Self-Repairs in Speech 1 John Bear, John Dowding, Elizabeth Shriberg, Patti Price 1. Introduction This document outlines a system for labeling self-repairs in spontaneous speech.
Automated Extraction of Security Policies from Natural-Language Software Documents
Automated Extraction of Security Policies from Natural-Language Software Documents Xusheng Xiao 1 Amit Paradkar 2 Suresh Thummalapenta 3 Tao Xie 1 1 Dept. of Computer Science, North Carolina State University,
Interactive Dynamic Information Extraction
Interactive Dynamic Information Extraction Kathrin Eichler, Holmer Hemsen, Markus Löckelt, Günter Neumann, and Norbert Reithinger Deutsches Forschungszentrum für Künstliche Intelligenz - DFKI, 66123 Saarbrücken
Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words
, pp.290-295 http://dx.doi.org/10.14257/astl.2015.111.55 Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words Irfan
Distributed Database for Environmental Data Integration
Distributed Database for Environmental Data Integration A. Amato', V. Di Lecce2, and V. Piuri 3 II Engineering Faculty of Politecnico di Bari - Italy 2 DIASS, Politecnico di Bari, Italy 3Dept Information
INFORMING A INFORMATION DISCOVERY TOOL FOR USING GESTURE
INFORMING A INFORMATION DISCOVERY TOOL FOR USING GESTURE Luís Manuel Borges Gouveia Feliz Ribeiro Gouveia {lmbg, fribeiro}@ufp.pt Centro de Recursos Multimediáticos Universidade Fernando Pessoa Porto -
Web Mining. Margherita Berardi LACAM. Dipartimento di Informatica Università degli Studi di Bari [email protected]
Web Mining Margherita Berardi LACAM Dipartimento di Informatica Università degli Studi di Bari [email protected] Bari, 24 Aprile 2003 Overview Introduction Knowledge discovery from text (Web Content
Open Domain Information Extraction. Günter Neumann, DFKI, 2012
Open Domain Information Extraction Günter Neumann, DFKI, 2012 Improving TextRunner Wu and Weld (2010) Open Information Extraction using Wikipedia, ACL 2010 Fader et al. (2011) Identifying Relations for
Morphology. Morphology is the study of word formation, of the structure of words. 1. some words can be divided into parts which still have meaning
Morphology Morphology is the study of word formation, of the structure of words. Some observations about words and their structure: 1. some words can be divided into parts which still have meaning 2. many
Language and Computation
Language and Computation week 13, Thursday, April 24 Tamás Biró Yale University [email protected] http://www.birot.hu/courses/2014-lc/ Tamás Biró, Yale U., Language and Computation p. 1 Practical matters
LESSON THIRTEEN STRUCTURAL AMBIGUITY. Structural ambiguity is also referred to as syntactic ambiguity or grammatical ambiguity.
LESSON THIRTEEN STRUCTURAL AMBIGUITY Structural ambiguity is also referred to as syntactic ambiguity or grammatical ambiguity. Structural or syntactic ambiguity, occurs when a phrase, clause or sentence
Search and Data Mining: Techniques. Text Mining Anya Yarygina Boris Novikov
Search and Data Mining: Techniques Text Mining Anya Yarygina Boris Novikov Introduction Generally used to denote any system that analyzes large quantities of natural language text and detects lexical or
CSCI 3136 Principles of Programming Languages
CSCI 3136 Principles of Programming Languages Faculty of Computer Science Dalhousie University Winter 2013 CSCI 3136 Principles of Programming Languages Faculty of Computer Science Dalhousie University
Classification of Natural Language Interfaces to Databases based on the Architectures
Volume 1, No. 11, ISSN 2278-1080 The International Journal of Computer Science & Applications (TIJCSA) RESEARCH PAPER Available Online at http://www.journalofcomputerscience.com/ Classification of Natural
Extraction of Legal Definitions from a Japanese Statutory Corpus Toward Construction of a Legal Term Ontology
Extraction of Legal Definitions from a Japanese Statutory Corpus Toward Construction of a Legal Term Ontology Makoto Nakamura, Yasuhiro Ogawa, Katsuhiko Toyama Japan Legal Information Institute, Graduate
Generating SQL Queries Using Natural Language Syntactic Dependencies and Metadata
Generating SQL Queries Using Natural Language Syntactic Dependencies and Metadata Alessandra Giordani and Alessandro Moschitti Department of Computer Science and Engineering University of Trento Via Sommarive
Comprendium Translator System Overview
Comprendium System Overview May 2004 Table of Contents 1. INTRODUCTION...3 2. WHAT IS MACHINE TRANSLATION?...3 3. THE COMPRENDIUM MACHINE TRANSLATION TECHNOLOGY...4 3.1 THE BEST MT TECHNOLOGY IN THE MARKET...4
Symbiosis of Evolutionary Techniques and Statistical Natural Language Processing
1 Symbiosis of Evolutionary Techniques and Statistical Natural Language Processing Lourdes Araujo Dpto. Sistemas Informáticos y Programación, Univ. Complutense, Madrid 28040, SPAIN (email: [email protected])
Parsing Software Requirements with an Ontology-based Semantic Role Labeler
Parsing Software Requirements with an Ontology-based Semantic Role Labeler Michael Roth University of Edinburgh [email protected] Ewan Klein University of Edinburgh [email protected] Abstract Software
Teaching Methodology for 3D Animation
Abstract The field of 3d animation has addressed design processes and work practices in the design disciplines for in recent years. There are good reasons for considering the development of systematic
Modern foreign languages
Modern foreign languages Programme of study for key stage 3 and attainment targets (This is an extract from The National Curriculum 2007) Crown copyright 2007 Qualifications and Curriculum Authority 2007
a Chinese-to-Spanish rule-based machine translation
Chinese-to-Spanish rule-based machine translation system Jordi Centelles 1 and Marta R. Costa-jussà 2 1 Centre de Tecnologies i Aplicacions del llenguatge i la Parla (TALP), Universitat Politècnica de
COURSE SYLLABUS ESU 561 ASPECTS OF THE ENGLISH LANGUAGE. Fall 2014
COURSE SYLLABUS ESU 561 ASPECTS OF THE ENGLISH LANGUAGE Fall 2014 EDU 561 (85515) Instructor: Bart Weyand Classroom: Online TEL: (207) 985-7140 E-Mail: [email protected] COURSE DESCRIPTION: This is a practical
Empirical Machine Translation and its Evaluation
Empirical Machine Translation and its Evaluation EAMT Best Thesis Award 2008 Jesús Giménez (Advisor, Lluís Màrquez) Universitat Politècnica de Catalunya May 28, 2010 Empirical Machine Translation Empirical
Master Degree Project Ideas (Fall 2014) Proposed By Faculty Department of Information Systems College of Computer Sciences and Information Technology
Master Degree Project Ideas (Fall 2014) Proposed By Faculty Department of Information Systems College of Computer Sciences and Information Technology 1 P age Dr. Maruf Hasan MS CIS Program Potential Project
Culture and Language. What We Say Influences What We Think, What We Feel and What We Believe
Culture and Language What We Say Influences What We Think, What We Feel and What We Believe Unique Human Ability Ability to create and use language is the most distinctive feature of humans Humans learn
Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives
Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives Ramona Enache and Adam Slaski Department of Computer Science and Engineering Chalmers University of Technology and
Architecture of an Ontology-Based Domain- Specific Natural Language Question Answering System
Architecture of an Ontology-Based Domain- Specific Natural Language Question Answering System Athira P. M., Sreeja M. and P. C. Reghuraj Department of Computer Science and Engineering, Government Engineering
Why major in linguistics (and what does a linguist do)?
Why major in linguistics (and what does a linguist do)? Written by Monica Macaulay and Kristen Syrett What is linguistics? If you are considering a linguistics major, you probably already know at least
An Introduction to Language Processing with Perl and Prolog
An Introduction to Language Processing with Perl and Prolog Pierre Nugues Lund University [email protected] http://www.cs.lth.se/home/pierre_nugues/ Pierre Nugues An Introduction to Language Processing
Transformation of Free-text Electronic Health Records for Efficient Information Retrieval and Support of Knowledge Discovery
Transformation of Free-text Electronic Health Records for Efficient Information Retrieval and Support of Knowledge Discovery Jan Paralic, Peter Smatana Technical University of Kosice, Slovakia Center for
