Empirical Machine Translation and its Evaluation
|
|
|
- Erin Fitzgerald
- 10 years ago
- Views:
Transcription
1 Empirical Machine Translation and its Evaluation EAMT Best Thesis Award 2008 Jesús Giménez (Advisor, Lluís Màrquez) Universitat Politècnica de Catalunya May 28, 2010
2 Empirical Machine Translation
3 Empirical Machine Translation
4 Empirical Machine Translation
5 Empirical Machine Translation
6 Empirical Machine Translation
7 Discriminative Phrase Translation A brilliant play written by William Locke Una obra brillante escrita por William Locke vs. A brilliant play by Lionel Messi that produced a wonderful goal Una brillante jugada de Lionel Messi que resultó en un bello gol
8 Discriminative Phrase Translation A brilliant play written by William Locke Una obra brillante escrita por William Locke vs. A brilliant play by Lionel Messi that produced a wonderful goal Una brillante jugada de Lionel Messi que resultó en un bello gol
9 Discriminative Phrase Translation A brilliant play written by William Locke Una obra brillante escrita por William Locke vs. A brilliant play by Lionel Messi that produced a wonderful goal Una brillante jugada de Lionel Messi que resultó en un bello gol
10 Discriminative Phrase Translation A brilliant play written by William Locke Una obra brillante escrita por William Locke vs. A brilliant play by Lionel Messi that produced a wonderful goal Una brillante jugada de Lionel Messi que resultó en un bello gol
11 Discriminative Phrase Translation A brilliant play written by William Locke Una obra brillante escrita por William Locke vs. A brilliant play by Lionel Messi that produced a wonderful goal Una brillante jugada de Lionel Messi que resultó en un bello gol
12 Discriminative Phrase Translation A brilliant play written by William Locke Una obra brillante escrita por William Locke vs. A brilliant play by Lionel Messi that produced a wonderful goal Una brillante jugada de Lionel Messi que resultó en un bello gol
13 Discriminative Phrase Translation Idea Use discriminative Machine Learning techniques in SMT to estimate P(e i f j ), actually, P(e i f j, context f j ) Translation modeling is addressed as a classification problem
14 Discriminative Phrase Translation Idea Use discriminative Machine Learning techniques in SMT to estimate P(e i f j ), actually, P(e i f j, context f j ) Translation modeling is addressed as a classification problem
15 Contributions 1 Discriminative Phrase Selection for SMT Shallow-syntactic features word, lemma, parts of speech, chunking local / global context Improved translation quality 2 Domain Dependence in SMT Parliament proceedings Dictionary definitions Adaptation based on: EuroWordNet out-of-domain data a small amount of in-domain data Improved translation quality
16 Contributions 1 Discriminative Phrase Selection for SMT Shallow-syntactic features word, lemma, parts of speech, chunking local / global context Improved translation quality 2 Domain Dependence in SMT Parliament proceedings Dictionary definitions Adaptation based on: EuroWordNet out-of-domain data a small amount of in-domain data Improved translation quality
17 Contributions 1 Discriminative Phrase Selection for SMT Shallow-syntactic features word, lemma, parts of speech, chunking local / global context Improved translation quality 2 Domain Dependence in SMT Parliament proceedings Dictionary definitions Adaptation based on: EuroWordNet out-of-domain data a small amount of in-domain data Improved translation quality
18 Contributions 1 Discriminative Phrase Selection for SMT Shallow-syntactic features word, lemma, parts of speech, chunking local / global context Improved translation quality 2 Domain Dependence in SMT Parliament proceedings Dictionary definitions Adaptation based on: EuroWordNet out-of-domain data a small amount of in-domain data Improved translation quality
19 ... and its Evaluation
20 ... and its Evaluation
21 ... and its Evaluation
22 Limits of Lexical Similarity Candidate On Tuesday several missiles and mortar Translation shells fell in southern Israel, but there were no casualties. Reference Several Qassam rockets and mortar shells Translation fell today, Tuesday, in southern Israel without causing any casualties. Only one 4-gram in common!
23 Limits of Lexical Similarity Candidate On Tuesday several missiles and mortar Translation shells fell in southern Israel, but there were no casualties. Reference Several Qassam rockets and mortar shells Translation fell today, Tuesday, in southern Israel without causing any casualties. Only one 4-gram in common!
24 Limits of Lexical Similarity Candidate On Tuesday several missiles and mortar Translation shells fell in southern Israel, but there were no casualties. Reference Several Qassam rockets and mortar shells Translation fell today, Tuesday, in southern Israel without causing any casualties. Only one 4-gram in common!
25 Linguistic Features for Automatic MT Evaluation Idea Define similarity measures based on deeper linguistic information Compare linguistic structures and their lexical realizations Linguistic levels Syntax Parts-of-speech Base phrase chunks Phrase constituents Dependency relationships Semantics Named entities Semantic roles Discourse representations
26 Linguistic Features for Automatic MT Evaluation S PP TMP S. On NP NP A1 VP, but S Tuesday several missiles and mortar shells fell PP LOC NP VP in NP there were NP southern Israel no casualties
27 Linguistic Features for Automatic MT Evaluation S PP TMP S. On NP NP A1 VP, but S Tuesday several missiles and mortar shells fell PP LOC NP VP in NP there were NP southern Israel no casualties
28 Linguistic Features for Automatic MT Evaluation S NP A1 A0 VP. NP and NP fell NP PP LOC PP ADV Several Qassam rockets mortar shells NP TMP, NP, in NP without S today Tuesday southern VP Israel causing NP A1 any casualties
29 Linguistic Features for Automatic MT Evaluation S NP A1 A0 VP. NP and NP fell NP PP LOC PP ADV Several Qassam rockets mortar shells NP TMP, NP, in NP without S today Tuesday southern VP Israel causing NP A1 any casualties
30 Contributions 1 Linguistic measures provide more reliable rankings when the systems under evaluation are based on different paradigms (statistical vs. rule-based, fully-automatic vs. human-aided) 2 Linguistic measures have proven effective in shared tasks (WMT ) both at the system and sentence levels 3 Some linguistic measures suffer a substantial quality decrease at the sentence level (due to parsing errors!) 4 Lexical and Linguistic measures are complementary suitable for being combined!
31 Contributions 1 Linguistic measures provide more reliable rankings when the systems under evaluation are based on different paradigms (statistical vs. rule-based, fully-automatic vs. human-aided) 2 Linguistic measures have proven effective in shared tasks (WMT ) both at the system and sentence levels 3 Some linguistic measures suffer a substantial quality decrease at the sentence level (due to parsing errors!) 4 Lexical and Linguistic measures are complementary suitable for being combined!
32 Contributions 1 Linguistic measures provide more reliable rankings when the systems under evaluation are based on different paradigms (statistical vs. rule-based, fully-automatic vs. human-aided) 2 Linguistic measures have proven effective in shared tasks (WMT ) both at the system and sentence levels 3 Some linguistic measures suffer a substantial quality decrease at the sentence level (due to parsing errors!) 4 Lexical and Linguistic measures are complementary suitable for being combined!
33 Contributions 1 Linguistic measures provide more reliable rankings when the systems under evaluation are based on different paradigms (statistical vs. rule-based, fully-automatic vs. human-aided) 2 Linguistic measures have proven effective in shared tasks (WMT ) both at the system and sentence levels 3 Some linguistic measures suffer a substantial quality decrease at the sentence level (due to parsing errors!) 4 Lexical and Linguistic measures are complementary suitable for being combined!
34 Contributions 1 Linguistic measures provide more reliable rankings when the systems under evaluation are based on different paradigms (statistical vs. rule-based, fully-automatic vs. human-aided) 2 Linguistic measures have proven effective in shared tasks (WMT ) both at the system and sentence levels 3 Some linguistic measures suffer a substantial quality decrease at the sentence level (due to parsing errors!) 4 Lexical and Linguistic measures are complementary suitable for being combined!
35 Acknowledgements 1 Spanish Government Ministry of Science and Technology Ministry of Education 2 Organizers and participants of the NIST, WMT and IWSLT Evaluation Campaigns 3 A number of NLP researchers worldwide sharing their software 4 Enrique Amigó and German Rigau
36 Acknowledgements 1 Spanish Government Ministry of Science and Technology Ministry of Education 2 Organizers and participants of the NIST, WMT and IWSLT Evaluation Campaigns 3 A number of NLP researchers worldwide sharing their software 4 Enrique Amigó and German Rigau
37 Acknowledgements 1 Spanish Government Ministry of Science and Technology Ministry of Education 2 Organizers and participants of the NIST, WMT and IWSLT Evaluation Campaigns 3 A number of NLP researchers worldwide sharing their software 4 Enrique Amigó and German Rigau
38 Acknowledgements 1 Spanish Government Ministry of Science and Technology Ministry of Education 2 Organizers and participants of the NIST, WMT and IWSLT Evaluation Campaigns 3 A number of NLP researchers worldwide sharing their software 4 Enrique Amigó and German Rigau
39 Empirical Machine Translation and its Evaluation EAMT Best Thesis Award 2008 Thanks!
HIERARCHICAL HYBRID TRANSLATION BETWEEN ENGLISH AND GERMAN
HIERARCHICAL HYBRID TRANSLATION BETWEEN ENGLISH AND GERMAN Yu Chen, Andreas Eisele DFKI GmbH, Saarbrücken, Germany May 28, 2010 OUTLINE INTRODUCTION ARCHITECTURE EXPERIMENTS CONCLUSION SMT VS. RBMT [K.
Introduction. Philipp Koehn. 28 January 2016
Introduction Philipp Koehn 28 January 2016 Administrativa 1 Class web site: http://www.mt-class.org/jhu/ Tuesdays and Thursdays, 1:30-2:45, Hodson 313 Instructor: Philipp Koehn (with help from Matt Post)
Hybrid Machine Translation Guided by a Rule Based System
Hybrid Machine Translation Guided by a Rule Based System Cristina España-Bonet, Gorka Labaka, Arantza Díaz de Ilarraza, Lluís Màrquez Kepa Sarasola Universitat Politècnica de Catalunya University of the
Multi-Engine Machine Translation by Recursive Sentence Decomposition
Multi-Engine Machine Translation by Recursive Sentence Decomposition Bart Mellebeek Karolina Owczarzak Josef van Genabith Andy Way National Centre for Language Technology School of Computing Dublin City
The XMU Phrase-Based Statistical Machine Translation System for IWSLT 2006
The XMU Phrase-Based Statistical Machine Translation System for IWSLT 2006 Yidong Chen, Xiaodong Shi Institute of Artificial Intelligence Xiamen University P. R. China November 28, 2006 - Kyoto 13:46 1
Computer Aided Translation
Computer Aided Translation Philipp Koehn 30 April 2015 Why Machine Translation? 1 Assimilation reader initiates translation, wants to know content user is tolerant of inferior quality focus of majority
Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg
Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg March 1, 2007 The catalogue is organized into sections of (1) obligatory modules ( Basismodule ) that
Why Evaluation? Machine Translation. Evaluation. Evaluation Metrics. Ten Translations of a Chinese Sentence. How good is a given system?
Why Evaluation? How good is a given system? Machine Translation Evaluation Which one is the best system for our purpose? How much did we improve our system? How can we tune our system to become better?
Machine Translation. Why Evaluation? Evaluation. Ten Translations of a Chinese Sentence. Evaluation Metrics. But MT evaluation is a di cult problem!
Why Evaluation? How good is a given system? Which one is the best system for our purpose? How much did we improve our system? How can we tune our system to become better? But MT evaluation is a di cult
Statistical Machine Translation
Statistical Machine Translation Some of the content of this lecture is taken from previous lectures and presentations given by Philipp Koehn and Andy Way. Dr. Jennifer Foster National Centre for Language
Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives
Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives Ramona Enache and Adam Slaski Department of Computer Science and Engineering Chalmers University of Technology and
INF5820 Natural Language Processing - NLP. H2009 Jan Tore Lønning [email protected]
INF5820 Natural Language Processing - NLP H2009 Jan Tore Lønning [email protected] Semantic Role Labeling INF5830 Lecture 13 Nov 4, 2009 Today Some words about semantics Thematic/semantic roles PropBank &
Hybrid Strategies. for better products and shorter time-to-market
Hybrid Strategies for better products and shorter time-to-market Background Manufacturer of language technology software & services Spin-off of the research center of Germany/Heidelberg Founded in 1999,
CS 6740 / INFO 6300. Ad-hoc IR. Graduate-level introduction to technologies for the computational treatment of information in humanlanguage
CS 6740 / INFO 6300 Advanced d Language Technologies Graduate-level introduction to technologies for the computational treatment of information in humanlanguage form, covering natural-language processing
Big Data Adaptation DatAptor Dr Khalil Sima'an
Big Data Adaptation DatAptor Dr Khalil Sima'an an Institute for Logic, Language and Computation University of Amsterdam The Netherlands MT at ILLC-UvA Statistical Language Processing and Learning Lab.
SYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 统
SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems Jin Yang, Satoshi Enoue Jean Senellart, Tristan Croiset SYSTRAN Software, Inc. SYSTRAN SA 9333 Genesee Ave. Suite PL1 La Grande
Appraise: an Open-Source Toolkit for Manual Evaluation of MT Output
Appraise: an Open-Source Toolkit for Manual Evaluation of MT Output Christian Federmann Language Technology Lab, German Research Center for Artificial Intelligence, Stuhlsatzenhausweg 3, D-66123 Saarbrücken,
CINTIL-PropBank. CINTIL-PropBank Sub-corpus id Sentences Tokens Domain Sentences for regression atsts 779 5,654 Test
CINTIL-PropBank I. Basic Information 1.1. Corpus information The CINTIL-PropBank (Branco et al., 2012) is a set of sentences annotated with their constituency structure and semantic role tags, composed
Collaborative Machine Translation Service for Scientific texts
Collaborative Machine Translation Service for Scientific texts Patrik Lambert [email protected] Jean Senellart Systran SA [email protected] Laurent Romary Humboldt Universität Berlin
Comprendium Translator System Overview
Comprendium System Overview May 2004 Table of Contents 1. INTRODUCTION...3 2. WHAT IS MACHINE TRANSLATION?...3 3. THE COMPRENDIUM MACHINE TRANSLATION TECHNOLOGY...4 3.1 THE BEST MT TECHNOLOGY IN THE MARKET...4
Kybots, knowledge yielding robots German Rigau IXA group, UPV/EHU http://ixa.si.ehu.es
KYOTO () Intelligent Content and Semantics Knowledge Yielding Ontologies for Transition-Based Organization http://www.kyoto-project.eu/ Kybots, knowledge yielding robots German Rigau IXA group, UPV/EHU
Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information
Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information Satoshi Sekine Computer Science Department New York University [email protected] Kapil Dalwani Computer Science Department
BILINGUAL TRANSLATION SYSTEM
BILINGUAL TRANSLATION SYSTEM (FOR ENGLISH AND TAMIL) Dr. S. Saraswathi Associate Professor M. Anusiya P. Kanivadhana S. Sathiya Abstract--- The project aims in developing Bilingual Translation System for
Machine translation techniques for presentation of summaries
Grant Agreement Number: 257528 KHRESMOI www.khresmoi.eu Machine translation techniques for presentation of summaries Deliverable number D4.6 Dissemination level Public Delivery date April 2014 Status Author(s)
Factored Translation Models
Factored Translation s Philipp Koehn and Hieu Hoang [email protected], [email protected] School of Informatics University of Edinburgh 2 Buccleuch Place, Edinburgh EH8 9LW Scotland, United Kingdom
SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 SYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 CWMT2011 技 术 报 告
SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 Jin Yang and Satoshi Enoue SYSTRAN Software, Inc. 4444 Eastgate Mall, Suite 310 San Diego, CA 92121, USA E-mail:
Semantic analysis of text and speech
Semantic analysis of text and speech SGN-9206 Signal processing graduate seminar II, Fall 2007 Anssi Klapuri Institute of Signal Processing, Tampere University of Technology, Finland Outline What is semantic
IRIS - English-Irish Translation System
IRIS - English-Irish Translation System Mihael Arcan, Unit for Natural Language Processing of the Insight Centre for Data Analytics at the National University of Ireland, Galway Introduction about me,
Machine Translation at the European Commission
Directorate-General for Translation Machine Translation at the European Commission Konferenz 10 Jahre Verbmobil Saarbrücken, 16. November 2010 Andreas Eisele Project Manager Machine Translation, ICT Unit
Machine Translation and the Translator
Machine Translation and the Translator Philipp Koehn 8 April 2015 About me 1 Professor at Johns Hopkins University (US), University of Edinburgh (Scotland) Author of textbook on statistical machine translation
Customizing an English-Korean Machine Translation System for Patent Translation *
Customizing an English-Korean Machine Translation System for Patent Translation * Sung-Kwon Choi, Young-Gil Kim Natural Language Processing Team, Electronics and Telecommunications Research Institute,
Jane 2: Open Source Phrase-based and Hierarchical Statistical Machine Translation
Jane 2: Open Source Phrase-based and Hierarchical Statistical Machine Translation Joern Wuebker M at thias Huck Stephan Peitz M al te Nuhn M arkus F reitag Jan-Thorsten Peter Saab M ansour Hermann N e
Overview of MT techniques. Malek Boualem (FT)
Overview of MT techniques Malek Boualem (FT) This section presents an standard overview of general aspects related to machine translation with a description of different techniques: bilingual, transfer,
PROMT Technologies for Translation and Big Data
PROMT Technologies for Translation and Big Data Overview and Use Cases Julia Epiphantseva PROMT About PROMT EXPIRIENCED Founded in 1991. One of the world leading machine translation provider DIVERSIFIED
Language and Computation
Language and Computation week 13, Thursday, April 24 Tamás Biró Yale University [email protected] http://www.birot.hu/courses/2014-lc/ Tamás Biró, Yale U., Language and Computation p. 1 Practical matters
Paraphrasing controlled English texts
Paraphrasing controlled English texts Kaarel Kaljurand Institute of Computational Linguistics, University of Zurich [email protected] Abstract. We discuss paraphrasing controlled English texts, by defining
Report on the embedding and evaluation of the second MT pilot
Report on the embedding and evaluation of the second MT pilot quality translation by deep language engineering approaches DELIVERABLE D3.10 VERSION 1.6 2015-11-02 P2 QTLeap Machine translation is a computational
Speech understanding in dialogue systems
Speech understanding in dialogue systems Sergio Grau Puerto [email protected] Departament de Sistemes Informàtics i Computació Universitat Politècnica de València Sergio Grau Puerto. Carnegie Mellon: June
An Approach to Handle Idioms and Phrasal Verbs in English-Tamil Machine Translation System
An Approach to Handle Idioms and Phrasal Verbs in English-Tamil Machine Translation System Thiruumeni P G, Anand Kumar M Computational Engineering & Networking, Amrita Vishwa Vidyapeetham, Coimbatore,
Learning Translation Rules from Bilingual English Filipino Corpus
Proceedings of PACLIC 19, the 19 th Asia-Pacific Conference on Language, Information and Computation. Learning Translation s from Bilingual English Filipino Corpus Michelle Wendy Tan, Raymond Joseph Ang,
Automated Multilingual Text Analysis in the Europe Media Monitor (EMM) Ralf Steinberger. European Commission Joint Research Centre (JRC)
Automated Multilingual Text Analysis in the Europe Media Monitor (EMM) Ralf Steinberger European Commission Joint Research Centre (JRC) https://ec.europa.eu/jrc/en/research-topic/internet-surveillance-systems
What s in a Lexicon. The Lexicon. Lexicon vs. Dictionary. What kind of Information should a Lexicon contain?
What s in a Lexicon What kind of Information should a Lexicon contain? The Lexicon Miriam Butt November 2002 Semantic: information about lexical meaning and relations (thematic roles, selectional restrictions,
Deciphering Foreign Language
Deciphering Foreign Language NLP 1! Sujith Ravi and Kevin Knight [email protected], [email protected] Information Sciences Institute University of Southern California! 2 Statistical Machine Translation (MT) Current
Architecture of an Ontology-Based Domain- Specific Natural Language Question Answering System
Architecture of an Ontology-Based Domain- Specific Natural Language Question Answering System Athira P. M., Sreeja M. and P. C. Reghuraj Department of Computer Science and Engineering, Government Engineering
Machine Learning for natural language processing
Machine Learning for natural language processing Introduction Laura Kallmeyer Heinrich-Heine-Universität Düsseldorf Summer 2016 1 / 13 Introduction Goal of machine learning: Automatically learn how to
Context Grammar and POS Tagging
Context Grammar and POS Tagging Shian-jung Dick Chen Don Loritz New Technology and Research New Technology and Research LexisNexis LexisNexis Ohio, 45342 Ohio, 45342 [email protected] [email protected]
Introduction. BM1 Advanced Natural Language Processing. Alexander Koller. 17 October 2014
Introduction! BM1 Advanced Natural Language Processing Alexander Koller! 17 October 2014 Outline What is computational linguistics? Topics of this course Organizational issues Siri Text prediction Facebook
The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2
2nd International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2016) The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2 1 School of
Overview of the TACITUS Project
Overview of the TACITUS Project Jerry R. Hobbs Artificial Intelligence Center SRI International 1 Aims of the Project The specific aim of the TACITUS project is to develop interpretation processes for
Automatic slide assignation for language model adaptation
Automatic slide assignation for language model adaptation Applications of Computational Linguistics Adrià Agustí Martínez Villaronga May 23, 2013 1 Introduction Online multimedia repositories are rapidly
POSBIOTM-NER: A Machine Learning Approach for. Bio-Named Entity Recognition
POSBIOTM-NER: A Machine Learning Approach for Bio-Named Entity Recognition Yu Song, Eunji Yi, Eunju Kim, Gary Geunbae Lee, Department of CSE, POSTECH, Pohang, Korea 790-784 Soo-Jun Park Bioinformatics
Survey Results: Requirements and Use Cases for Linguistic Linked Data
Survey Results: Requirements and Use Cases for Linguistic Linked Data 1 Introduction This survey was conducted by the FP7 Project LIDER (http://www.lider-project.eu/) as input into the W3C Community Group
Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast
Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Hassan Sawaf Science Applications International Corporation (SAIC) 7990
How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.
Svetlana Sokolova President and CEO of PROMT, PhD. How the Computer Translates Machine translation is a special field of computer application where almost everyone believes that he/she is a specialist.
Application of Natural Language Interface to a Machine Translation Problem
Application of Natural Language Interface to a Machine Translation Problem Heidi M. Johnson Yukiko Sekine John S. White Martin Marietta Corporation Gil C. Kim Korean Advanced Institute of Science and Technology
Natural Language to Relational Query by Using Parsing Compiler
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,
Statistical Pattern-Based Machine Translation with Statistical French-English Machine Translation
Statistical Pattern-Based Machine Translation with Statistical French-English Machine Translation Jin'ichi Murakami, Takuya Nishimura, Masato Tokuhisa Tottori University, Japan Problems of Phrase-Based
NATURAL LANGUAGE QUERY PROCESSING USING SEMANTIC GRAMMAR
NATURAL LANGUAGE QUERY PROCESSING USING SEMANTIC GRAMMAR 1 Gauri Rao, 2 Chanchal Agarwal, 3 Snehal Chaudhry, 4 Nikita Kulkarni,, 5 Dr. S.H. Patil 1 Lecturer department o f Computer Engineering BVUCOE,
a Chinese-to-Spanish rule-based machine translation
Chinese-to-Spanish rule-based machine translation system Jordi Centelles 1 and Marta R. Costa-jussà 2 1 Centre de Tecnologies i Aplicacions del llenguatge i la Parla (TALP), Universitat Politècnica de
An Overview of Computational Advertising
An Overview of Computational Advertising Evgeniy Gabrilovich in collaboration with many colleagues throughout the company 1 What is Computational Advertising? New scientific sub-discipline that provides
Machine Translation. Agenda
Agenda Introduction to Machine Translation Data-driven statistical machine translation Translation models Parallel corpora Document-, sentence-, word-alignment Phrase-based translation MT decoding algorithm
LGPLLR : an open source license for NLP (Natural Language Processing) Sébastien Paumier. Université Paris-Est Marne-la-Vallée
LGPLLR : an open source license for NLP (Natural Language Processing) Sébastien Paumier Université Paris-Est Marne-la-Vallée [email protected] Penguin from http://tux.crystalxp.net/ 1 Linguistic data
The Challenge of Machine Translation of Patent Specifications and the Approach of the European Patent Office
The Challenge of Machine Translation of Patent Specifications and the Approach of the European Patent Office Georg Artelsmair Head of Department European Affairs/Member States European Patent Office Ottawa,
How to make Ontologies self-building from Wiki-Texts
How to make Ontologies self-building from Wiki-Texts Bastian HAARMANN, Frederike GOTTSMANN, and Ulrich SCHADE Fraunhofer Institute for Communication, Information Processing & Ergonomics Neuenahrer Str.
M LTO Multilingual On-Line Translation
O non multa, sed multum M LTO Multilingual On-Line Translation MOLTO Consortium FP7-247914 Project summary MOLTO s goal is to develop a set of tools for translating texts between multiple languages in
Statistical Machine Translation
Statistical Machine Translation What works and what does not Andreas Maletti Universität Stuttgart [email protected] Stuttgart May 14, 2013 Statistical Machine Translation A. Maletti 1 Main
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 INTELLIGENT MULTIDIMENSIONAL DATABASE INTERFACE Mona Gharib Mohamed Reda Zahraa E. Mohamed Faculty of Science,
Example-based Translation without Parallel Corpora: First experiments on a prototype
-based Translation without Parallel Corpora: First experiments on a prototype Vincent Vandeghinste, Peter Dirix and Ineke Schuurman Centre for Computational Linguistics Katholieke Universiteit Leuven Maria
Processing: current projects and research at the IXA Group
Natural Language Processing: current projects and research at the IXA Group IXA Research Group on NLP University of the Basque Country Xabier Artola Zubillaga Motivation A language that seeks to survive
Timeline (1) Text Mining 2004-2005 Master TKI. Timeline (2) Timeline (3) Overview. What is Text Mining?
Text Mining 2004-2005 Master TKI Antal van den Bosch en Walter Daelemans http://ilk.uvt.nl/~antalb/textmining/ Dinsdag, 10.45-12.30, SZ33 Timeline (1) [1 februari 2005] Introductie (WD) [15 februari 2005]
Structure of Clauses. March 9, 2004
Structure of Clauses March 9, 2004 Preview Comments on HW 6 Schedule review session Finite and non-finite clauses Constituent structure of clauses Structure of Main Clauses Discuss HW #7 Course Evals Comments
Deep Learning For Text Processing
Deep Learning For Text Processing Jeffrey A. Bilmes Professor Departments of Electrical Engineering & Computer Science and Engineering University of Washington, Seattle http://melodi.ee.washington.edu/~bilmes
Named Entity Recognition Experiments on Turkish Texts
Named Entity Recognition Experiments on Dilek Küçük 1 and Adnan Yazıcı 2 1 TÜBİTAK - Uzay Institute, Ankara - Turkey [email protected] 2 Dept. of Computer Engineering, METU, Ankara - Turkey
University of Massachusetts Boston Applied Linguistics Graduate Program. APLING 601 Introduction to Linguistics. Syllabus
University of Massachusetts Boston Applied Linguistics Graduate Program APLING 601 Introduction to Linguistics Syllabus Course Description: This course examines the nature and origin of language, the history
TAPTA: Translation Assistant for Patent Titles and Abstracts
TAPTA: Translation Assistant for Patent Titles and Abstracts https://www3.wipo.int/patentscope/translate A. What is it? This tool is designed to give users the possibility to understand the content of
Segmentation and Punctuation Prediction in Speech Language Translation Using a Monolingual Translation System
Segmentation and Punctuation Prediction in Speech Language Translation Using a Monolingual Translation System Eunah Cho, Jan Niehues and Alex Waibel International Center for Advanced Communication Technologies
Outline of today s lecture
Outline of today s lecture Generative grammar Simple context free grammars Probabilistic CFGs Formalism power requirements Parsing Modelling syntactic structure of phrases and sentences. Why is it useful?
Text Generation for Abstractive Summarization
Text Generation for Abstractive Summarization Pierre-Etienne Genest, Guy Lapalme RALI-DIRO Université de Montréal P.O. Box 6128, Succ. Centre-Ville Montréal, Québec Canada, H3C 3J7 {genestpe,lapalme}@iro.umontreal.ca
