Why Machine Translation? Machine Translation Overview & Introduction. Why Machine Translation? Why is translation hard?
|
|
- Thomasine Anderson
- 7 years ago
- Views:
Transcription
1 Why Machine Translation? Machine Translation Overview & Introduction Jörg Tiedemann Department of Linguistics and Philology Uppsala University translation is expensive on-line demand for translation (on-the-fly) globalization, growing export, lots of language pairs political issues (EU, support of minority languages...) tourism, movies, news... MT combines various aspects of computational linguistics Jörg Tiedemann 1/39 Jörg Tiedemann 2/39 Why Machine Translation? Why is translation hard? MT makes us laugh... Input: Hur många människor bor i Uppsala? Google 2009: How many people live in water? Input: Vem vann allsvenskan i fjol? Google 2010: Who stole headlines last year?... but constantly improves! Google 2011: How many people live in Uppsala? Who won the league last year? Languages are different on many levels: lexicon, syntax, semantics source language ambiguity cross-lingual divergences target language variation pragmatics, style, culture, background Computers have big problems with ambiguity. Jörg Tiedemann 3/39 Jörg Tiedemann 4/39
2 Translation by humans What can we do?... browsing quality A human translator needs... to understand the source language to know how to speak the target language (well!) knowledge about the relationship between source and target language knowledge about the topic of the text to be translated knowledge about culture, values, traditions and expectation av speakers of source and target language Computers have big problems with all of these issues Jörg Tiedemann 5/39 Jörg Tiedemann 6/39 What can we do? Outline Balance between quality and input restrictions Automatic translation depending on task general purpose semi-automatic sublanguage rough translation post-editing domain specific browsing quality editing quality publishing quality Motivation Course Overview fully automatic (gisting) computer-aided (human) translation (CAT) fully automatic (FAHQMT) What are the problems? MT history on-line service localization,... product manuals, limited domains Jörg Tiedemann 7/39 Jörg Tiedemann 8/39
3 General Course Related Information General Course Related Information Course: 5LN426 (7.5hp) introductory course presentation of main approaches (lectures) focus on practical sessions test & use some existing systems and tools develop & present projects Examination: pass obligatory lab assignments (G) translation project (graded) define & develop your own MT project present the project in class (2 seminars at the end) describe project in a report Jörg Tiedemann 9/39 Jörg Tiedemann 10/39 General Course Related Information What exactly is MT? Literature: Philipp Koehn: Statistical Machine Translation Cambridge University Press on-line material (scientific papers, slides,...) For the labs: you need an account on the STP machines... Website: joerg/mt11 Additional information might appear at any time during the course. MT = automatic translation from one language (source language) to another (target language) using computers MT translation memories and bilingual dictionaries MT - often: sentence-by-sentence translation MT often refers to translation of written text (not speech) semi-automatic: CAT = computer-aided translation Jörg Tiedemann 11/39 Jörg Tiedemann 12/39
4 Why is MT hard? Why is it hard to accept MT? Processing of languages is hard: analysis, information extraction, message understanding natural language generation, discourse fluency ambiguity resolution, synonymy & redundancy non-compositionality, idioms... domain detection & adaptation unrealistic expectations bad translations in available systems unexpected (not humanlike) errors problems of workflow integration Languages are different on many levels: lexicon, syntax, pragmatics style, culture, background changing now due to recent developments (!?) Jörg Tiedemann 13/39 Jörg Tiedemann 14/39 What are the problems? Source language ambiguity source language ambiguity target language variation cross-lingual divergences poäng point, points, credit,... var was, were (verb) each (pron) where (adv) every (adj) anta suppose admit (anta någon till...) Ambiguity is usually solved in context Jörg Tiedemann 15/39 Jörg Tiedemann 16/39
5 Lexical ambiguities across languages Language divergences and mismatches systematic differences between the 2 languages morphology (isolating vs polysynthetic, agglutinative vs fusion) syntax (SVO, SOV, VSO, argument structure, pro-drop) idiosyncratic and lexical differences differences in lexical ambiguity (polysemy) lexical gaps differences in tempus, aspect, voice different idiomatic/fixed expressions... (from Jurafsky & Martin, 2008: Speech and Language Processing) Jörg Tiedemann 17/39 Jörg Tiedemann 18/39 Divergences English - Spanish, sample of 19,000 sentences: 1 in 3 presents divergences (Dorr et al, 2002) Types (Habash, 2002) 1. categorical divergences: tener celos (N) (lit. to have jealousy ) to be jealous (A) 2. conflation: ir flotando (lit. to go floating ) to float 3. structural divergence: entrar en N (lit. to enter in N ) to enter N 4. head swapping: entrar corriendo (lit. to enter running ) to run in 5. thematic divergence: me gustan uvas (lit. to-me they-please grapes ) I like grapes Variation in Target language Redundancy of natural languages translate: Vid avslutad kurs... On completion of the course... After completion of the course... Having completed the course... After finishing the course... Once the course has been completed Which one is best? How do we decide that? Jörg Tiedemann 19/39 Jörg Tiedemann 20/39
6 Linguist s view on MT Engineer s view on MT Three steps for MT 1. source language analysis (understanding) 2. source to target language transfer (or interlingua) 3. target language generation (grammatical & fluent) appropriate tools? handling ambiguity? (transfer) formalism? interlingua? MT as decoding "I have a text in front of me which is written in Russian but I am going to pretend that it is really written in English and that it has been coded in some strange symbols. All I need to do is strip off the code in order to retrieve the information coded in the text" (Warren Weavers (1947/49), Rockefeller Foundation) (... and computers are good in encoding/decoding) Jörg Tiedemann 21/39 Jörg Tiedemann 22/39 Linguist s view on MT Engineer s view on MT What do we need for MT? What linguistic knowledge is needed to translate? 1. Phonological knowledge (speech-to-speech) 2. Morphological knowledge (inflection) 3. Syntactical knowledge (grammar) 4. Semantic knowledge (meaning) 5. Pragmatic knowledge (fluency, style, politeness,...) 1. more training data 2. statistical models 3. machine learning and efficient computing 4. more data! more data! more data! Fred Jelinek (IBM speech group) in the 80s: "Every time I fire a linguist the performance goes up" (... or something similar) Jörg Tiedemann 23/39 Jörg Tiedemann 24/39
7 Some words about the history of MT A brief history of MT (based on Hutchins) early days of computers: translation was thought to be one of the first tasks to be solved by computers (1930 s & 40s) MT = first task for language technology development of NLP/computational linguistics MT has a long history with many ups & downs 1. Precursors and pioneers, interest in cryptography, information theory, coding early computers, political/military interests 2. High expectations and disillusion, Georgetown experiment (rus-eng) big success three (classical) MT approaches (direct MT, interlingua-based, transfer-based) polarization: empiricists and theoretical (computational) linguists world-wide establishment of MT research groups problems: insufficient computer power, formal theories under development, no NLP tools/resources ALPAC report 1966: no need for MT (it s not going to work in near future) Jörg Tiedemann 25/39 Jörg Tiedemann 26/39 Misconceptions about MT A brief history of MT (based on Hutchins) "MT is a waste of time because you will never make a machine that can translate Shakespeare." "There was/is an MT system which translated The spirit is willing but the flesh is weak into the Russian equivalent of The vodka is good, but the steak is lousy. MT is useless". D.J. Arnold, Lorna Balkan, Siety Meijer, R.Lee Humphreys and Louisa Sadler Machine Translation: an Introductory Guide, 1994 The quiet decade, domain-specific translation (controlled language) mainly interlingua approaches but no real progress Operational and commercial systems, Systran, METAL computer-aided translation (especially in Japan) Revival of MT research, Data-driven MT, 1990 SMT, EBMT, synchronous grammars, hybrid models Jörg Tiedemann 27/39 Jörg Tiedemann 28/39
8 Where are we now? Two main paradigms Today: more automatic translations than manual ca 50 million translation requests every day on the Internet (van der Meer, 2008) Google: provides on-line translation for 63 languages approaches inspired by linguistic theory data-driven techniques (re-using existing translations) Let s look at the classical rule-based systems first Tasks: higher quality, other domains, other language pairs Jörg Tiedemann 29/39 Jörg Tiedemann 30/39 The Vauquois triangle (translation triangle) The Vauquois triangle: Classical MT approaches some intermediate symbolic representation (from Jurafsky & Martin, 2008: Speech and Language Processing) Jörg Tiedemann 31/39 Jörg Tiedemann 32/39
9 Direct Translation Transfer-based MT morphological analysis word-by-word using large bilingual dictionaries (may include phrases) local reordering using simple rules morphological generation Example (from Jurafsky & Martin) Input: Mary didn t slap the green witch After 1: Morphology Mary DO-PAST not slap the green witch After 2: Lexical Transfer Maria PAST no dar una bofetada a la verde bruja After 3: Local reordering Maria no dar PAST una bofetada a la bruja verde After 4: Morphology Maria no dió una bofetada a la bruja verde 3 steps: 1. source language analysis 2. transfer (source to target structures) 3. target language generation transfer rules: lexical transfer syntactic transfer semantic transfer Jörg Tiedemann 33/39 Jörg Tiedemann 34/39 Transfer-based MT Transfer rules to cover systematic structural differences Informal description of some transformations English to Spanish: NP Adjective1 Noun2 NP Noun2 Adjective1 Chinese to English: VP PP[+Goal] V VP V PP[+Goal] English to Japanese: VP V NP VP NP V NP NP1 RelClause2 NP RelClause2 NP1 Source language analysis depends on features necessary for transfer (purely syntactic, semantic features,...) Lexical transfer rules (phrasal) bilingual dictionaries Jörg Tiedemann 35/39 Interlingua-based MT Basic idea: use meaning representation semantic analysis of source language input (formal) representation of meaning using interlingua ( no language-pair specific transfer necessary!) target language generation from meaning representation Nice idea, but... Where are the tools? What is a good way of representing meaning? Many problems with ambiguity and language divergences Jörg Tiedemann 36/39
10 Interlingua-based MT Summary Example interlingual representation: Mary did not slap the green witch long, bumpy history rule-based systems vs. data-driven systems linguistic theory vs. examples & statistics large demand for MT world-wide MT quality depending on task MT = hot topic again Jörg Tiedemann 37/39 Jörg Tiedemann 38/39 Next lecture: more on rule-based systems domain-specific approaches evaluation of MT output (manual & automatic) first lab session: evaluation of MT output a little experiment with understandability Jörg Tiedemann 39/39
Maskinöversättning 2008. F2 Översättningssvårigheter + Översättningsstrategier
Maskinöversättning 2008 F2 Översättningssvårigheter + Översättningsstrategier Flertydighet i källspråket poäng point, points, credit, credits, var verb ->was, were pron -> each adv -> where adj -> every
More informationOverview of MT techniques. Malek Boualem (FT)
Overview of MT techniques Malek Boualem (FT) This section presents an standard overview of general aspects related to machine translation with a description of different techniques: bilingual, transfer,
More informationThe history of machine translation in a nutshell
1. Before the computer The history of machine translation in a nutshell 2. The pioneers, 1947-1954 John Hutchins [revised January 2014] It is possible to trace ideas about mechanizing translation processes
More informationComprendium Translator System Overview
Comprendium System Overview May 2004 Table of Contents 1. INTRODUCTION...3 2. WHAT IS MACHINE TRANSLATION?...3 3. THE COMPRENDIUM MACHINE TRANSLATION TECHNOLOGY...4 3.1 THE BEST MT TECHNOLOGY IN THE MARKET...4
More informationThe history of machine translation in a nutshell
The history of machine translation in a nutshell John Hutchins [Web: http://ourworld.compuserve.com/homepages/wjhutchins] [Latest revision: November 2005] 1. Before the computer 2. The pioneers, 1947-1954
More informationStatistical Machine Translation Lecture 4. Beyond IBM Model 1 to Phrase-Based Models
p. Statistical Machine Translation Lecture 4 Beyond IBM Model 1 to Phrase-Based Models Stephen Clark based on slides by Philipp Koehn p. Model 2 p Introduces more realistic assumption for the alignment
More informationISA OR NOT ISA: THE INTERLINGUAL DILEMMA FOR MACHINE TRANSLATION
ISA OR NOT ISA: THE INTERLINGUAL DILEMMA FOR MACHINE TRANSLATION FLORENCE REEDER The MITRE Corporation 1 / George Mason University freeder@mitre.org ABSTRACT Developing a system that accurately produces
More informationWHITE PAPER. Machine Translation of Language for Safety Information Sharing Systems
WHITE PAPER Machine Translation of Language for Safety Information Sharing Systems September 2004 Disclaimers; Non-Endorsement All data and information in this document are provided as is, without any
More informationLing 201 Syntax 1. Jirka Hana April 10, 2006
Overview of topics What is Syntax? Word Classes What to remember and understand: Ling 201 Syntax 1 Jirka Hana April 10, 2006 Syntax, difference between syntax and semantics, open/closed class words, all
More informationIntroduction. Philipp Koehn. 28 January 2016
Introduction Philipp Koehn 28 January 2016 Administrativa 1 Class web site: http://www.mt-class.org/jhu/ Tuesdays and Thursdays, 1:30-2:45, Hodson 313 Instructor: Philipp Koehn (with help from Matt Post)
More informationStatistical Machine Translation
Statistical Machine Translation Some of the content of this lecture is taken from previous lectures and presentations given by Philipp Koehn and Andy Way. Dr. Jennifer Foster National Centre for Language
More informationPROMT Technologies for Translation and Big Data
PROMT Technologies for Translation and Big Data Overview and Use Cases Julia Epiphantseva PROMT About PROMT EXPIRIENCED Founded in 1991. One of the world leading machine translation provider DIVERSIFIED
More informationIRIS - English-Irish Translation System
IRIS - English-Irish Translation System Mihael Arcan, Unit for Natural Language Processing of the Insight Centre for Data Analytics at the National University of Ireland, Galway Introduction about me,
More informationEmpirical Machine Translation and its Evaluation
Empirical Machine Translation and its Evaluation EAMT Best Thesis Award 2008 Jesús Giménez (Advisor, Lluís Màrquez) Universitat Politècnica de Catalunya May 28, 2010 Empirical Machine Translation Empirical
More informationDRAFT MACHINE TRANSLATION. The process of translating comprises in its essence the whole secret of human understanding and social communication.
Speech and Language Processing: An introduction to natural language processing, computational linguistics, and speech recognition. Daniel Jurafsky & James H. Martin. Copyright c 2007, All rights reserved.
More informationExtracting translation relations for humanreadable dictionaries from bilingual text
Extracting translation relations for humanreadable dictionaries from bilingual text Overview 1. Company 2. Translate pro 12.1 and AutoLearn 3. Translation workflow 4. Extraction method 5. Extended
More informationWeb-based automatic translation: the Yandex.Translate API
Maarten van Hees m.van.hees@umail.leidenuniv.nl Web-based automatic translation: the Yandex.Translate API Paulina Kozłowska pk.kozlowska@gmail.com Nana Tian n.tian.2@umail.leidenuniv.nl ABSTRACT Yandex.Translate
More informationTRANSLATION OF TELUGU-MARATHI AND VICE- VERSA USING RULE BASED MACHINE TRANSLATION
TRANSLATION OF TELUGU-MARATHI AND VICE- VERSA USING RULE BASED MACHINE TRANSLATION Dr. Siddhartha Ghosh 1, Sujata Thamke 2 and Kalyani U.R.S 3 1 Head of the Department of Computer Science & Engineering,
More informationMachine Translation Computer Aided Translation Machine Language Processing
Machine Translation Computer Aided Translation Machine Language Processing Martin Kappus (kapm@zhaw.ch) 1 Machine Translation Computer-Aided Translation Agenda Machine Translation Introduction History
More informationCS 6740 / INFO 6300. Ad-hoc IR. Graduate-level introduction to technologies for the computational treatment of information in humanlanguage
CS 6740 / INFO 6300 Advanced d Language Technologies Graduate-level introduction to technologies for the computational treatment of information in humanlanguage form, covering natural-language processing
More informationAutomation of Translation: Past, Presence, and Future Karl Heinz Freigang, Universität des Saarlandes, Saarbrücken
Automation of Translation: Past, Presence, and Future Karl Heinz Freigang, Universität des Saarlandes, Saarbrücken Introduction First attempts in "automating" the process of translation between natural
More informationHow the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.
Svetlana Sokolova President and CEO of PROMT, PhD. How the Computer Translates Machine translation is a special field of computer application where almost everyone believes that he/she is a specialist.
More informationSemantic analysis of text and speech
Semantic analysis of text and speech SGN-9206 Signal processing graduate seminar II, Fall 2007 Anssi Klapuri Institute of Signal Processing, Tampere University of Technology, Finland Outline What is semantic
More information4.1 Multilingual versus bilingual systems
This chapter is devoted to some fundamental questions on the basic strategies of MT systems. These concern decisions which designers of MT systems have to address before any construction can start, and
More informationHybrid Strategies. for better products and shorter time-to-market
Hybrid Strategies for better products and shorter time-to-market Background Manufacturer of language technology software & services Spin-off of the research center of Germany/Heidelberg Founded in 1999,
More informationLearning Translation Rules from Bilingual English Filipino Corpus
Proceedings of PACLIC 19, the 19 th Asia-Pacific Conference on Language, Information and Computation. Learning Translation s from Bilingual English Filipino Corpus Michelle Wendy Tan, Raymond Joseph Ang,
More informationTranslating while Parsing
Gábor Prószéky Translating while Parsing Abstract Translations unconsciously rely on the hypothesis that sentences of a certain source language L S can be expressed by sentences of the target language
More informationREALIZATION SORTING ALGORITHM USING PARALLEL TECHNOLOGIES bachelor, Mikhelev Vladimir candidate of Science, prof., Sinyuk Vasily
2. В. Гергель: Современные языки и технологии параллельного программирования. М., Изд-во МГУ, 2012. 3. Синюк, В. Г. Алгоритмы и структуры данных. Белгород: Изд-во БГТУ им. В. Г. Шухова, 2013. REALIZATION
More informationAn Approach to Handle Idioms and Phrasal Verbs in English-Tamil Machine Translation System
An Approach to Handle Idioms and Phrasal Verbs in English-Tamil Machine Translation System Thiruumeni P G, Anand Kumar M Computational Engineering & Networking, Amrita Vishwa Vidyapeetham, Coimbatore,
More informationSYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 SYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 CWMT2011 技 术 报 告
SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 Jin Yang and Satoshi Enoue SYSTRAN Software, Inc. 4444 Eastgate Mall, Suite 310 San Diego, CA 92121, USA E-mail:
More informationSyntactic Theory. Background and Transformational Grammar. Dr. Dan Flickinger & PD Dr. Valia Kordoni
Syntactic Theory Background and Transformational Grammar Dr. Dan Flickinger & PD Dr. Valia Kordoni Department of Computational Linguistics Saarland University October 28, 2011 Early work on grammar There
More informationStatistical NLP Spring 2008. Machine Translation: Examples
Statistical NLP Spring 2008 Lecture 11: Word Alignment Dan Klein UC Berkeley Machine Translation: Examples 1 Machine Translation Madame la présidente, votre présidence de cette institution a été marquante.
More informationSyntax: Phrases. 1. The phrase
Syntax: Phrases Sentences can be divided into phrases. A phrase is a group of words forming a unit and united around a head, the most important part of the phrase. The head can be a noun NP, a verb VP,
More informationModule Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg
Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg March 1, 2007 The catalogue is organized into sections of (1) obligatory modules ( Basismodule ) that
More informationM LTO Multilingual On-Line Translation
O non multa, sed multum M LTO Multilingual On-Line Translation MOLTO Consortium FP7-247914 Project summary MOLTO s goal is to develop a set of tools for translating texts between multiple languages in
More informationSakkamMT White Paper
SakkamMT White Paper Sakkam K.K. Wakamatsu Building 7F 3-3-6 Nihonbashi Honcho Chuo-ku Tokyo Introduction Ever since the emergence of machine translation there has been a debate about if and when machines
More informationAutomatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast
Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Hassan Sawaf Science Applications International Corporation (SAIC) 7990
More informationThe SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge
The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge White Paper October 2002 I. Translation and Localization New Challenges Businesses are beginning to encounter
More informationStudy Plan. Bachelor s in. Faculty of Foreign Languages University of Jordan
Study Plan Bachelor s in Spanish and English Faculty of Foreign Languages University of Jordan 2009/2010 Department of European Languages Faculty of Foreign Languages University of Jordan Degree: B.A.
More informationParaphrasing controlled English texts
Paraphrasing controlled English texts Kaarel Kaljurand Institute of Computational Linguistics, University of Zurich kaljurand@gmail.com Abstract. We discuss paraphrasing controlled English texts, by defining
More informationInterlingua-based Machine Translation Systems: UNL versus Other Interlinguas
Interlingua-based Machine Translation Systems: UNL versus Other Interlinguas Sameh AlAnsary Department of Phonetics and Linguistics, Faculty of Arts, Alexandria University ElShatby, Alexandria, Egypt.
More informationIntroduction to formal semantics -
Introduction to formal semantics - Introduction to formal semantics 1 / 25 structure Motivation - Philosophy paradox antinomy division in object und Meta language Semiotics syntax semantics Pragmatics
More informationPsychology G4470. Psychology and Neuropsychology of Language. Spring 2013.
Psychology G4470. Psychology and Neuropsychology of Language. Spring 2013. I. Course description, as it will appear in the bulletins. II. A full description of the content of the course III. Rationale
More informationAn Overview of Applied Linguistics
An Overview of Applied Linguistics Edited by: Norbert Schmitt Abeer Alharbi What is Linguistics? It is a scientific study of a language It s goal is To describe the varieties of languages and explain the
More informationTranslation Solution for
Translation Solution for Case Study Contents PROMT Translation Solution for PayPal Case Study 1 Contents 1 Summary 1 Background for Using MT at PayPal 1 PayPal s Initial Requirements for MT Vendor 2 Business
More informationAn English-to-Arabic Prototype Machine Translator for Statistical Sentences
Intelligent Information Management, 2012, 4, 13-22 http://dx.doi.org/10.4236/iim.2012.41003 Published Online January 2012 (http://www.scirp.org/journal/iim) An English-to-Arabic Prototype Machine Translator
More informationHybrid Machine Translation Guided by a Rule Based System
Hybrid Machine Translation Guided by a Rule Based System Cristina España-Bonet, Gorka Labaka, Arantza Díaz de Ilarraza, Lluís Màrquez Kepa Sarasola Universitat Politècnica de Catalunya University of the
More informationWhy Evaluation? Machine Translation. Evaluation. Evaluation Metrics. Ten Translations of a Chinese Sentence. How good is a given system?
Why Evaluation? How good is a given system? Machine Translation Evaluation Which one is the best system for our purpose? How much did we improve our system? How can we tune our system to become better?
More informationMachine Translation. Why Evaluation? Evaluation. Ten Translations of a Chinese Sentence. Evaluation Metrics. But MT evaluation is a di cult problem!
Why Evaluation? How good is a given system? Which one is the best system for our purpose? How much did we improve our system? How can we tune our system to become better? But MT evaluation is a di cult
More informationIntroduction. BM1 Advanced Natural Language Processing. Alexander Koller. 17 October 2014
Introduction! BM1 Advanced Natural Language Processing Alexander Koller! 17 October 2014 Outline What is computational linguistics? Topics of this course Organizational issues Siri Text prediction Facebook
More informationCustomizing an English-Korean Machine Translation System for Patent Translation *
Customizing an English-Korean Machine Translation System for Patent Translation * Sung-Kwon Choi, Young-Gil Kim Natural Language Processing Team, Electronics and Telecommunications Research Institute,
More informationMachine Translation and the Translator
Machine Translation and the Translator Philipp Koehn 8 April 2015 About me 1 Professor at Johns Hopkins University (US), University of Edinburgh (Scotland) Author of textbook on statistical machine translation
More informationWhat s in a Lexicon. The Lexicon. Lexicon vs. Dictionary. What kind of Information should a Lexicon contain?
What s in a Lexicon What kind of Information should a Lexicon contain? The Lexicon Miriam Butt November 2002 Semantic: information about lexical meaning and relations (thematic roles, selectional restrictions,
More informationPrivacy Issues in Online Machine Translation Services European Perspective.
Privacy Issues in Online Machine Translation Services European Perspective. Pawel Kamocki, Jim O'Regan IDS Mannheim / Paris Descartes / WWU Münster Centre for Language and Communication Studies, Trinity
More informationOutline of today s lecture
Outline of today s lecture Generative grammar Simple context free grammars Probabilistic CFGs Formalism power requirements Parsing Modelling syntactic structure of phrases and sentences. Why is it useful?
More informationSYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 统
SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems Jin Yang, Satoshi Enoue Jean Senellart, Tristan Croiset SYSTRAN Software, Inc. SYSTRAN SA 9333 Genesee Ave. Suite PL1 La Grande
More informationMachine Translation at the European Commission
Directorate-General for Translation Machine Translation at the European Commission Konferenz 10 Jahre Verbmobil Saarbrücken, 16. November 2010 Andreas Eisele Project Manager Machine Translation, ICT Unit
More informationCOURSE SYLLABUS ESU 561 ASPECTS OF THE ENGLISH LANGUAGE. Fall 2014
COURSE SYLLABUS ESU 561 ASPECTS OF THE ENGLISH LANGUAGE Fall 2014 EDU 561 (85515) Instructor: Bart Weyand Classroom: Online TEL: (207) 985-7140 E-Mail: weyand@maine.edu COURSE DESCRIPTION: This is a practical
More informationAccording to the Argentine writer Jorge Luis Borges, in the Celestial Emporium of Benevolent Knowledge, animals are divided
Categories Categories According to the Argentine writer Jorge Luis Borges, in the Celestial Emporium of Benevolent Knowledge, animals are divided into 1 2 Categories those that belong to the Emperor embalmed
More informationMACHINE TRANSLATION. I.1 Introduction to Statistical Machine Translation
Universitat Politècnica de València Master en Inteligencia Artificial, Renoconocimiento de Formas e Imagen Digital Master in Artificial Intelligence, Pattern Recognition and Digital Imaging 2015-2016 MACHINE
More informationEnglish Descriptive Grammar
English Descriptive Grammar 2015/2016 Code: 103410 ECTS Credits: 6 Degree Type Year Semester 2500245 English Studies FB 1 1 2501902 English and Catalan FB 1 1 2501907 English and Classics FB 1 1 2501910
More informationI m sorry, Dave, I m afraid I can t do that :
I m sorry, Dave, I m afraid I can t do that : Linguistics, Statistics, and Natural-Language Processing in the Big Data Era Lillian Lee Professor, Computer Science http://www.cs.cornell.edu/home/llee the
More informationModern foreign languages
Modern foreign languages Programme of study for key stage 3 and attainment targets (This is an extract from The National Curriculum 2007) Crown copyright 2007 Qualifications and Curriculum Authority 2007
More informationLanguage and Computation
Language and Computation week 13, Thursday, April 24 Tamás Biró Yale University tamas.biro@yale.edu http://www.birot.hu/courses/2014-lc/ Tamás Biró, Yale U., Language and Computation p. 1 Practical matters
More informationConvergence of Translation Memory and Statistical Machine Translation
Convergence of Translation Memory and Statistical Machine Translation Philipp Koehn and Jean Senellart 4 November 2010 Progress in Translation Automation 1 Translation Memory (TM) translators store past
More informationA Survey of Online Tools Used in English-Thai and Thai-English Translation by Thai Students
69 A Survey of Online Tools Used in English-Thai and Thai-English Translation by Thai Students Sarathorn Munpru, Srinakharinwirot University, Thailand Pornpol Wuttikrikunlaya, Srinakharinwirot University,
More informationNoam Chomsky: Aspects of the Theory of Syntax notes
Noam Chomsky: Aspects of the Theory of Syntax notes Julia Krysztofiak May 16, 2006 1 Methodological preliminaries 1.1 Generative grammars as theories of linguistic competence The study is concerned with
More informationTo download the script for the listening go to: http://www.teachingenglish.org.uk/sites/teacheng/files/learning-stylesaudioscript.
Learning styles Topic: Idioms Aims: - To apply listening skills to an audio extract of non-native speakers - To raise awareness of personal learning styles - To provide concrete learning aids to enable
More informationTowards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives
Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives Ramona Enache and Adam Slaski Department of Computer Science and Engineering Chalmers University of Technology and
More informationHow To Overcome Language Barriers In Europe
MACHINE TRANSLATION AS VALUE-ADDED SERVICE by Prof. Harald H. Zimmermann, Saarbrücken (FRG) draft: April 1990 (T6IBM1) I. THE LANGUAGE BARRIER There is a magic number common to the European Community (E.C.):
More informationA Machine Translation System Between a Pair of Closely Related Languages
A Machine Translation System Between a Pair of Closely Related Languages Kemal Altintas 1,3 1 Dept. of Computer Engineering Bilkent University Ankara, Turkey email:kemal@ics.uci.edu Abstract Machine translation
More informationPROMT-Adobe Case Study:
For Americas: 330 Townsend St., Suite 117, San Francisco, CA 94107 Tel: (415) 913-7586 Fax: (415) 913-7589 promtamericas@promt.com PROMT-Adobe Case Study: For other regions: 16A Dobrolubova av. ( Arena
More informationStudy Plan for Master of Arts in Applied Linguistics
Study Plan for Master of Arts in Applied Linguistics Master of Arts in Applied Linguistics is awarded by the Faculty of Graduate Studies at Jordan University of Science and Technology (JUST) upon the fulfillment
More informationSharing resources between free/open-source rule-based machine translation systems: Grammatical Framework and Apertium
Sharing resources between free/open-source rule-based machine translation systems: Grammatical Framework and Apertium Grégoire Détrez, Víctor M. Sánchez-Cartagena, Aarne Ranta gregoire.detrez@gu.se, vmsanchez@prompsit.com,
More informationLanguage Arts Literacy Areas of Focus: Grade 6
Language Arts Literacy : Grade 6 Mission: Learning to read, write, speak, listen, and view critically, strategically and creatively enables students to discover personal and shared meaning throughout their
More informationCurriculum for the Bachelor Degree in English Language and Literature
Curriculum for the Bachelor Degree in The curriculum for the B.S.c. degree in (12) credit hours are distributed as follows:- No. 1 2 4 Requirements University Requirements Compulsory Elective Faculty Requirements
More informationRegister Differences between Prefabs in Native and EFL English
Register Differences between Prefabs in Native and EFL English MARIA WIKTORSSON 1 Introduction In the later stages of EFL (English as a Foreign Language) learning, and foreign language learning in general,
More informationLSI TRANSLATION PLUG-IN FOR RELATIVITY. within
within LSI Translation Plug-in (LTP) for Relativity is a free plug-in that allows the Relativity user to access the STS system 201 Broadway, Cambridge, MA 02139 Contact: Mark Ettinger Tel: 800-654-5006
More informationAutomated Online English -Arabic Translator
Faculty of Engineering & Architecture Electrical & Computer Engineering Department Final Year Project - Spring 2006 Automated Online English -Arabic Translator Supervisor: Prof. Ali El Hajj Report Grader:
More informationThis page intentionally left blank
This page intentionally left blank Statistical Machine Translation The field of machine translation has recently been energized by the emergence of statistical techniques, which have brought the dream
More informationLANGUAGE TRANSLATION SOFTWARE EXECUTIVE SUMMARY Language Translation Software Market Shares and Market Forecasts Language Translation Software Market
LANGUAGE TRANSLATION SOFTWARE EXECUTIVE SUMMARY Language Translation Software Market Shares and Market Forecasts Language Translation Software Market Driving Forces Language Translation Software Cloud
More informationa Chinese-to-Spanish rule-based machine translation
Chinese-to-Spanish rule-based machine translation system Jordi Centelles 1 and Marta R. Costa-jussà 2 1 Centre de Tecnologies i Aplicacions del llenguatge i la Parla (TALP), Universitat Politècnica de
More informationwww.newsflashenglish.com
www.newsflashenglish.com The 4 page 60 minute ESL British English lesson 10/03/13 There are many fine universities in the world today to study at. Each has its own history and reputation. In this lesson
More informationAn Interactive Hypertextual Environment for MT Training
An Interactive Hypertextual Environment for MT Training Etienne Blanc GETA, CLIPS-IMAG BP 53, F-38041 Grenoble cedex 09 Etienne.Blanc@imag.fr Abstract An interactive hypertextual environment for MT training
More informationACCURAT Analysis and Evaluation of Comparable Corpora for Under Resourced Areas of Machine Translation www.accurat-project.eu Project no.
ACCURAT Analysis and Evaluation of Comparable Corpora for Under Resourced Areas of Machine Translation www.accurat-project.eu Project no. 248347 Deliverable D5.4 Report on requirements, implementation
More informationComparative Stylistics in an Integrated Machine Translation System
Comparative Stylistics in an Integrated Machine Translation System by Keith Mah A thesis presented to the University of Waterloo in fulfilment of the thesis requirement for the degree of Master of Mathematics
More informationThe XMU Phrase-Based Statistical Machine Translation System for IWSLT 2006
The XMU Phrase-Based Statistical Machine Translation System for IWSLT 2006 Yidong Chen, Xiaodong Shi Institute of Artificial Intelligence Xiamen University P. R. China November 28, 2006 - Kyoto 13:46 1
More informationTHE BACHELOR S DEGREE IN SPANISH
Academic regulations for THE BACHELOR S DEGREE IN SPANISH THE FACULTY OF HUMANITIES THE UNIVERSITY OF AARHUS 2007 1 Framework conditions Heading Title Prepared by Effective date Prescribed points Text
More informationWhy major in linguistics (and what does a linguist do)?
Why major in linguistics (and what does a linguist do)? Written by Monica Macaulay and Kristen Syrett What is linguistics? If you are considering a linguistics major, you probably already know at least
More informationMorphology. Morphology is the study of word formation, of the structure of words. 1. some words can be divided into parts which still have meaning
Morphology Morphology is the study of word formation, of the structure of words. Some observations about words and their structure: 1. some words can be divided into parts which still have meaning 2. many
More informationA. Schedule: Reading, problem set #2, midterm. B. Problem set #1: Aim to have this for you by Thursday (but it could be Tuesday)
Lecture 5: Fallacies of Clarity Vagueness and Ambiguity Philosophy 130 September 23, 25 & 30, 2014 O Rourke I. Administrative A. Schedule: Reading, problem set #2, midterm B. Problem set #1: Aim to have
More informationDifferences in linguistic and discourse features of narrative writing performance. Dr. Bilal Genç 1 Dr. Kağan Büyükkarcı 2 Ali Göksu 3
Yıl/Year: 2012 Cilt/Volume: 1 Sayı/Issue:2 Sayfalar/Pages: 40-47 Differences in linguistic and discourse features of narrative writing performance Abstract Dr. Bilal Genç 1 Dr. Kağan Büyükkarcı 2 Ali Göksu
More informationCOURSE OBJECTIVES SPAN 100/101 ELEMENTARY SPANISH LISTENING. SPEAKING/FUNCTIONAl KNOWLEDGE
SPAN 100/101 ELEMENTARY SPANISH COURSE OBJECTIVES This Spanish course pays equal attention to developing all four language skills (listening, speaking, reading, and writing), with a special emphasis on
More informationNatural Language to Relational Query by Using Parsing Compiler
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,
More informationHistorical Linguistics. Diachronic Analysis. Two Approaches to the Study of Language. Kinds of Language Change. What is Historical Linguistics?
Historical Linguistics Diachronic Analysis What is Historical Linguistics? Historical linguistics is the study of how languages change over time and of their relationships with other languages. All languages
More informationComputer Aided Translation
Computer Aided Translation Philipp Koehn 30 April 2015 Why Machine Translation? 1 Assimilation reader initiates translation, wants to know content user is tolerant of inferior quality focus of majority
More informationOPTIMIZING CONTENT FOR TRANSLATION ACROLINX AND VISTATEC
OPTIMIZING CONTENT FOR TRANSLATION ACROLINX AND VISTATEC We ll look at these questions. Why does translation cost so much? Why is it hard to keep content consistent? Why is it hard for an organization
More informationExtraction of Legal Definitions from a Japanese Statutory Corpus Toward Construction of a Legal Term Ontology
Extraction of Legal Definitions from a Japanese Statutory Corpus Toward Construction of a Legal Term Ontology Makoto Nakamura, Yasuhiro Ogawa, Katsuhiko Toyama Japan Legal Information Institute, Graduate
More informationUNIVERSITY OF JORDAN ADMISSION AND REGISTRATION UNIT COURSE DESCRIPTION
Course Description B.A Degree Spanish and English Language and Literature 2203103 Spanish Language for Beginners (1) (3 credit hours) Prerequisite : none In combination with Spanish for Beginners (2),
More informationDeclarative Models for Hybrid Machine Translation Proposal for VR Projektbidrag, Appendix A
Declarative Models for Hybrid Machine Translation Proposal for VR Projektbidrag, Appendix A Professor Aarne Ranta (PI) Department of Computer Science and Engineering Chalmers University of Technology and
More information