Extracting translation relations for humanreadable dictionaries from bilingual text

Size: px
Start display at page:

Download "Extracting translation relations for humanreadable dictionaries from bilingual text"

Transcription

1 Extracting translation relations for humanreadable dictionaries from bilingual text

2 Overview 1. Company 2. Translate pro 12.1 and AutoLearn<word> 3. Translation workflow 4. Extraction method 5. Extended AutoLearn with selection restrictions 6. Improving accuracy, coverage and availability 7. On-the-fly extraction TeKom Lingenio

3 Company Funded in 1999 Spin-off of the IBM research center Germany located in Heidelberg develops and markets language technology software and services. Core compentence machine translation electronic dictionaries text analysis (morphology, syntax, semantics) Several research projects TeKom Lingenio

4 Translate pro 12.1 Single user versions for professional translators and private use including dictionaries with context-sensitive search functions TeKom Lingenio

5 Translate pro 12.1 Corporate solutions Client/Server networks for workgroups Lingenio Translation Server: web-based solutions for company-wide intranets TeKom Lingenio

6 Translate pro 12.1 Integration via Plug-Ins into Publishing Tools Wordpress, CAT-Tools Trados OmegaT, TeKom Lingenio

7 Translate pro 12.1 Translation Center MS Office Plug-Ins Browser Plug-Ins (IE, Firefox) Pdf translation TeKom Lingenio

8 Translation Center User dictionaries edition, settings Translation Memories selection, settings Automatic extraction of dictionary entries Postediting: Alternative translations Assistant: Unknown words, Statistics, settings, TeKom Lingenio

9 AutoLearn<word> extracts suggestions for dictionary entries from postedited MT Translation memories TeKom Lingenio

10 AutoLearn<word> creates suggestions from postedited text TeKom Lingenio

11 AutoLearn<word> creates suggestions from postedited text TeKom Lingenio

12 AutoLearn<word> creates suggestions from translation memory sentence pairs TeKom Lingenio

13 AutoLearn<word> suggestions extracted from translation memory sentence pairs TeKom Lingenio

14 AutoLearn<word> suggestions relate (potentially) to all parts of speech (nouns, verbs, adjectives, ) include multiword expressions can be selected for integration into active user dictionary TeKom Lingenio

15 AutoLearn<word> suggestions can be added to dictionary single relations or all TeKom Lingenio

16 Dictionary entries assigned to suggestions make use of morpho-syntactic & semantic information & defaults of the MT system can be edited TeKom Lingenio

17 AutoLearn entries adapt the translation to the references extracted TeKom Lingenio

18 AutoLearn<word> extracts suggestions for dictionary entries from postedited MT Translation memories from single sentence pairs complete TMs from bilingual text via Lingenio sentence aligner workflow TeKom Lingenio

19 AutoLearn<word> bilingual texts European insurance regulation TeKom Lingenio

20 AutoLearn<word> align & import into translation memory TeKom Lingenio

21 AutoLearn<word> extract translation suggestions from single sentence pairs TeKom Lingenio

22 AutoLearn<word> extract translation suggestions from single sentence pairs TeKom Lingenio

23 AutoLearn<word> or from complete translation memories TeKom Lingenio

24 AutoLearn<word> from complete translation memories TeKom Lingenio

25 AutoLearn<word> - Extraction method 1. Translation relations from system dictionaries 2. Structures assigned to source and target sentence by the analysis components of the MT system TeKom Lingenio

26 Example Die Lithofazien-Analyse des oberen Teils der Pliozän-Schicht im Valdelsa-Becken (Mittelitalien) hat eine gewisse Anzahl von Umweltablagerungen ergeben, von der Schwemm- zur Küsten- und zur Meeresebene. Lithofacies analysis of the upper part of the Pliocene succession of the Valdelsa basin (central Italy) unravelled a number of depositional environments, ranging from alluvial plain to coastal, to marine TeKom Lingenio

27 Example Die Lithofazien-Analyse der Pliozän-Schicht hat eine gewisse Anzahl von Umweltablagerungen ergeben. Lithofacies analysis of the Pliocene succession unravelled a number of depositional environments TeKom Lingenio

28 Dependence grammar structures TeKom Lingenio

29 Dependence grammar structures + transfer knowledge TeKom Lingenio

30 Dependence grammar structures + transfer knowledge (+ statistics) TeKom Lingenio

31 Dependence grammar structures + transfer knowledge (+ statistics) Derive new relations TeKom Lingenio

32 Dependence grammar structures + transfer knowledge (+ statistics) Derive new relations AutoLearn<word> TeKom Lingenio

33 Do more! Use analysis constraints! syntactic constraints semantic constraints morphological constraints 33

34 Extended AutoLearn with selection restrictions TeKom Lingenio

35 Extended AutoLearn with selection restrictions genitive object constraint direct object constraints 35

36 Extended AutoLearn with selection restrictions extract restrictions Lithofazien-Analyse ergibt Anzahl Umweltablagerungen ~ unravel weaken conditions Analyse ergibt Ablagerung Vorgang ergibt Ergebnis Select conditions by evaluating occurrences in corpora Analyse/Vorgang ergibt Rückstand/ Ergebnis ~ unravel, yield? 36

37 Extended AutoLearn soon: version 12.5 with selection restrictions 37

38 Extended AutoLearn soon: version 12.5 with selection restrictions supporting research: improve accuracy and coverage 38

39 Improving accuracy and coverage EU Marie Curie project (Hybrid high quality machine translation) BMWi project FlexNeuroTrans (Flexible MT for medium-sized businesses using neural nets) combination of rule-based and statistical methods extract information from the internet, TeKom Lingenio

40 AutoLearn<word> information Example: European insurance regulation search bilingual text (on the fly) that suits information requirement For example via Wikipedia, TeKom Lingenio

41 AutoLearn<word> information store & examine extracted texts TeKom Lingenio

42 Availability for translation service (Lingenio Translation Server LTS) for CAT-tools publishing tools (Wordpress,..) Intranet solutions AutoLearn<word> TeKom Lingenio

43 Improving availability for multilingual platforms TeKom Lingenio

44 On-the-fly extraction Text to be processed TeKom Lingenio

45 Summary: products & research 1. version 12.1 AutoLearn<word> (for several parts of speech & multiwords) 2. version 12.5 (soon: with selection restrictions) 3. learning, user dictionaries & memories available for Lingenio Translation Server (for CAT-tools, publishing and intranet) 4. supporting research for improving accuracy, coverage and onthe-fly extraction of translation information TeKom Lingenio

46 Thank you for your attention! Questions? (please visit us at stand 420)

Hybrid Strategies. for better products and shorter time-to-market

Hybrid Strategies. for better products and shorter time-to-market Hybrid Strategies for better products and shorter time-to-market Background Manufacturer of language technology software & services Spin-off of the research center of Germany/Heidelberg Founded in 1999,

More information

PROMT Technologies for Translation and Big Data

PROMT Technologies for Translation and Big Data PROMT Technologies for Translation and Big Data Overview and Use Cases Julia Epiphantseva PROMT About PROMT EXPIRIENCED Founded in 1991. One of the world leading machine translation provider DIVERSIFIED

More information

The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge

The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge White Paper October 2002 I. Translation and Localization New Challenges Businesses are beginning to encounter

More information

Project Management. From industrial perspective. A. Helle M. Herranz. EXPERT Summer School, 2014. Pangeanic - BI-Europe

Project Management. From industrial perspective. A. Helle M. Herranz. EXPERT Summer School, 2014. Pangeanic - BI-Europe Project Management From industrial perspective A. Helle M. Herranz Pangeanic - BI-Europe EXPERT Summer School, 2014 Outline 1 Introduction 2 3 Translation project management without MT Translation project

More information

Computer Aided Translation

Computer Aided Translation Computer Aided Translation Philipp Koehn 30 April 2015 Why Machine Translation? 1 Assimilation reader initiates translation, wants to know content user is tolerant of inferior quality focus of majority

More information

Machine Translation Computer Aided Translation Machine Language Processing

Machine Translation Computer Aided Translation Machine Language Processing Machine Translation Computer Aided Translation Machine Language Processing Martin Kappus (kapm@zhaw.ch) 1 Machine Translation Computer-Aided Translation Agenda Machine Translation Introduction History

More information

Comprendium Translator System Overview

Comprendium Translator System Overview Comprendium System Overview May 2004 Table of Contents 1. INTRODUCTION...3 2. WHAT IS MACHINE TRANSLATION?...3 3. THE COMPRENDIUM MACHINE TRANSLATION TECHNOLOGY...4 3.1 THE BEST MT TECHNOLOGY IN THE MARKET...4

More information

Integration of Content Optimization Software into the Machine Translation Workflow. Ben Gottesman Acrolinx

Integration of Content Optimization Software into the Machine Translation Workflow. Ben Gottesman Acrolinx Integration of Content Optimization Software into the Machine Translation Workflow Ben Gottesman Acrolinx What is Acrolinx? Acrolinx is Content Optimization Software. It helps authors make their text!

More information

WHITE PAPER. Machine Translation of Language for Safety Information Sharing Systems

WHITE PAPER. Machine Translation of Language for Safety Information Sharing Systems WHITE PAPER Machine Translation of Language for Safety Information Sharing Systems September 2004 Disclaimers; Non-Endorsement All data and information in this document are provided as is, without any

More information

Overview of MT techniques. Malek Boualem (FT)

Overview of MT techniques. Malek Boualem (FT) Overview of MT techniques Malek Boualem (FT) This section presents an standard overview of general aspects related to machine translation with a description of different techniques: bilingual, transfer,

More information

M LTO Multilingual On-Line Translation

M LTO Multilingual On-Line Translation O non multa, sed multum M LTO Multilingual On-Line Translation MOLTO Consortium FP7-247914 Project summary MOLTO s goal is to develop a set of tools for translating texts between multiple languages in

More information

JOB BANK TRANSLATION AUTOMATED TRANSLATION SYSTEM. Table of Contents

JOB BANK TRANSLATION AUTOMATED TRANSLATION SYSTEM. Table of Contents JOB BANK TRANSLATION AUTOMATED TRANSLATION SYSTEM Job Bank for Employers Creating a Job Offer Table of Contents Building the Automated Translation System Integration Steps Automated Translation System

More information

The history of machine translation in a nutshell

The history of machine translation in a nutshell 1. Before the computer The history of machine translation in a nutshell 2. The pioneers, 1947-1954 John Hutchins [revised January 2014] It is possible to trace ideas about mechanizing translation processes

More information

LANGUAGE TRANSLATION SOFTWARE EXECUTIVE SUMMARY Language Translation Software Market Shares and Market Forecasts Language Translation Software Market

LANGUAGE TRANSLATION SOFTWARE EXECUTIVE SUMMARY Language Translation Software Market Shares and Market Forecasts Language Translation Software Market LANGUAGE TRANSLATION SOFTWARE EXECUTIVE SUMMARY Language Translation Software Market Shares and Market Forecasts Language Translation Software Market Driving Forces Language Translation Software Cloud

More information

Machine Translation at the European Commission

Machine Translation at the European Commission Directorate-General for Translation Machine Translation at the European Commission Konferenz 10 Jahre Verbmobil Saarbrücken, 16. November 2010 Andreas Eisele Project Manager Machine Translation, ICT Unit

More information

SYSTRAN Enterprise Server 7 Online Tools User Guide. Chapter 1: Overview... 1 SYSTRAN Enterprise Server 7 Overview... 2

SYSTRAN Enterprise Server 7 Online Tools User Guide. Chapter 1: Overview... 1 SYSTRAN Enterprise Server 7 Overview... 2 g SYSTRAN Enterprise Server 7 Online Tools User Guide i Table of Contents Chapter 1: Overview... 1 SYSTRAN Enterprise Server 7 Overview... 2 SYSTRAN Enterprise Server 7 Components... 2 SYSTRAN Translation

More information

Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives

Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives Ramona Enache and Adam Slaski Department of Computer Science and Engineering Chalmers University of Technology and

More information

Automation of Translation: Past, Presence, and Future Karl Heinz Freigang, Universität des Saarlandes, Saarbrücken

Automation of Translation: Past, Presence, and Future Karl Heinz Freigang, Universität des Saarlandes, Saarbrücken Automation of Translation: Past, Presence, and Future Karl Heinz Freigang, Universität des Saarlandes, Saarbrücken Introduction First attempts in "automating" the process of translation between natural

More information

REALIZATION SORTING ALGORITHM USING PARALLEL TECHNOLOGIES bachelor, Mikhelev Vladimir candidate of Science, prof., Sinyuk Vasily

REALIZATION SORTING ALGORITHM USING PARALLEL TECHNOLOGIES bachelor, Mikhelev Vladimir candidate of Science, prof., Sinyuk Vasily 2. В. Гергель: Современные языки и технологии параллельного программирования. М., Изд-во МГУ, 2012. 3. Синюк, В. Г. Алгоритмы и структуры данных. Белгород: Изд-во БГТУ им. В. Г. Шухова, 2013. REALIZATION

More information

ACCURAT Analysis and Evaluation of Comparable Corpora for Under Resourced Areas of Machine Translation www.accurat-project.eu Project no.

ACCURAT Analysis and Evaluation of Comparable Corpora for Under Resourced Areas of Machine Translation www.accurat-project.eu Project no. ACCURAT Analysis and Evaluation of Comparable Corpora for Under Resourced Areas of Machine Translation www.accurat-project.eu Project no. 248347 Deliverable D5.4 Report on requirements, implementation

More information

Translation Solution for

Translation Solution for Translation Solution for Case Study Contents PROMT Translation Solution for PayPal Case Study 1 Contents 1 Summary 1 Background for Using MT at PayPal 1 PayPal s Initial Requirements for MT Vendor 2 Business

More information

PROMT-Adobe Case Study:

PROMT-Adobe Case Study: For Americas: 330 Townsend St., Suite 117, San Francisco, CA 94107 Tel: (415) 913-7586 Fax: (415) 913-7589 promtamericas@promt.com PROMT-Adobe Case Study: For other regions: 16A Dobrolubova av. ( Arena

More information

Question template for interviews

Question template for interviews Question template for interviews This interview template creates a framework for the interviews. The template should not be considered too restrictive. If an interview reveals information not covered by

More information

Glossary of translation tool types

Glossary of translation tool types Glossary of translation tool types Tool type Description French equivalent Active terminology recognition tools Bilingual concordancers Active terminology recognition (ATR) tools automatically analyze

More information

FROM METAL TO T1: SYSTEMS AND COMPONENTS FOR MACHINE TRANSLATION APPLICATIONS

FROM METAL TO T1: SYSTEMS AND COMPONENTS FOR MACHINE TRANSLATION APPLICATIONS FROM METAL TO T1: SYSTEMS AND COMPONENTS FOR MACHINE TRANSLATION APPLICATIONS Ulrike Schwall and Gregor Thurmair Gesellschaft für Multilinguale Systeme mbh Balanstr. 57 D-81541 München Germany ulrike.schwall@gmsmuc.de

More information

Structure of the presentation

Structure of the presentation Terminology and Technical Documentation Translation technology and workflowmanagement Structure of the presentation Source text control Issues in translation and multilingual document management : Case

More information

The Language Grid The Language Grid combines users language resources and machine translators to produce high-quality translation that is customized

The Language Grid The Language Grid combines users language resources and machine translators to produce high-quality translation that is customized The Language Grid The Language Grid combines users language resources and machine translators to produce high-quality translation that is customized to each field. The Language Grid, a software that provides

More information

The Principle of Translation Management Systems

The Principle of Translation Management Systems The Principle of Translation Management Systems Computer-aided translations with the help of translation memory technology deliver numerous advantages. Nevertheless, many enterprises have not yet or only

More information

Your single-source partner for corporate product communication. Transit NXT Evolution. from Service Pack 0 to Service Pack 8

Your single-source partner for corporate product communication. Transit NXT Evolution. from Service Pack 0 to Service Pack 8 Transit NXT Evolution from Service Pack 0 to Service Pack 8 April 2009: Transit NXT Service Pack 0 (Version 4.0.0.671) Additional versions of DTP programs supported: InDesign CS3 and FrameMaker 9 Additional

More information

Translation and Localization Services

Translation and Localization Services Translation and Localization Services Company Overview InterSol, Inc., a California corporation founded in 1996, provides clients with international language solutions. InterSol delivers multilingual solutions

More information

An Interactive Hypertextual Environment for MT Training

An Interactive Hypertextual Environment for MT Training An Interactive Hypertextual Environment for MT Training Etienne Blanc GETA, CLIPS-IMAG BP 53, F-38041 Grenoble cedex 09 Etienne.Blanc@imag.fr Abstract An interactive hypertextual environment for MT training

More information

Machine Translation and the Translator

Machine Translation and the Translator Machine Translation and the Translator Philipp Koehn 8 April 2015 About me 1 Professor at Johns Hopkins University (US), University of Edinburgh (Scotland) Author of textbook on statistical machine translation

More information

Machine Translation as a translator's tool. Oleg Vigodsky Argonaut Ltd. (Translation Agency)

Machine Translation as a translator's tool. Oleg Vigodsky Argonaut Ltd. (Translation Agency) Machine Translation as a translator's tool Oleg Vigodsky Argonaut Ltd. (Translation Agency) About Argonaut Ltd. Documentation translation (Telecom + IT) from English into Russian, since 1992 Main customers:

More information

Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features

Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features , pp.273-280 http://dx.doi.org/10.14257/ijdta.2015.8.4.27 Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features Lirong Qiu School of Information Engineering, MinzuUniversity of

More information

SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 SYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 CWMT2011 技 术 报 告

SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 SYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 CWMT2011 技 术 报 告 SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 Jin Yang and Satoshi Enoue SYSTRAN Software, Inc. 4444 Eastgate Mall, Suite 310 San Diego, CA 92121, USA E-mail:

More information

MOVING MACHINE TRANSLATION SYSTEM TO WEB

MOVING MACHINE TRANSLATION SYSTEM TO WEB MOVING MACHINE TRANSLATION SYSTEM TO WEB Abstract GURPREET SINGH JOSAN Dept of IT, RBIEBT, Mohali. Punjab ZIP140104,India josangurpreet@rediffmail.com The paper presents an overview of an online system

More information

Processing: current projects and research at the IXA Group

Processing: current projects and research at the IXA Group Natural Language Processing: current projects and research at the IXA Group IXA Research Group on NLP University of the Basque Country Xabier Artola Zubillaga Motivation A language that seeks to survive

More information

SYSTRAN v6 Quick Start Guide

SYSTRAN v6 Quick Start Guide SYSTRAN v6 Quick Start Guide 2 SYSTRAN v6 Quick Start Guide SYSTRAN Business Translator SYSTRAN Premium Translator Congratulations on your SYSTRAN v6 purchase which uses the latest generation of language

More information

Multilingual Term Extraction as a Service from Acrolinx. Ben Gottesman Michael Klemme Acrolinx CHAT2013

Multilingual Term Extraction as a Service from Acrolinx. Ben Gottesman Michael Klemme Acrolinx CHAT2013 Multilingual Term Extraction as a Service from Acrolinx Ben Gottesman Michael Klemme Acrolinx CHAT2013 Definitions term extraction: automatically identifying potential terms in a document (corpus) multilingual

More information

SYSTRAN 6 Desktop User Guide. Chapter 1: Overview... 1 SYSTRAN Desktop Products Overview... 2

SYSTRAN 6 Desktop User Guide. Chapter 1: Overview... 1 SYSTRAN Desktop Products Overview... 2 SYSTRAN 6 Desktop User Guide i Table of Contents Chapter 1: Overview... 1 SYSTRAN Desktop Products Overview... 2 SYSTRAN 6 Product Comparison... 2 Special Terms Used in this Guide... 4 About Language Translation

More information

Learning Translation Rules from Bilingual English Filipino Corpus

Learning Translation Rules from Bilingual English Filipino Corpus Proceedings of PACLIC 19, the 19 th Asia-Pacific Conference on Language, Information and Computation. Learning Translation s from Bilingual English Filipino Corpus Michelle Wendy Tan, Raymond Joseph Ang,

More information

The history of machine translation in a nutshell

The history of machine translation in a nutshell The history of machine translation in a nutshell John Hutchins [Web: http://ourworld.compuserve.com/homepages/wjhutchins] [Latest revision: November 2005] 1. Before the computer 2. The pioneers, 1947-1954

More information

Fully Automatic High Quality Machine Translation of. Restricted Text: A Case Study

Fully Automatic High Quality Machine Translation of. Restricted Text: A Case Study [Translating and the Computer 28, November 2006 (London: Aslib, 2006)] Fully Automatic High Quality Machine Translation of Restricted Text: A Case Study Uwe Muegge (uwe.muegge@medtronic.com, uwe.muegge@miis.edu)

More information

Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast

Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Hassan Sawaf Science Applications International Corporation (SAIC) 7990

More information

Statistical Machine Translation

Statistical Machine Translation Statistical Machine Translation Some of the content of this lecture is taken from previous lectures and presentations given by Philipp Koehn and Andy Way. Dr. Jennifer Foster National Centre for Language

More information

Localization Profile 2014

Localization Profile 2014 Localization Profile 2014 January 2014 Localization Profile 1 www.esrinea.com Copyright Notice 2014 Esri Northeast Africa. All rights reserved. No part of this document may be reproduced or transmitted

More information

Multilingual and mixed-lingual TTS applications

Multilingual and mixed-lingual TTS applications Multilingual and mixed-lingual TTS applications LangTech 2003 November 24, 2003 Simona Fina, Manager Linguistics Real-life texts need mixed-lingual analysis Agenda Short presentation of SVOX Challenges

More information

4. Clause combining 2

4. Clause combining 2 Informática Aplicada a la Traducción Building and Using Translation Memories 4.1 What is a Parall Corpus A Parall Corpus consists of a set of sentences (or other segments of text) in one language, each

More information

Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg

Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg March 1, 2007 The catalogue is organized into sections of (1) obligatory modules ( Basismodule ) that

More information

How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.

How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD. Svetlana Sokolova President and CEO of PROMT, PhD. How the Computer Translates Machine translation is a special field of computer application where almost everyone believes that he/she is a specialist.

More information

TAUS 2015. Membership Program (Executive Overview) write to memberservices@taus.net to request the 35 pages detailed service overview. www.taus.

TAUS 2015. Membership Program (Executive Overview) write to memberservices@taus.net to request the 35 pages detailed service overview. www.taus. TAUS 2015 Membership Program (Executive Overview) write to memberservices@taus.net to request the 35 pages detailed service overview www.taus.net Five Reasons to be a TAUS Member 1. Access the collaborative

More information

Transit NXT Product Guide Service Pack 7 09/2013

Transit NXT Product Guide Service Pack 7 09/2013 Transit NXT Product Guide Service Pack 7 09/2013 Initial overview In this document, you will find out which product variant and which installation type suits your requirements. You are at the right place

More information

Integra(on of human and machine transla(on. Marcello Federico Fondazione Bruno Kessler MT Marathon, Prague, Sept 2013

Integra(on of human and machine transla(on. Marcello Federico Fondazione Bruno Kessler MT Marathon, Prague, Sept 2013 Integra(on of human and machine transla(on Marcello Federico Fondazione Bruno Kessler MT Marathon, Prague, Sept 2013 Motivation Human translation (HT) worldwide demand for translation services has accelerated,

More information

TRANSLATION OF TELUGU-MARATHI AND VICE- VERSA USING RULE BASED MACHINE TRANSLATION

TRANSLATION OF TELUGU-MARATHI AND VICE- VERSA USING RULE BASED MACHINE TRANSLATION TRANSLATION OF TELUGU-MARATHI AND VICE- VERSA USING RULE BASED MACHINE TRANSLATION Dr. Siddhartha Ghosh 1, Sujata Thamke 2 and Kalyani U.R.S 3 1 Head of the Department of Computer Science & Engineering,

More information

Resource and Workflow Management Support in Teletranslation

Resource and Workflow Management Support in Teletranslation [Translating and the Computer 21. Proceedings 10-11 November 1999 (London: Aslib)] Resource and Workflow Management Support in Teletranslation Klaus Schubert Fachhochschule Flensburg Abstract Value-added

More information

A web-based multilingual help desk

A web-based multilingual help desk LTC-Communicator: A web-based multilingual help desk Nigel Goffe The Language Technology Centre Ltd Kingston upon Thames Abstract Software vendors operating in international markets face two problems:

More information

Customizing an English-Korean Machine Translation System for Patent Translation *

Customizing an English-Korean Machine Translation System for Patent Translation * Customizing an English-Korean Machine Translation System for Patent Translation * Sung-Kwon Choi, Young-Gil Kim Natural Language Processing Team, Electronics and Telecommunications Research Institute,

More information

Composing Human and Machine Translation Services: Language Grid for Improving Localization Processes

Composing Human and Machine Translation Services: Language Grid for Improving Localization Processes Composing Human and Machine Translation Services: Language Grid for Improving Localization Processes Donghui Lin, Yoshiaki Murakami, Toru Ishida, Yohei Murakami, Masahiro Tanaka National Institute of Information

More information

Introduction. Philipp Koehn. 28 January 2016

Introduction. Philipp Koehn. 28 January 2016 Introduction Philipp Koehn 28 January 2016 Administrativa 1 Class web site: http://www.mt-class.org/jhu/ Tuesdays and Thursdays, 1:30-2:45, Hodson 313 Instructor: Philipp Koehn (with help from Matt Post)

More information

Word Completion and Prediction in Hebrew

Word Completion and Prediction in Hebrew Experiments with Language Models for בס"ד Word Completion and Prediction in Hebrew 1 Yaakov HaCohen-Kerner, Asaf Applebaum, Jacob Bitterman Department of Computer Science Jerusalem College of Technology

More information

Statistical Machine Translation

Statistical Machine Translation Statistical Machine Translation What works and what does not Andreas Maletti Universität Stuttgart maletti@ims.uni-stuttgart.de Stuttgart May 14, 2013 Statistical Machine Translation A. Maletti 1 Main

More information

KantanMT.com. www.kantanmt.com. The world s #1 MT Platform. No Hardware. No Software. No Hassle MT.

KantanMT.com. www.kantanmt.com. The world s #1 MT Platform. No Hardware. No Software. No Hassle MT. KantanMT.com No Hardware. No Software. No Hassle MT. The world s #1 MT Platform Communicate globally, easily! Create customized language solutions in the cloud. www.kantanmt.com What is KantanMT.com? KantanMT

More information

FUNDAMENTAL TECHNOLOGIES FOR IP TRANSLATION SERVICES

FUNDAMENTAL TECHNOLOGIES FOR IP TRANSLATION SERVICES FUNDAMENTAL TECHNOLOGIES FOR IP TRANSLATION SERVICES 2 The strength of intellectual property documents, including international patent applications, relies in part on the quality of the translation. While

More information

Joint efforts to further develop and incorporate Apertium into the document management flow at Universitat Oberta de Catalunya

Joint efforts to further develop and incorporate Apertium into the document management flow at Universitat Oberta de Catalunya Joint efforts to further develop and incorporate Apertium into the document management flow at Universitat Oberta de Catalunya Luis Villarejo*, Sergio Ortiz** and Mireia Ginestí** *Learning Technologies

More information

EURESCOM Project BabelWeb Multilingual Web Sites: Best Practice Guidelines and Architectures

EURESCOM Project BabelWeb Multilingual Web Sites: Best Practice Guidelines and Architectures EURESCOM Project BabelWeb Multilingual Web Sites: Best Practice Guidelines and Architectures Elisabeth den Os, Lou Boves While the importance of the World Wide Web as a source of information and a vehicle

More information

Neural Machine Transla/on for Spoken Language Domains. Thang Luong IWSLT 2015 (Joint work with Chris Manning)

Neural Machine Transla/on for Spoken Language Domains. Thang Luong IWSLT 2015 (Joint work with Chris Manning) Neural Machine Transla/on for Spoken Language Domains Thang Luong IWSLT 2015 (Joint work with Chris Manning) Neural Machine Transla/on (NMT) End- to- end neural approach to MT: Simple and coherent. Achieved

More information

State of affairs today ALL THESE CAN BE TRUE!!!! We tried MT but it was not good. Because of MT, our revenues increased by 17%

State of affairs today ALL THESE CAN BE TRUE!!!! We tried MT but it was not good. Because of MT, our revenues increased by 17% Identifying the best opportunities to use Machine Translation to address your Big Language Needs Daniel Marcu Chief Science Officer Agenda The machine translation landscape Content to consider for machine

More information

ADVANTAGES AND DISADVANTAGES OF TRANSLATION MEMORY: A COST/BENEFIT ANALYSIS by Lynn E. Webb BA, San Francisco State University, 1992 Submitted in

ADVANTAGES AND DISADVANTAGES OF TRANSLATION MEMORY: A COST/BENEFIT ANALYSIS by Lynn E. Webb BA, San Francisco State University, 1992 Submitted in : A COST/BENEFIT ANALYSIS by Lynn E. Webb BA, San Francisco State University, 1992 Submitted in partial satisfaction of the requirements for the Degree of MASTER OF ARTS in Translation of German Graduate

More information

Translation Management System

Translation Management System Translation Management System LINGOTEK PRODUCT BRIEF Table of Contents Current Lingotek Customers...2 The Challenge of Translation and Localization Today...3 Lingotek s Complete Offering...4 The Lingotek

More information

Natural Language Database Interface for the Community Based Monitoring System *

Natural Language Database Interface for the Community Based Monitoring System * Natural Language Database Interface for the Community Based Monitoring System * Krissanne Kaye Garcia, Ma. Angelica Lumain, Jose Antonio Wong, Jhovee Gerard Yap, Charibeth Cheng De La Salle University

More information

TechWatch. Technology and Market Observation powered by SMILA

TechWatch. Technology and Market Observation powered by SMILA TechWatch Technology and Market Observation powered by SMILA PD Dr. Günter Neumann DFKI, Deutsches Forschungszentrum für Künstliche Intelligenz GmbH, Juni 2011 Goal - Observation of Innovations and Trends»

More information

Machine vs. Human Translation Scott Bass, Advanced Language Translation Inc.

Machine vs. Human Translation Scott Bass, Advanced Language Translation Inc. Machine vs. Human Translation Scott Bass, Advanced Language Translation Inc. Using computers to translate text from one language to another (referred to as machine translation [MT]) no longer faces the

More information

Getting Off to a Good Start: Best Practices for Terminology

Getting Off to a Good Start: Best Practices for Terminology Getting Off to a Good Start: Best Practices for Terminology Technologies for term bases, term extraction and term checks Angelika Zerfass, zerfass@zaac.de Tools in the Terminology Life Cycle Extraction

More information

Living, Working, Breathing the toolset How Alpha CRC has incorporated memoq in its production process

Living, Working, Breathing the toolset How Alpha CRC has incorporated memoq in its production process Living, Working, Breathing the toolset How Alpha CRC has incorporated memoq in its production process GT 12/02/2014 Topics covered What are the global L10n challenges to solve? What are the clientside

More information

XTM for Language Service Providers Explained

XTM for Language Service Providers Explained XTM for Language Service Providers Explained 1. Introduction There is a new generation of Computer Assisted Translation (CAT) tools available based on the latest Web 2.0 technology. These systems are more

More information

Cross-Language Information Retrieval by Domain Restriction using Web Directory Structure

Cross-Language Information Retrieval by Domain Restriction using Web Directory Structure Cross-Language Information Retrieval by Domain Restriction using Web Directory Structure Fuminori Kimura Faculty of Culture and Information Science, Doshisha University 1 3 Miyakodani Tatara, Kyoutanabe-shi,

More information

Testing Data-Driven Learning Algorithms for PoS Tagging of Icelandic

Testing Data-Driven Learning Algorithms for PoS Tagging of Icelandic Testing Data-Driven Learning Algorithms for PoS Tagging of Icelandic by Sigrún Helgadóttir Abstract This paper gives the results of an experiment concerned with training three different taggers on tagged

More information

Online free translation services

Online free translation services [Translating and the Computer 24: proceedings of the International Conference 21-22 November 2002, London (Aslib, 2002)] Online free translation services Thei Zervaki tzervaki@hotmail.com Introduction

More information

Web-based automatic translation: the Yandex.Translate API

Web-based automatic translation: the Yandex.Translate API Maarten van Hees m.van.hees@umail.leidenuniv.nl Web-based automatic translation: the Yandex.Translate API Paulina Kozłowska pk.kozlowska@gmail.com Nana Tian n.tian.2@umail.leidenuniv.nl ABSTRACT Yandex.Translate

More information

GATE Mímir and cloud services. Multi-paradigm indexing and search tool Pay-as-you-go large-scale annotation

GATE Mímir and cloud services. Multi-paradigm indexing and search tool Pay-as-you-go large-scale annotation GATE Mímir and cloud services Multi-paradigm indexing and search tool Pay-as-you-go large-scale annotation GATE Mímir GATE Mímir is an indexing system for GATE documents. Mímir can index: Text: the original

More information

Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines

Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines , 22-24 October, 2014, San Francisco, USA Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines Baosheng Yin, Wei Wang, Ruixue Lu, Yang Yang Abstract With the increasing

More information

Software Cost. Discounted STS Rate Units Total $0.00 $0.00 $0.00 $0.00 Total $0.00

Software Cost. Discounted STS Rate Units Total $0.00 $0.00 $0.00 $0.00 Total $0.00 Cost Form This cost form has been provided to assist respondents in submitting costs associated by deliverable. Remember that all costs are to be the firm, fixed price of the deliverable and project total.

More information

Intel s Localization BUS Initiative To XLIFF or not to XLIFF. Loïc Dufresne de Virel Localization Strategist

Intel s Localization BUS Initiative To XLIFF or not to XLIFF. Loïc Dufresne de Virel Localization Strategist Intel s Localization BUS Initiative To XLIFF or not to XLIFF Loïc Dufresne de Virel Localization Strategist Intel Information Technology DISCLAIMERS I m presenting here what we are doing Still a work in

More information

How To Write A Multilingual Web Conference

How To Write A Multilingual Web Conference CMS and localisation Multilingual Web Conference Multilingual Web content management Limerick. September 21st, 2011 Page 1 ID Cocomore essentials Agency for integrated communication and IT services 100

More information

Convergence of Translation Memory and Statistical Machine Translation

Convergence of Translation Memory and Statistical Machine Translation Convergence of Translation Memory and Statistical Machine Translation Philipp Koehn and Jean Senellart 4 November 2010 Progress in Translation Automation 1 Translation Memory (TM) translators store past

More information

STATE OF VERMONT. Secretary of Administration ORIGINAL POLICY ADOPTED BY STC DATE: STATUTORY REFERENCE Policy for Web Look and Feel Requirements

STATE OF VERMONT. Secretary of Administration ORIGINAL POLICY ADOPTED BY STC DATE: STATUTORY REFERENCE Policy for Web Look and Feel Requirements STATE OF VERMONT Agency of Administration STANDARD STC State Technology Collaborative ORIGINAL POLICY ADOPTED BY STC DATE: EFFECTIVE DATE ORIGINAL POLICY NUMBER ASSOCIATED DOCUMENTS Standards for Usability

More information

Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words

Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words , pp.290-295 http://dx.doi.org/10.14257/astl.2015.111.55 Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words Irfan

More information

Reducing Total Cost and Risk in Multilingual Litigation A VIA Legal Brief

Reducing Total Cost and Risk in Multilingual Litigation A VIA Legal Brief Reducing Total Cost and Risk in Multilingual Litigation A VIA Legal Brief Most ediscovery exercises are based on the assumption that the documents of interest are in English. However, in today s globalized

More information

COMPUTATIONAL DATA ANALYSIS FOR SYNTAX

COMPUTATIONAL DATA ANALYSIS FOR SYNTAX COLING 82, J. Horeck~ (ed.j North-Holland Publishing Compa~y Academia, 1982 COMPUTATIONAL DATA ANALYSIS FOR SYNTAX Ludmila UhliFova - Zva Nebeska - Jan Kralik Czech Language Institute Czechoslovak Academy

More information

BILINGUAL TRANSLATION SYSTEM

BILINGUAL TRANSLATION SYSTEM BILINGUAL TRANSLATION SYSTEM (FOR ENGLISH AND TAMIL) Dr. S. Saraswathi Associate Professor M. Anusiya P. Kanivadhana S. Sathiya Abstract--- The project aims in developing Bilingual Translation System for

More information

Recent developments in machine translation policy at the European Patent Office

Recent developments in machine translation policy at the European Patent Office Recent developments in machine translation policy at the European Patent Office Dr Georg Artelsmair Director European Co-operation European Patent Office Brussels, 17 November 2010 The European Patent

More information

Automatische Übersetzung zwischen Hype und Realität. Automatic Translation between Hype and Reality

Automatische Übersetzung zwischen Hype und Realität. Automatic Translation between Hype and Reality Automatische Übersetzung zwischen Hype und Realität Automatic Translation between Hype and Reality Hans Uszkoreit Outline! Applications of MT The MT Hype MT Yesterday: Machine Translation in Verbmobil

More information

Using the BNC to create and develop educational materials and a website for learners of English

Using the BNC to create and develop educational materials and a website for learners of English Using the BNC to create and develop educational materials and a website for learners of English Danny Minn a, Hiroshi Sano b, Marie Ino b and Takahiro Nakamura c a Kitakyushu University b Tokyo University

More information

How To Translate English To Yoruba Language To Yoranuva

How To Translate English To Yoruba Language To Yoranuva International Journal of Language and Linguistics 2015; 3(3): 154-159 Published online May 11, 2015 (http://www.sciencepublishinggroup.com/j/ijll) doi: 10.11648/j.ijll.20150303.17 ISSN: 2330-0205 (Print);

More information

Introduction to formal semantics -

Introduction to formal semantics - Introduction to formal semantics - Introduction to formal semantics 1 / 25 structure Motivation - Philosophy paradox antinomy division in object und Meta language Semiotics syntax semantics Pragmatics

More information

LINGSTAT: AN INTERACTIVE, MACHINE-AIDED TRANSLATION SYSTEM*

LINGSTAT: AN INTERACTIVE, MACHINE-AIDED TRANSLATION SYSTEM* LINGSTAT: AN INTERACTIVE, MACHINE-AIDED TRANSLATION SYSTEM* Jonathan Yamron, James Baker, Paul Bamberg, Haakon Chevalier, Taiko Dietzel, John Elder, Frank Kampmann, Mark Mandel, Linda Manganaro, Todd Margolis,

More information

HIERARCHICAL HYBRID TRANSLATION BETWEEN ENGLISH AND GERMAN

HIERARCHICAL HYBRID TRANSLATION BETWEEN ENGLISH AND GERMAN HIERARCHICAL HYBRID TRANSLATION BETWEEN ENGLISH AND GERMAN Yu Chen, Andreas Eisele DFKI GmbH, Saarbrücken, Germany May 28, 2010 OUTLINE INTRODUCTION ARCHITECTURE EXPERIMENTS CONCLUSION SMT VS. RBMT [K.

More information

Automated Online English -Arabic Translator

Automated Online English -Arabic Translator Faculty of Engineering & Architecture Electrical & Computer Engineering Department Final Year Project - Spring 2006 Automated Online English -Arabic Translator Supervisor: Prof. Ali El Hajj Report Grader:

More information

a Chinese-to-Spanish rule-based machine translation

a Chinese-to-Spanish rule-based machine translation Chinese-to-Spanish rule-based machine translation system Jordi Centelles 1 and Marta R. Costa-jussà 2 1 Centre de Tecnologies i Aplicacions del llenguatge i la Parla (TALP), Universitat Politècnica de

More information

LetsMT!: A Cloud-Based Platform for Do-It-Yourself Machine Translation

LetsMT!: A Cloud-Based Platform for Do-It-Yourself Machine Translation LetsMT!: A Cloud-Based Platform for Do-It-Yourself Machine Translation Andrejs Vasiļjevs Raivis Skadiņš Jörg Tiedemann TILDE TILDE Uppsala University Vienbas gatve 75a, Riga Vienbas gatve 75a, Riga Box

More information