ONLINE TRANSLATION SERVICES FOR THE LAO LANGUAGE

Size: px
Start display at page:

Download "ONLINE TRANSLATION SERVICES FOR THE LAO LANGUAGE"

Transcription

1 ONLINE TRANSLATION SERVICES FOR THE LAO LANGUAGE Vincent BERMENT GETA-CLIPS (IMAG) BP Grenoble Cedex 9, France INALCO 2, rue de Lille Paris Cedex 7, France ABSTRACT The students learning Laotian or French language, the professional or occasional translators, all have the same few means for helping them in their translations task. Those means are the often frozen, old and tiresome to use dictionaries, like Marc Reinhorn s Lao-French dictionary or Pierre Somchine Nginn s French- Lao dictionary. This paper presents LaoTrans, a set of online translation support services between Lao and French (both directions), that has all the flexibility and evolutionarity that the new Information and Communication Technologies (ICT) can bring. RÉSUMÉ Les étudiants laotiens apprenant le français ou français apprenant le laotien, les traducteurs professionnels ou occasionnels disposent de peu de moyens pour les aider dans leur tâche. Il s agit des dictionnaires souvent anciens, figés et dont l utilisation est fastidieuse comme le dictionnaire laotien-français de Marc Reinhorn ou comme celui, français-laotien, de Pierre Somchine Nginn. Le présent article présente LaoTrans, un ensemble de services d aide à la traduction en ligne entre français et lao, qui dispose de toute la souplesse et de l évolutivité que permettent aujourd hui les nouvelles Technologies de l Information et des Communications (TIC). KEYWORDS Machine translation, translation support tool, pi-language, π-language, Lao language, language computerization, unsegmented writing system, syllabic segmentation, word segmentation, lexical database, electronic dictionary, LaoTrans, LaoWord, LaoUniKey, Lao Software INTRODUCTION Thanks to a continuous effort made over the last 30 years, the Lao system of writing has now achieved a satisfactory level of computerization: mature fonts, text input tools and word processors are now available and the Lao Unicode is used over the Internet. So a lot of people is now waiting for what may be looked as a natural next step of this computerization effort: a high quality machine translation service that would allow the Lao people to read English, Chinese, Japanese, French or whatsoever language, translated in their own one. In contrast with the pioneering age where isolated individuals could offer complete solutions, the much more complex machine translation step must rely on groups of linguists and computer scientists to get through. For example, the order of magnitude for producing a good quality Lao-English / English-Lao machine translation system is estimated to 50 men-years (Lafourcade, 1994), only for the lingware, when an add-in such as LaoWord which provides a full set of word processing functions only requires about 6 men-years. So even if the methods used are well adapted to the computerization of under-resourced languages or, as they are called now, π-languages (Berment, 2004), it may take time before a good translation software is available for the Lao language. 1 / 11

2 On the way of this highly desirable future, an interesting translation support can be brought to professional and occasional translators by simpler tools mostly based on the technology developed for word processors. In this paper, we present LaoTrans, a set of online translation services that offers, for the first time, a translation support on the (dictionary, word for word translation of texts). Doing so, we will see how the reuse of a technology previously developed dramatically eased the realization of this new software. Then, looking towards the future, we will also show how the LaoTrans experience can contribute to the design of a full machine translation service. We will also introduce LaoUniKey, a text input tool, and LaoLex, an online collaborative service for building dictionaries. I FUNCTIONAL DESCRIPTION OF LAOTRANS LaoTrans is a set of two online services which are available, among other software dedicated to the Lao language, at the following address: I.1 TRANSLATION OF WORDS: LAOTRANS-WORD The first of the two services is called LaoTrans-Word. It is a look-up dictionary that provides both Lao-French and French-Lao translations for isolated words. Figure 1: LaoTrans-Word translates ແປ into French (example) Using this service from Lao to French, the tool displays all the possible French translations as shown in the figure above for the word ແປ. When using the dictionary in the other direction, from French to Lao, the engine looks for all the translations that include the French word, for example the following translations will be given for chaussure (see next page). 2 / 11

3 Figure 2 : LaoTrans-Word translates chaussure into Lao (example) This is very convenient to find all the occurrences of a word including, for example, plural or conjugated forms. Actually, this was done so because the original lexicon used in LaoTrans is a Lao- French dictionary in which an entry like ກ ງ is translated by the periphrasis l'autre partie d'une paire and another like ເກ ບ is translated by the plural forms souliers and chaussures. So it appeared natural to search the French word not as an exact translation but only as any sub-chain. In both directions, Lao-French and French-Lao, the user can investigate further his translated words as they are displayed with hyperlinks. For example, when the user will click on the first translation given for chaussure (ເກ ບ), LaoTrans-Word will return the three following results. Figure 3 : LaoTrans-Word translates ເກ ບ into plural forms (example) 3 / 11

4 I.2 TRANSLATION OF TEXTS: LAOTRANS-TEXT The second service offered by LaoTrans is more complex as it aims at easing the reading and the translation of a full piece of Lao text ( active reading support tool). With this service, the user receives a word for word translation in response to his Lao text as shown in the figure below. As you can see in this figure, a word which has more than one translation in the dictionary will be displayed with the list of all its translations separated with / characters. Figure 4 : LaoTrans-Text translates a Lao text into a list of its words translated into French Obviously, this is not yet a complete machine translation system with production of sentences. But it is also obvious that the human user will easily and quickly produce his own translation thanks to the translations he got for the words. 4 / 11

5 I.3 TEXTS INPUT IN UNICODE: LAOUNIKEY Unicode is an initiative for standardizing the representation in computers of all the writing systems in the world. Though the Lao writing system is part of the Unicode standard since June 1992 (Unicode version 1.01, see Unicode fonts that work properly for Lao only appeared a few years ago 1. So, after a long period during which Lao Unicode remained almost unused, we now observe that more and more sites adopt Unicode so we are confident that it will become THE standard in a close future and that Lao texts will at last be readable on any computer in the world. In LaoTrans, all the Lao texts are encoded in Unicode. As Lao text input is still not natively present in Microsoft Windows (people in Laos mainly use Windows 98, Me and XP), the users need to rely on additional software to type their texts. LaoUniKey (ລາວຢ ນ ກ ) is such a software. It is available from the same site as LaoTrans: This software is very flexible (formats, hotkeys...) and its human-machine interface is trilingual: Lao, French and English. Figure 5 : LaoUniKey configuration window (Lao language selected) Alternatively, the text can also be copied from an existing document, and in particular from sites using the Unicode encoding. 1 In May 2001, we developed one of the first Unicode font which was properly working for Lao. It was presented during the Papillon seminar in Tokyo in May 2002 where it was used to show the Lao language in PapiLex, a mockup of collaborative online Unicode dictionary. See (Berment 2002) and (Berment 2004). The most popular fonts at the moment are done by John Durdin (e.g. Saysettha OT). 5 / 11

6 I.4 DICTIONARY CONTENT: THE REUSE OF LINGWARE In order to offer a translation service of good quality, the software itself has to be good but it s not enough. A good dictionary is also a key issue as it has to contain as many word-senses and as little mistakes as possible. Fortunately, the dictionaries made by teachers or by linguists with means such as word processors can often be recycled to become lexical databases. This process can be helped by tools such as Recupdic in which the textual structure of the dictionary is formally described in order to automate its transformation into a lexical base (Doan-Nguyen, 1998). In the case of LaoTrans, we could recycle a Lao-French lexicon of about 15,000 word-senses that was created by Paul Jadin 2 who originally typed it in Excel. Its structure had two great advantages: The entries and their translations were already separated in two different columns, The entries corresponding to two different word-senses were in two different rows 3. This 15,000 entries dictionary happens to provide a good coverage for the test texts. The process to transform Paul Jadin s Excel file into a lexical database followed the three steps: Save the Excel file into a.csv file, Import this.csv file into a MySQL table by using phpmyadmin 4, Transforming the proprietary encoding of the Lao entries into Unicode 5. If we now look towards the future, we may wonder what is likely to be reusable in this translation support stuff. Actually, we think that the dictionary is the most essential element to reuse. The current step will have driven to a correct and large dictionary over a period of several months. This will be even more evident with the interactive use of LaoTrans and LaoLex as demonstrated in chapter III. The improvement, by adding lexical content (examples...) to the entries, participates to the lingware s development. So this enriched and improved dictionary can be later used for itself to do the lexical transfer of a machine translation system. Moreover, it can also be used together with LaoTrans and a corpus to provide a help for analyzing and understanding the morphosyntactic structure of the Lao language. 2 Paul Jadin is an independent lexicographer who lives in Belgium. We got in contact thanks to the Internet. This last remark highlights the essential fact that Internet changed the way people are working together, and especially people involved in π-language computerization because they are very few and scattered around the world. The collaboration between Paul Jadin and the author is a typical example as we actually never physically met until now. 3 For example, the word ແປ with the meanings Traduire and Plat were in two different rows. 4 After selecting the table in the database, the Structure page is displayed. This page proposes to Insert data from a text file into table at its bottom. This can be used in particular for inserting a.csv file. For more details, see and 5 This last action is handled by a C++ program deriving from LaoWord which can transform a text from any given encoding into any other, and in particular Unicode. This C++ program is called by a PHP code. 6 / 11

7 II ARCHITECTURE AND ALGORITHMS: THE REUSE OF SOFTWARE II.1 LAOTRANS-WORD SOFTWARE ARCHITECTURE AND ALGORITHMS The LaoTrans-Word software is running on a Linux server. The languages used are PHP and C++, and the dictionary is stored in a MySQL database. The architecture is shown in the figure below. Client Form Response LaoTrans Code (PHP) Request Lexical Data Lexical Base (MySQL) Characters Chain to Process Response LaoProc Code (C++) Server Figure 6 : LaoTrans-Word: Architecture The browser ( client) sends the word to translate to the server in a form (see Figure 1). The principle of the operation in the server is summarized in the following drawing. Client Form Server Response Canonic formatting (LaoProc) Word retrieval (LaoTrans/Lexical base) Response construction (LaoTrans) Figure 7 : LaoTrans-Word: Processing sequence First, the server transforms the received word into a canonical format (for example ໜ ຫນ 6 ) which is the format of the Lao headwords in the database. Then, it simply searches the standardized word in the database and returns the response to the client. 6 For more details, see (Berment, 2004). 7 / 11

8 II.2 LAOTRANS-TEXT SOFTWARE ARCHITECTURE AND ALGORITHMS From a main bricks point of view, the structure of LaoTrans-Text is the same as the one of LaoTrans-Word. The differences are that LaoProc also has to segment the text into syllables and to compute a phonological transcription for each of them. Then the PHP code of LaoTrans associates the syllables together to form words by using a longest matching algorithm (see (Rarunrom, 1991)). Client Form Server Response Canonic formatting (LaoProc) Syllabic segmentation (LaoProc) Phonological transcription (LaoProc) Word retrieval (LaoTrans/Lexical base) Response construction (LaoTrans) Figure 8 : LaoTrans-Text: Processing sequence II.3 GAIN REACHED BY REUSING AN EXISTING SOURCE CODE Though the three linguistic algorithms (canonic formatting, syllabic segmentation and phonological transcription) are not obvious, their part in the development of LaoProc did not even take 15 hours, thanks to the reuse of the source code of LaoWord (Berment, 2003a). Total Three linguistic modules LaoWord > 4,000 h 2,500 h LaoProc 300 h < 15 h Figure 9 : Compared development efforts 7 The gain 8 reached for the three linguistic modules is greater than 99 % (the rest of the two software programs is not comparable so talking of gain would not be relevant). The global gain for the LaoProc program is then (2, ) / 2, %. The same order of magnitude was already observed for LaoUniKey which also derives from LaoWord ( 93 % for the linguistic modules) as well as for other software reuse experiences (Berment, 2004). We may observe that this software reuse was possible because the programming language (C++) can be compiled on both Windows and Linux platforms (LaoWord is running on Windows and LaoProc on a Linux server). 7 Development time, not including feasibility studies, mockups, design & test documentation, user s manuals, distribution, support and maintenance. 8 This gain is defined by the formula : (T S T A ) / T S, with T S = development time without reuse and T A = development time with reuse. 8 / 11

9 III DICTIONARY ENRICHMENT: LAOLEX As any other dictionary, the dictionary used in LaoTrans may contain mistakes and entries may also be missing. The LaoLex software is designed to add and modify lexical entries in online dictionaries. Though this existing software was first presented in 2003, it has not been upgraded yet to handle Unicode. However, as it will soon be available as a natural complement to LaoTrans, we will say a few words here about its capabilities. For more details, see (Berment and Thongvilu 2003), (Berment et al. 2003) and (Berment 2003b). Figure 10 : Creation of an entry in LaoLex An efficient manner to improve the existing dictionary is to couple LaoTrans together with LaoLex dynamically. This means that every time a recorded user asks for a translation and the result does not give full satisfaction, he may modify or add entries in the dictionary 9. The main dictionary will not be modified directly by LaoLex. Only the personal dictionaries of the recorded users will. However, when the content of a personal dictionary becomes significant, it can be checked by the linguist in charge of the main dictionary and then be added. This collaborating way of building the main dictionary can be very efficient as the users themselves do guarantee its lexical quality and completeness. 9 This coupled mechanism is foreseen to be included in the future implementation of LaoLex. 9 / 11

10 As you can see in Figure 10, LaoLex can not only enrich the headwords and their translations (currently the only available data in the dictionary) but also the lexical content of the existing entries. The main additional lexical items are: Old spellings, Alternative spellings, Part of speech, Language level, Specifier, Definition (with its translation), Examples (with their translations), Idioms (with their translations), Comments. Every Lao word in the dictionary can be associated with a category (ປະເພດ). This category, or part of speech, is attributed according to syntactical criteria (semantic and morphological criteria also participate to categorization). In LaoLex, we use a description in three levels: A group level, A category level, A subcategory level. The first level, called group because its elements group together several categories, coincides with the seven parts of speech of the traditional grammar (Comité Littéraire, 1962): Nouns (ຄຳນາມ), Pronouns (ຄຳສ ບພະນາມ / ຄຳສ ພນາມ), Verbs (ຄຳກ ຣ ຍາ), Predicatives 10 (ຄຳວ ເສດ), Prepositions (ຄຳບ ບພະບ ດ / ຄຳບ ພບ ດ), Conjunctions (ຄຳສ ນທານ), Interjections (ຄຳອ ທານ). The second and third levels mostly derive from Marc Reinhorn's works completed by Lamvieng Inthamone's. See (Berment et al. 2003). The levels of language are also important in the Lao language. We are using the following ones: General use Respectful Colloquial Slangy Specialty Refined Monk Royal Literary Spoken Archaic 10 Term borrowed from Marc Reinhorn s grammar (Reinhorn, 1975). 10 / 11

11 REFERENCES LIST Berment Vincent Several Technical Issues for Building New Lexical Bases. Papillon Seminar, July 15 18, 2002, NII, Hitotsubashi, Chiyoda-ku, Tokyo, Japan. Berment Vincent, Thongvilu Houmphanh Cooperative Lao ICT framework. Case study: construction of Lao lexical resources. Regional Conference on Digital GMS, February 26 28, 2003, Asian Institute of Technology, Bangkok, Thailand. Berment Vincent, Jacqmin Thakkhinh, Dechanet Blandine Parts of Speech for the LaoLex Dictionary. Pan Asia Networking All Partners 2003, March 3 10, 2003, Vientiane Novotel, Vientiane, Laos. Berment Vincent. 2003a. LaoWord's Word Processing Functions. Pan Asia Networking All Partners 2003, March 3 10, 2003, Vientiane Novotel, Vientiane, Laos. Berment Vincent. 2003b. Current status of the Papillon-Lao database: Tools (LaoLex), dictionary (LaoDict), XML schema and export towards Papillon. Papillon Seminar, July 3 5, 2003, Sapporo University, Sapporo, Japan. Berment Vincent Méthodes pour informatiser des langues et des groupes de langues «peu dotées». Ph.D. thesis, Grenoble University, France. Comité Littéraire (ກ ມວ ນນະຄະດ ) Lao grammar (ໄວຍາກອນລາວ). Doan-Nguyen Hai Techniques génériques d accumulation d ensembles lexicaux structurés à partir de ressources dictionnairiques informatisées multilingues hétérogènes. Ph.D. thesis, Grenoble University, France. Jadin Paul Dictionnaire laotien-français. Non publié. Lafourcade Mathieu Génie logiciel pour le génie linguiciel. Ph.D. thesis, Grenoble University, France. Nginn Pierre Somchine Dictionnaire français-lao. Idase, Paris. Rarunrom Sampan Dictionary-based Thai word separation,. Senior project report, Thailand. Reinhorn Marc Dictionnaire laotien-français. CNRS. Reinhorn Marc Grammaire de la langue lao. INALCO. 11 / 11

An In-Context and Collaborative Software Localisation Model: Demonstration

An In-Context and Collaborative Software Localisation Model: Demonstration An In-Context and Collaborative Software Localisation Model: Demonstration Amel FRAISSE Christian BOITET Valérie BELLYNCK LABORATOIRE LIG, Université Joseph Fourier, 41 rue des Mathématiques, 38041 Grenoble,

More information

Processing: current projects and research at the IXA Group

Processing: current projects and research at the IXA Group Natural Language Processing: current projects and research at the IXA Group IXA Research Group on NLP University of the Basque Country Xabier Artola Zubillaga Motivation A language that seeks to survive

More information

How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.

How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD. Svetlana Sokolova President and CEO of PROMT, PhD. How the Computer Translates Machine translation is a special field of computer application where almost everyone believes that he/she is a specialist.

More information

Comprendium Translator System Overview

Comprendium Translator System Overview Comprendium System Overview May 2004 Table of Contents 1. INTRODUCTION...3 2. WHAT IS MACHINE TRANSLATION?...3 3. THE COMPRENDIUM MACHINE TRANSLATION TECHNOLOGY...4 3.1 THE BEST MT TECHNOLOGY IN THE MARKET...4

More information

A web-based multilingual help desk

A web-based multilingual help desk LTC-Communicator: A web-based multilingual help desk Nigel Goffe The Language Technology Centre Ltd Kingston upon Thames Abstract Software vendors operating in international markets face two problems:

More information

Overview of MT techniques. Malek Boualem (FT)

Overview of MT techniques. Malek Boualem (FT) Overview of MT techniques Malek Boualem (FT) This section presents an standard overview of general aspects related to machine translation with a description of different techniques: bilingual, transfer,

More information

INSTITUT FEMTO-ST. Webservices Platform for Tourism (WICHAI)

INSTITUT FEMTO-ST. Webservices Platform for Tourism (WICHAI) INSTITUT FEMTO-ST UMR CNRS 6174 Webservices Platform for Tourism (WICHAI) Kitsiri Chochiang Fouad Hanna Marie-Laure Betbeder Jean-Christophe Lapayre Rapport Technique n RTDISC2015-1 DÉPARTEMENT DISC October

More information

How to translate your website. An overview of the steps to take if you are about to embark on a website localization project.

How to translate your website. An overview of the steps to take if you are about to embark on a website localization project. How to translate your website An overview of the steps to take if you are about to embark on a website localization project. Getting Started Translating websites can be an expensive and complex process.

More information

Module 9. Lesson 9:00 La Culture. Le Minitel. Can you guess what these words mean? surfer le net. chatter. envoyer un mail. télécharger.

Module 9. Lesson 9:00 La Culture. Le Minitel. Can you guess what these words mean? surfer le net. chatter. envoyer un mail. télécharger. Module 9 Lesson 9:00 La Culture Le Minitel The Minitel was a machine used so people could access: Can you guess what these words mean? surfer le net chatter envoyer un mail télécharger être en ligne les

More information

The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge

The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge White Paper October 2002 I. Translation and Localization New Challenges Businesses are beginning to encounter

More information

Online free translation services

Online free translation services [Translating and the Computer 24: proceedings of the International Conference 21-22 November 2002, London (Aslib, 2002)] Online free translation services Thei Zervaki [email protected] Introduction

More information

The Language Grid The Language Grid combines users language resources and machine translators to produce high-quality translation that is customized

The Language Grid The Language Grid combines users language resources and machine translators to produce high-quality translation that is customized The Language Grid The Language Grid combines users language resources and machine translators to produce high-quality translation that is customized to each field. The Language Grid, a software that provides

More information

Unrealized Gains in Stocks from the Viewpoint of Investment Risk Management

Unrealized Gains in Stocks from the Viewpoint of Investment Risk Management Unrealized Gains in Stocks from the Viewpoint of Investment Risk Management Naoki Matsuyama Investment Administration Department, The Neiji Mutual Life Insurance Company, 1-1 Marunouchi, 2-chome, Chiyoda-ku,

More information

Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words

Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words , pp.290-295 http://dx.doi.org/10.14257/astl.2015.111.55 Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words Irfan

More information

Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features

Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features , pp.273-280 http://dx.doi.org/10.14257/ijdta.2015.8.4.27 Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features Lirong Qiu School of Information Engineering, MinzuUniversity of

More information

MODULE 7: TECHNOLOGY OVERVIEW. Module Overview. Objectives

MODULE 7: TECHNOLOGY OVERVIEW. Module Overview. Objectives MODULE 7: TECHNOLOGY OVERVIEW Module Overview The Microsoft Dynamics NAV 2013 architecture is made up of three core components also known as a three-tier architecture - and offers many programming features

More information

Points of Interference in Learning English as a Second Language

Points of Interference in Learning English as a Second Language Points of Interference in Learning English as a Second Language Tone Spanish: In both English and Spanish there are four tone levels, but Spanish speaker use only the three lower pitch tones, except when

More information

Semantic annotation of requirements for automatic UML class diagram generation

Semantic annotation of requirements for automatic UML class diagram generation www.ijcsi.org 259 Semantic annotation of requirements for automatic UML class diagram generation Soumaya Amdouni 1, Wahiba Ben Abdessalem Karaa 2 and Sondes Bouabid 3 1 University of tunis High Institute

More information

June 2016 Language and cultural workshops In-between session workshops à la carte June 13-25 2 weeks All levels

June 2016 Language and cultural workshops In-between session workshops à la carte June 13-25 2 weeks All levels June 2016 Language and cultural workshops In-between session workshops à la carte June 13-25 2 weeks All levels We have designed especially for you a new set of language and cultural workshops to focus

More information

Automatic Text Analysis Using Drupal

Automatic Text Analysis Using Drupal Automatic Text Analysis Using Drupal By Herman Chai Computer Engineering California Polytechnic State University, San Luis Obispo Advised by Dr. Foaad Khosmood June 14, 2013 Abstract Natural language processing

More information

Glossary of translation tool types

Glossary of translation tool types Glossary of translation tool types Tool type Description French equivalent Active terminology recognition tools Bilingual concordancers Active terminology recognition (ATR) tools automatically analyze

More information

Multi language e Discovery Three Critical Steps for Litigating in a Global Economy

Multi language e Discovery Three Critical Steps for Litigating in a Global Economy Multi language e Discovery Three Critical Steps for Litigating in a Global Economy 2 3 5 6 7 Introduction e Discovery has become a pressure point in many boardrooms. Companies with international operations

More information

Master of Arts in Linguistics Syllabus

Master of Arts in Linguistics Syllabus Master of Arts in Linguistics Syllabus Applicants shall hold a Bachelor s degree with Honours of this University or another qualification of equivalent standard from this University or from another university

More information

Lesson 7. Les Directions (Disc 1 Track 12) The word crayons is similar in English and in French. Oui, mais où sont mes crayons?

Lesson 7. Les Directions (Disc 1 Track 12) The word crayons is similar in English and in French. Oui, mais où sont mes crayons? Lesson 7 Les Directions (Disc 1 Track 12) Listen to the audio and follow along. Où sont mes crayons? Where are your crayons? Yes, where are my crayons? The word crayons is similar in English and in French.

More information

But have you ever wondered how to create your own website?

But have you ever wondered how to create your own website? Foreword We live in a time when websites have become part of our everyday lives, replacing newspapers and books, and offering users a whole range of new opportunities. You probably visit at least a few

More information

Terminology Extraction from Log Files

Terminology Extraction from Log Files Terminology Extraction from Log Files Hassan Saneifar, Stéphane Bonniol, Anne Laurent, Pascal Poncelet, Mathieu Roche To cite this version: Hassan Saneifar, Stéphane Bonniol, Anne Laurent, Pascal Poncelet,

More information

Collecting Polish German Parallel Corpora in the Internet

Collecting Polish German Parallel Corpora in the Internet Proceedings of the International Multiconference on ISSN 1896 7094 Computer Science and Information Technology, pp. 285 292 2007 PIPS Collecting Polish German Parallel Corpora in the Internet Monika Rosińska

More information

MODELLING, AUTHORING AND PUBLISHING THE DOCUMENT ANALYSIS LEARNING OBJECT

MODELLING, AUTHORING AND PUBLISHING THE DOCUMENT ANALYSIS LEARNING OBJECT MODELLING, AUTHORING AND PUBLISHING THE DOCUMENT ANALYSIS LEARNING OBJECT Alexandre Flament ([email protected]) Emmanuelle Villiot-Leclercq ([email protected]) Laboratoire CLIPS,

More information

SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 SYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 CWMT2011 技 术 报 告

SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 SYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 CWMT2011 技 术 报 告 SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 Jin Yang and Satoshi Enoue SYSTRAN Software, Inc. 4444 Eastgate Mall, Suite 310 San Diego, CA 92121, USA E-mail:

More information

User Manual Culturethèque Thailand

User Manual Culturethèque Thailand User Manual Culturethèque Thailand Contents: 1. What can one do at Culturethèque?... 3 2. How to access Culturethèque?... 4 3. How to search on Culturethèque?... 5 4. How to access documents on Culturethèque?...

More information

XTM for Language Service Providers Explained

XTM for Language Service Providers Explained XTM for Language Service Providers Explained 1. Introduction There is a new generation of Computer Assisted Translation (CAT) tools available based on the latest Web 2.0 technology. These systems are more

More information

AN INTERACTIVE ON-LINE MACHINE TRANSLATION SYSTEM (CHINESE INTO ENGLISH)

AN INTERACTIVE ON-LINE MACHINE TRANSLATION SYSTEM (CHINESE INTO ENGLISH) [From: Translating and the Computer, B.M. Snell (ed.), North-Holland Publishing Company, 1979] AN INTERACTIVE ON-LINE MACHINE TRANSLATION SYSTEM (CHINESE INTO ENGLISH) Shiu-Chang LOH and Luan KONG Hung

More information

A Survey of Online Tools Used in English-Thai and Thai-English Translation by Thai Students

A Survey of Online Tools Used in English-Thai and Thai-English Translation by Thai Students 69 A Survey of Online Tools Used in English-Thai and Thai-English Translation by Thai Students Sarathorn Munpru, Srinakharinwirot University, Thailand Pornpol Wuttikrikunlaya, Srinakharinwirot University,

More information

Elizabethtown Area School District French III

Elizabethtown Area School District French III Elizabethtown Area School District French III Course Number: 610 Length of Course: 18 weeks Grade Level: 10-12 Elective Total Clock Hours: 120 Length of Period: 80 minutes Date Written: Spring 2009 Periods

More information

Natural Language to Relational Query by Using Parsing Compiler

Natural Language to Relational Query by Using Parsing Compiler Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,

More information

Financial Literacy Resource French As a Second Language: Core French Grade 9 Academic FSF 1D ARGENT EN ACTION! Connections to Financial Literacy

Financial Literacy Resource French As a Second Language: Core French Grade 9 Academic FSF 1D ARGENT EN ACTION! Connections to Financial Literacy Financial Literacy Resource French As a Second Language: Core French Grade 9 Academic FSF 1D ARGENT EN ACTION! Connections to Financial Literacy Although none of the expectations in the French As a Second

More information

English-Japanese machine translation

English-Japanese machine translation [Information processing: proceedings of the International Conference on Information Processing, Unesco, Paris 1959] 163 English-Japanese machine translation By S. Takahashi, H. Wada, R. Tadenuma and S.

More information

Sun Management Center Change Manager 1.0.1 Release Notes

Sun Management Center Change Manager 1.0.1 Release Notes Sun Management Center Change Manager 1.0.1 Release Notes Sun Microsystems, Inc. 4150 Network Circle Santa Clara, CA 95054 U.S.A. Part No: 817 0891 10 May 2003 Copyright 2003 Sun Microsystems, Inc. 4150

More information

MODEL DRIVEN DEVELOPMENT OF BUSINESS PROCESS MONITORING AND CONTROL SYSTEMS

MODEL DRIVEN DEVELOPMENT OF BUSINESS PROCESS MONITORING AND CONTROL SYSTEMS MODEL DRIVEN DEVELOPMENT OF BUSINESS PROCESS MONITORING AND CONTROL SYSTEMS Tao Yu Department of Computer Science, University of California at Irvine, USA Email: [email protected] Jun-Jang Jeng IBM T.J. Watson

More information

Master of Arts (French Studies) M.A. (French Studies)

Master of Arts (French Studies) M.A. (French Studies) Master of Arts (French Studies) M.A. (French Studies) Objectives The program aims producing M.A. graduates who: - have a good knowledge of the French political system, economy, society and culture and

More information

AP FRENCH LANGUAGE 2008 SCORING GUIDELINES

AP FRENCH LANGUAGE 2008 SCORING GUIDELINES AP FRENCH LANGUAGE 2008 SCORING GUIDELINES Part A (Essay): Question 31 9 Demonstrates STRONG CONTROL Excellence Ease of expression marked by a good sense of idiomatic French. Clarity of organization. Accuracy

More information

Translation and Localization Services

Translation and Localization Services Translation and Localization Services Company Overview InterSol, Inc., a California corporation founded in 1996, provides clients with international language solutions. InterSol delivers multilingual solutions

More information

Parsing Technology and its role in Legacy Modernization. A Metaware White Paper

Parsing Technology and its role in Legacy Modernization. A Metaware White Paper Parsing Technology and its role in Legacy Modernization A Metaware White Paper 1 INTRODUCTION In the two last decades there has been an explosion of interest in software tools that can automate key tasks

More information

White Paper. Translation Quality - Understanding factors and standards. Global Language Translations and Consulting, Inc. Author: James W.

White Paper. Translation Quality - Understanding factors and standards. Global Language Translations and Consulting, Inc. Author: James W. White Paper Translation Quality - Understanding factors and standards Global Language Translations and Consulting, Inc. Author: James W. Mentele 1 Copyright 2008, All rights reserved. Executive Summary

More information

Web. Studio. Visual Studio. iseries. Studio. The universal development platform applied to corporate strategy. Adelia. www.hardis.

Web. Studio. Visual Studio. iseries. Studio. The universal development platform applied to corporate strategy. Adelia. www.hardis. Web Studio Visual Studio iseries Studio The universal development platform applied to corporate strategy Adelia www.hardis.com The choice of a CASE tool does not only depend on the quality of the offer

More information

Detection of water leakage using laser images from 3D laser scanning data

Detection of water leakage using laser images from 3D laser scanning data Detection of water leakage using laser images from 3D laser scanning data QUANHONG FENG 1, GUOJUAN WANG 2 & KENNERT RÖSHOFF 3 1 Berg Bygg Konsult (BBK) AB,Ankdammsgatan 20, SE-171 43, Solna, Sweden (e-mail:[email protected])

More information

MULTIFUNCTIONAL DICTIONARIES

MULTIFUNCTIONAL DICTIONARIES In: A. Zampolli, A. Capelli (eds., 1984): The possibilities and limits of the computer in producing and publishing dictionaries. Linguistica Computationale III, Pisa: Giardini, 279-288 MULTIFUNCTIONAL

More information

Blaenau Gwent Schools Year 8 Scheme of Work. UNIT 1 January April 2012

Blaenau Gwent Schools Year 8 Scheme of Work. UNIT 1 January April 2012 Blaenau Gwent Schools Year 8 Scheme of Work UNIT 1 January April 2012 Objectives To express likes and dislikes and give reasons To make considered comparisons in English as well as the TL and give reasons

More information

Natural Language Database Interface for the Community Based Monitoring System *

Natural Language Database Interface for the Community Based Monitoring System * Natural Language Database Interface for the Community Based Monitoring System * Krissanne Kaye Garcia, Ma. Angelica Lumain, Jose Antonio Wong, Jhovee Gerard Yap, Charibeth Cheng De La Salle University

More information

Syntactic Theory. Background and Transformational Grammar. Dr. Dan Flickinger & PD Dr. Valia Kordoni

Syntactic Theory. Background and Transformational Grammar. Dr. Dan Flickinger & PD Dr. Valia Kordoni Syntactic Theory Background and Transformational Grammar Dr. Dan Flickinger & PD Dr. Valia Kordoni Department of Computational Linguistics Saarland University October 28, 2011 Early work on grammar There

More information

Voice Driven Animation System

Voice Driven Animation System Voice Driven Animation System Zhijin Wang Department of Computer Science University of British Columbia Abstract The goal of this term project is to develop a voice driven animation system that could take

More information

Gaumont Selects SYSTRAN to Translate its French Online Newsreel Catalogue for the Global Marketplace. By, Andrew Joscelyne

Gaumont Selects SYSTRAN to Translate its French Online Newsreel Catalogue for the Global Marketplace. By, Andrew Joscelyne Gaumont Selects SYSTRAN to Translate its French Online Newsreel Catalogue for the Global Marketplace By, Andrew Joscelyne October 2002 When Stuart McKay, a freelance film archive researcher, was looking

More information

PROMT Technologies for Translation and Big Data

PROMT Technologies for Translation and Big Data PROMT Technologies for Translation and Big Data Overview and Use Cases Julia Epiphantseva PROMT About PROMT EXPIRIENCED Founded in 1991. One of the world leading machine translation provider DIVERSIFIED

More information

DRH specification framework

DRH specification framework DRH specification framework 2007-03-15 EDM - NIED Takeshi KAWAMOTO, Hiroaki NEGISHI, Mitsuaki SASAKI 1 DRH Basic Development before Sep. 2007 Server architectures Search architectures Multilanguage Architectures

More information

Structural and Semantic Indexing for Supporting Creation of Multilingual Web Pages

Structural and Semantic Indexing for Supporting Creation of Multilingual Web Pages Structural and Semantic Indexing for Supporting Creation of Multilingual Web Pages Hiroshi URAE, Taro TEZUKA, Fuminori KIMURA, and Akira MAEDA Abstract Translating webpages by machine translation is the

More information

Elena Baralis, Silvia Chiusano Politecnico di Torino. Pag. 1. Query optimization. DBMS Architecture. Query optimizer. Query optimizer.

Elena Baralis, Silvia Chiusano Politecnico di Torino. Pag. 1. Query optimization. DBMS Architecture. Query optimizer. Query optimizer. DBMS Architecture INSTRUCTION OPTIMIZER Database Management Systems MANAGEMENT OF ACCESS METHODS BUFFER MANAGER CONCURRENCY CONTROL RELIABILITY MANAGEMENT Index Files Data Files System Catalog BASE It

More information

I will explain to you in English why everything from now on will be in French

I will explain to you in English why everything from now on will be in French I will explain to you in English why everything from now on will be in French Démarche et Outils REACHING OUT TO YOU I will explain to you in English why everything from now on will be in French All French

More information

Elizabethtown Area School District French II

Elizabethtown Area School District French II Elizabethtown Area School District French II Course Number: 605 Length of Course: 18 weeks Grade Level: 9-12 Total Clock Hours: 120 Length of Period: 80 minutes Date Written: Spring 2009 Periods per Week/Cycle:

More information

Your single-source partner for corporate product communication. Transit NXT Evolution. from Service Pack 0 to Service Pack 8

Your single-source partner for corporate product communication. Transit NXT Evolution. from Service Pack 0 to Service Pack 8 Transit NXT Evolution from Service Pack 0 to Service Pack 8 April 2009: Transit NXT Service Pack 0 (Version 4.0.0.671) Additional versions of DTP programs supported: InDesign CS3 and FrameMaker 9 Additional

More information

Jet Data Manager 2012 User Guide

Jet Data Manager 2012 User Guide Jet Data Manager 2012 User Guide Welcome This documentation provides descriptions of the concepts and features of the Jet Data Manager and how to use with them. With the Jet Data Manager you can transform

More information

AP FRENCH LANGUAGE AND CULTURE EXAM 2015 SCORING GUIDELINES

AP FRENCH LANGUAGE AND CULTURE EXAM 2015 SCORING GUIDELINES AP FRENCH LANGUAGE AND CULTURE EXAM 2015 SCORING GUIDELINES Identical to Scoring Guidelines used for German, Italian, and Spanish Language and Culture Exams Interpersonal Writing: E-mail Reply 5: STRONG

More information

Synergies FLE http://www.france-synergies.org

Synergies FLE http://www.france-synergies.org Digiclass - Linking the Foreign Language Student to a World of Information URSULA ATKINSON As Program Coordinator at the Rutgers University Language Institute, Ursula Atkinson develops, administers and

More information

DHI a.s. Na Vrsich 51490/5, 100 00, Prague 10, Czech Republic ( [email protected], [email protected] )

DHI a.s. Na Vrsich 51490/5, 100 00, Prague 10, Czech Republic ( t.metelka@dhi.cz, z.svitak@dhi.cz ) NOVATECH Rehabilitation strategies in wastewater networks as combination of operational, property and model information Stratégies de réhabilitation des réseaux d'égouts combinant des données d exploitation,

More information

BILAL W S SHAFEI. An-Najah National University LRC / French department An-Najah Street. P.O. Box 07 Nablus Palestine

BILAL W S SHAFEI. An-Najah National University LRC / French department An-Najah Street. P.O. Box 07 Nablus Palestine BILAL W S SHAFEI An-Najah National University LRC / French department An-Najah Street. P.O. Box 07 Nablus Palestine +970 599292310 [email protected] or +970 9 2345113 882225 [email protected] Educational

More information

TRANSLATION OF TELUGU-MARATHI AND VICE- VERSA USING RULE BASED MACHINE TRANSLATION

TRANSLATION OF TELUGU-MARATHI AND VICE- VERSA USING RULE BASED MACHINE TRANSLATION TRANSLATION OF TELUGU-MARATHI AND VICE- VERSA USING RULE BASED MACHINE TRANSLATION Dr. Siddhartha Ghosh 1, Sujata Thamke 2 and Kalyani U.R.S 3 1 Head of the Department of Computer Science & Engineering,

More information

Course Scheduling Support System

Course Scheduling Support System Course Scheduling Support System Roy Levow, Jawad Khan, and Sam Hsu Department of Computer Science and Engineering, Florida Atlantic University Boca Raton, FL 33431 {levow, jkhan, samh}@fau.edu Abstract

More information

EFL Learners Synonymous Errors: A Case Study of Glad and Happy

EFL Learners Synonymous Errors: A Case Study of Glad and Happy ISSN 1798-4769 Journal of Language Teaching and Research, Vol. 1, No. 1, pp. 1-7, January 2010 Manufactured in Finland. doi:10.4304/jltr.1.1.1-7 EFL Learners Synonymous Errors: A Case Study of Glad and

More information

Courtesy Translation

Courtesy Translation Direction centrale de la sécurité des systèmes d information Protection Profile Electronic Signature Creation Application Date : July 17th, 2008 Reference : Version : 1.6 Courtesy Translation Courtesy

More information

edoc Document Generation Suite

edoc Document Generation Suite e Doc Suite is a set of Microsoft Office add-ins for Word, Excel & PowerPoint that lets you use your data in MS Office with ease. Creating simple flat tables from data sources is possible in MS Office,

More information

The Development of Multimedia-Multilingual Document Storage, Retrieval and Delivery System for E-Organization (STREDEO PROJECT)

The Development of Multimedia-Multilingual Document Storage, Retrieval and Delivery System for E-Organization (STREDEO PROJECT) The Development of Multimedia-Multilingual Storage, Retrieval and Delivery for E-Organization (STREDEO PROJECT) Asanee Kawtrakul, Kajornsak Julavittayanukool, Mukda Suktarachan, Patcharee Varasrai, Nathavit

More information

Quoi de neuf? (What s in the news?)

Quoi de neuf? (What s in the news?) Prior Knowledge: It is helpful if children already know how to give reasons for their opinions. Objectives Engage in conversations; ask and answer questions; express opinions and respond to those of others;

More information

EURESCOM - P923 (Babelweb) PIR.3.1

EURESCOM - P923 (Babelweb) PIR.3.1 Multilingual text processing difficulties Malek Boualem, Jérôme Vinesse CNET, 1. Introduction Users of more and more applications now require multilingual text processing tools, including word processors,

More information

NIVEAU 1 - DEBUTANT. Issu de la méthode Language in Use des Editions Cambridge University Press

NIVEAU 1 - DEBUTANT. Issu de la méthode Language in Use des Editions Cambridge University Press NIVEAU 1 - DEBUTANT Issu de la méthode Language in Use des Editions Cambridge University Press Reflex English Niveau 1 s adresse aux débutants mais aussi à ceux qui souhaitent revoir les bases grammaticales

More information

Credit Cards: Disentangling the Dual Use of Borrowing and Spending

Credit Cards: Disentangling the Dual Use of Borrowing and Spending Staff Analytical Note/Note analytique du personnel 2015-3 Credit Cards: Disentangling the Dual Use of Borrowing and Spending by Olga Bilyk and Brian Peterson Bank of Canada staff analytical notes are short

More information

Modern foreign languages

Modern foreign languages Modern foreign languages Programme of study for key stage 3 and attainment targets (This is an extract from The National Curriculum 2007) Crown copyright 2007 Qualifications and Curriculum Authority 2007

More information

Formation à l ED STIC ED STIC Doctoral education. Hanna Klaudel

Formation à l ED STIC ED STIC Doctoral education. Hanna Klaudel Formation à l ED STIC ED STIC Doctoral education Hanna Klaudel Texte de référence / Text of low L arrêté de 7 août 2006 : «Les écoles doctorales proposent aux doctorants les formations utiles à leur projet

More information

FrenchPod101.com Learn French with FREE Podcasts

FrenchPod101.com Learn French with FREE Podcasts Newbie Lesson A Rose by Any Other Name Would Smell as Sweet 1 Formal French 2 Formal English 2 Informal French 2 Informal English 2 Vocabulary 2 Grammar Points 4 Cultural Insight 4 Formal French Bonjour

More information

Data Deduplication in Slovak Corpora

Data Deduplication in Slovak Corpora Ľ. Štúr Institute of Linguistics, Slovak Academy of Sciences, Bratislava, Slovakia Abstract. Our paper describes our experience in deduplication of a Slovak corpus. Two methods of deduplication a plain

More information

CNC Kompetenzzenter mit der multimedialen Software

CNC Kompetenzzenter mit der multimedialen Software CNC Kompetenzzenter mit der multimedialen Software LLP-LdV-TOI-2009-EE-001 1 Information sur le projet Titre: Code Projet: Année: 2009 Type de Projet: Statut: Pays: Accroche marketing: Résumé: CNC Kompetenzzenter

More information

TS3: an Improved Version of the Bilingual Concordancer TransSearch

TS3: an Improved Version of the Bilingual Concordancer TransSearch TS3: an Improved Version of the Bilingual Concordancer TransSearch Stéphane HUET, Julien BOURDAILLET and Philippe LANGLAIS EAMT 2009 - Barcelona June 14, 2009 Computer assisted translation Preferred by

More information

CURRICULUM VITAE Studies Positions Distinctions Research interests Research projects

CURRICULUM VITAE Studies Positions Distinctions Research interests Research projects 1 CURRICULUM VITAE ABEILLÉ Anne Address : LLF, UFRL, Case 7003, Université Paris 7, 2 place Jussieu, 75005 Paris Tél. 33 1 57 27 57 67 Fax 33 1 57 27 57 88 [email protected] http://www.llf.cnrs.fr/fr/abeille/

More information

Archived Content. Contenu archivé

Archived Content. Contenu archivé ARCHIVED - Archiving Content ARCHIVÉE - Contenu archivé Archived Content Contenu archivé Information identified as archived is provided for reference, research or recordkeeping purposes. It is not subject

More information

Tissot Pierre-Henri 2006-2007. Recommended sites. Presentations : Online Activities

Tissot Pierre-Henri 2006-2007. Recommended sites. Presentations : Online Activities Recommended sites Irregular Verbs : THE List http://www.e-anglais.com/ressources/verbes_irreguliers.html Presentations : a genre approach http://iteslj.org/techniques/webster-oralpresentations.html Tips

More information