MACHINE TRANSLATION: A SURVEY.
|
|
|
- Gwen Melton
- 9 years ago
- Views:
Transcription
1 PROFESSIONAL COMMUNICATION AND TRANSLA TION STUDIES MACHINE TRANSLATION: A SURVEY. loan - Lucian POPA 8acau University, Romania 1. Introduction This paper aims at providing the reader with a comprehensive survey of the human quest for devising software for the translation of natural languages. This human quest resulted in what is called 'machine translation' (MT). It has left its imprint on the evolution of linguistics and computer science in the latter half of the 20 th century and has an ever increasing impact on the developments in the fields in the early days of the 21 st. Ideally, the objective of MT would be to bring together all the information necessary for translation in one computer program, so that a text could be translated without human intervention. MT should not be mistaken for CAT (computer-assisted translation) that employs a variety of computer-based translation tools i.e. electronic dictionaries and glossaries, terminology management systems, translation databases, and translator workstations to assist the translator's in his/her work and ensure accuracy and speed, i.e. quality and quantity. The evolution of human society has made it necessary to find solutions for more efficient methods of translation as the demand for translations cannot be met because there are not enough human translators and also because of a series of misconceptions about translation. Human translation is expensive since it requires very serious training, the productivity of the human translator is essentially limited (2,000 words per day) and the processes it needs to duplicate are complex. According to W. John Hutchins (2006), one can identify three major types of MT: (1) machine translation for dissemination (MTd), (2) machine translation for assimilation (MTa), and (3) machine translation for communication (MTc). In the case of MTd, the quality of the translation is paramount. Such translations, after an initial MT processing, must be subjected to a process of editing by human translators, with a view to obtaining the optimum quality. This mode is identified as human-aided machine translation (HAMT). In the cases of MTa and MTc the quality is of secondary importance. 2. Beginnings The origins of MT can be traced back to the seventeenth century ideas of universal and philosophical languages and of 'mechanical' dictionaries; nevertheless, the first practical solutions were suggested in the 20 th century, in 1933, by two individuals: Georges Artsrouni in France, and Piotr Smirnov-Trojanskij in Russia. Artsrouni's idea was that of a general-purpose machine which could also be utilized as a mechanical multwngual dictionary. Trajanskifs idea '.'Vas also a mechanical dictionaryi but he developed the initial idea with proposals for coding and interpreting grammatical functions using sets of universal symbols with a view to creating a multilingual translation device. In 1946 and 1947, Andrew Booth and Warren Weaver suggested that the newly invented computers could be used for translating natural languages. In 1948, Booth cooperated with Richard H. Richens (Cambridge, UK) on a model of morphological analysis for a mechanical dictionary.
2 152 PROFESSIONAL COMMUNICATION AND TRANSLATION STUDIES 2008 In July 1949, Warren Weaver (Rockefeller Foundation) approached the specific problem of ambiguity and offered solutions that were based on his knowledge of cryptography, statistics, Information theory, logic and language universals. In May 1951, the philosopher, mathematician and linguist Yehoshua Bar-Hillel started full-time research on MT at the Massachusetts Institute of Technology, USA, and he organized, in June 1952, the first MT conference. This was a seminal moment as debates made it clear that full automation of translation and optimum results were impossible, and that human involvement either before or after computer processes, i.e. pre-editing and post-editing, respectively, would be indispensable. On January 7, 1954 Leon Dostert (Georgetown University) joined efforts with Peter Sheridan (IBM) and Paul Garvin (Georgetown), in a project which resulted in the first demonstration of an MT system. The demonstration was sufficiently impressive to stimulate large-scale funding of MT research in the USA and trigger the launch of MT projects al over the world. In the same year, the journal Mechanical Translation was initiated by William Locke and by Victor Yngve; it would assemble in its pages some of the most significant papers on the topic until In 1954 and 1955, MT research groups were set up in Cambridge, England, (Margaret Masterman), in Milan (Silvio Ceccato), then the first Soviet, Chinese, and Japanese projects were initiated. In 1955, the first MT book was published, Machine Translation of Languages, a collection edited by A.D. Booth and W.N. Locke. 3. Breaking new ground The period was dedicated to breaking new ground; the expectations were high, the developments were extremely numerous and we will have to limit ourselves to mentioning only the most important ones. MT research, in its early days, could barely rely upon linguistics and the state it was in then and, consequently, the research methods ranged from empirical trial-anderror approaches to what was to become 'computational linguistics'. There were three basic approaches to MT in the decade under scrutiny. The 'direct translation' model (the programming rules were conceived and developed for translation from one source language (SL), into a target language (TL), with a minimal amount of linguistic theory) led to an oversimplification that is not characteristic of natural languages. This model was favored especially by empiricists. They resorted to statistical analyses of actual texts with a view to identify dictionary rules that were all too frequent "of an ad hoc nature, with little or no theoretical foundation" (Hutchins, 2006). The so-called interlinguaf approach or pivot language strategy concentrates "on a trade or transfer of meaning based on an analysis of one language pair alone" (Gross, 1992) and operates on the theory that one can devise a set of abstract language-neutral representations such as codes or symbols independent of both SL and TL, Le. an intermediate 'Ianguage'- in at least one case, a form of Esperanto. Such a language can encode sufficient iinguistic information to serve as a universai intermediate stage - Or" pivot point - enabling translation back and forth between numerous pairs of languages, despite linguistic or cultural differences. The third approach was the transfer approach: texts underwent a conversion through a transfer stage from disambiguated representations of SL texts to equivalent TL representations; translation consisted of three phases: analysis (it describes the SL document linguistically and utilizes an SL dictionary), transfer (it transforms the results of
3 PROFESSIONAL COMMUNICATION AND TRANSLATION STUDIES the analysis stage and establishes the linguistic and structural equivalents between the two languages), and generation or synthesis (it produces a document in the TL on the strength of the linguistic data of the SL using a TL dictionary). The 'interlingua' model and the transfer approach were favored by various groups of researchers; the more pragmatic groups postponed the solution of semantic problems. In Europe and the Soviet Union, research groups were compelled to concentrate on theoretical issues due to the inadequacy of the technical means. Erwin Reifler and his research group (University of Washington, Seattle) developed the dictionary-based 'direct' approach that entailed the compilation of large bilingual dictionaries where lexicographic information was not used solely for identifying TL lexical equivalents for SL items but also for solving grammatical problems without the use of syntactic analysis. Another direction was that of developing methods based on the analysis of language corpora. A group of researchers at the RAND Corporation carried out statistical analyses of a large corpus of Russian physics texts, compiled bilingual glossaries, and gathered grammatical information that were later used for a translation computer program. Researchers at Georgetown University developed the Georgetown Automatic Translation (GAn that had three levels of analysis: morphological, syntagmatic, and syntactic. GAT was initially put into operation on the SERNA system and, then, a new version was successfully installed by Euratom at Ispra (Italy) in 1963 and by the US Atomic Energy Commission in From 1954 to 1960, Anthony Oettinger and his group at Harvard University compiled a substantial Russian-English dictionary considered a precursor of the now common computer-based dictionary aids. From 1959 onwards a new direction for research could be identified: the predictive syntactic analyzer - a system for the identification of acceptable sequences of grammatical items and the probabilistic prediction of following items (initiator: National Bureau of Standards, Ida Rhodes); successive improvements led later to William Woods' familiar Augmented Transition Network parser). From 1953 until 1965, at MIT, under Victor Yngve, syntax was placed at the centre of MT research but, the conclusion was that a semantic barrier was reached and further progress in the field was extremely difficult. The Linguistic Research Center at the University of Texas, (initiator Winfried Lehmannin, 1958), focused on basic syntactic research and reversible grammars in order to realize bi-directional translation within the framework of an essentially syntactic transfer approach. At the University of California, Berkeley, Sydney Lamb developed his 'stratificational grammar', with an architecture of relationships paralleling that of computers and he also highlighted the importance of developing efficient electronic dictionaries and a linguistic theory appropriate for MT. The interlingua approach was not popular in the US; but it was elsewhere in the world: Margaret Masterman and the Cambridge Language Research Unit worked along two basic lines of research: the development of a prototype interlingua system and of tools for improving and refining MT output; Silvio Ceccato (Milan) aimed at developing an interlingua based on cognitive processes and he is considered one of the precursors of the neural net'll/orks. In the Soviet Union, research was somewhat similar to that in the United States (amplitude and mix of empirical and basic theoretical approaches): D.Y. Panov's group's research on English-Russian translation; basic research at the Steklov Mathematical
4 154 PROFESSIONAL COMMUNICATION AND TRANSLATION STUDIES 2008 Institute (A. Ljapunov, 0. Kulagina and I. Melcuk); the interlingua approach was represented especially by the group of Nikolaj Andreev (the Leningrad State University). By the end of the period under discussion, MT research groups had been established in many countries throughout the world, including most European countries, China, Mexico, and Japan. The 1950s was a very optimistic period when some researchers even predicted the imminent implementation of fully automatic MT systems. In 1960, Bar-Hillel stated that the creation of fully automatic high quality translation (FAHQT) systems was not realistic and impossible in principle, but his pessimism was not completely justified. In 1964, the Automatic Language Processing Advisory Committee (ALPAC) examined the situation of MT and, in 1966, the ALPAC report concluded that "there is no immediate or predictable prospect of useful machine translation". The committee recommended the development of machine aids for translators, such as automatic dictionaries, and the continued support of basic research in computational linguistics; the most important effect of the ALPAC report in the US and elsewhere was that the research changed its focus from the direct translation approaches, to the indirect models, both interlingua and transfer-based. 4. Gaining speed In the interval , while in the United States manifest interest in MT diminished, especially in Canada and Europe the situation was different. Due to bilingualism in Canada and multilingualism in the European Community (E.C., i.e. today's European Union) the problems of translation were acute and development in MT continued. In 1970, the TAUM (Traduction Automatlque de I'Univers/te de Montreal) project, and the Meteo system for translating weather forecasts (successful operation since 1976) produced results. Besides Meteo, two other sublanguage systems appeared: in 1970, TITUS (at the Institut Textile de France), a multilingual system for translating abstracts written in a controlled language, and. in 1972, CULT (designed for translating mathematics texts from Chinese into English) of the Chinese University of Hong Kong. The main experiments of the decade focused on approaches from the essentially interlingua perspective (e.g , Bernard Vauquois of Grenoble University, France and his group developed a system for translating mathematics and physics texts from Russian into French). Towards the end of the period, the prospects of the interlingua approach seemed to become narrower and narrower and the less ambitious transfer approach seemed to offer better prospects. The period is characterized mainly by the implementation on a large scale of previously developed systems and the launching of several operational and commercial systems and less by important theoretical breakthroughs: 1976, the Commission of the EC purchased an English-French version of the Systran MT system that had already been used at the US Department of Defense; NATO, International Atomic Energy Authority, Genera! Motors, Dornier, Aerospatiale i etc, a system from the Logos Corporation (Bernard E. Scott), and, at the end of the 1980s, the commercial METAL German-English system. Special-purpose systems were also developed, e.g. systems for the Pan American Health Organization (Washington, D.C.). In Japan, most of the computer companies developed software for computer-aided translation. The languages of choice were English, Korean, and Chinese and the majority of them were low-level direct or transfer systems. A new direction emerges in the 1980s: systems designed for personal computers,
5 PROFESSIONAL COMMUNICATION AND TRANSLATION STUDIES I.e. Weidner (1981) and ALPS (1983). Towards the end of the decade, PC-Translator from linguistic Products, GTS from Globalink and the Language Assistant series from MicroTac). 5. Renewed interest in the theoretical aspects of MT The period is characterized by a renewed general interest in the theoretical aspects of MT. The main tendency was the quasi-general adoption of the three-stage transfer-based approach. The Grenoble group (GETA - Groupe d'etudes pour la Traduction Automatique) switched allegiances from the interlingua system and embarked on the development of a system which was soon considered as the prototype of the new generation of linguisticsbased transfer systems. The Mu system was developed at the University of Kyoto and, since 1986, it has been converted into an operational system used by the Japanese Information Center for Science and Technology for the translation of abstracts. In the mid-1970s, MT research at SaarbrOcken (Germany) resulted in the development of a multilingual transfer system SUSY (SaarbrOcker Obersetzungssystem). In the late 1980s, one can identify a renewed interest in interlingua systems with notable results: in the Netherlands, the DL T (Distributed Language Translation) system of 8S0, Utrecht, led to the construction of large lexical databases, and the Rosetta project focused upon the use of Montague grammar in interlingual representations and the exploration of the reversibility of grammars. Another direction in MT research was the implementation of the results of explorations in artificial intelligence (AI): Yorick Wilks' work on 'preference semantics' and 'semantic templates', in the mid-1970s, and Roger Schank's research at Yale University, and the projects that applied knowledge-based approaches at Carnegie-Mellon University in Pittsburgh. In the late 1980s, the Carnegie-Mellon group elaborated its KANT prototype system and could implement it in an operational knowledge-based system. One must also mention a major change of perspective: the syntactic orientation was replaced by 'Iexicalist' approaches, the results being the Electronic Dictionary Research project (late 1980s, in Japan), the massive database and dictionary resources (the linguistic Data Consortium, United States, and the European Language Resources Association), all these being available to our days. During the 1980s, computer aids were developed, and in the early 1990s, all these and more were integrated in the so-called 'translator's workstation/workbench': Translator's Workbench (Trados), Transit (STAR AG), Translation Manager (IBM), Eurolang Optimizer. followed in the 1990s and early 2000s by many others: Deja Vu (Atnl), the SDLX system (SOL), XMS (Xerox). Log/Term (Terminotix). MulffTrans (MultiCorpora), WordFast (Champollion), MetaTexis, and ProMemoria. And yet, in spite of progress in MT, at the beginning of the 1990s the translation market was as follows (Loffler-Laurian, 1996): Europe and the United States - human translation (HT) 300 million pages as compared to MT 2.5 million pages; Japan HT 150 million pages versus MT 3.5 million pages. Market analysts predicted that this percentage will not change radically by Research since 1989 Until the end of the 1980s, MT research favored the rule-based approach, but since 1989, the supremacy of the rule-based approach has been successfully challenged by the corpus-based approaches. Since then, statistical machine translation (SMT) has
6 156 PROFESSIONAL COMMUNICATION AND TRANSLATION STUDIES ~20,"",O...;...8 become the major focal point of various research groups. Another corpus-based approach is the examp/e-based/memory-based approach (EBMT) that was initiated by Japanese researchers. The rule-based approaches have not been discarded altogether: the CAT2 system (SaarbrOcken), the PaTrans system (for Danish/English translation of patents), the LMT (Logic-programming Machine Translation) project (IBM research centers in Germany, Spain, Israel and the USA). Neither was the interlingua approach: the CATAL YST system at Carnegie-Mellon University, the ULTRA system (New Mexico State University), the UN/TRAN system (University of Maryland), and the Pangloss (the universities of Southern California, New Mexico State and Carnegie-Mellon). In the US, this period marks the end of the negative impact of the ALPAC report. Speech translation is another of the most significant and ambitious developments since the late 1980s. The pioneers were British Telecom (late 1980s, a spoken language phrasebook type system, ATR, Japan (1986, a system for telephone registrations at international conferences and for telephone booking of hotel accommodation). Other important projects are JANUS (Alex Waibel, Carnegie-Mellon University), C-STAR (Consortium for Speech Translation Advanced Research) and VoxTec for the US military, the Phrase/afor (a one-way phrase-based voice-to-voice machine translator into numerous languages). There has been an accelerated increase in the use of MT systems since the 1990s. One of the fastest growing areas is software /ocalization and localization of web pages on company sites, Also, systems for specialized domains and users, the so-called controlled language systems have been developed. A. Gross (1992) identifies "a more extreme form of pre-editing, controlled language, the severely limited vocabulary which is used by a few companies to make MT as foolproof as possible", John Newton (1992:40), identifies one of the most successful machine translation applications at Perkins Engines, as an example of what can be achieved when a system is introduced in a thoroughly planned and methodical way into a restricted domain environment to process controlledlanguage source texts. Since the early 1990s, scores of systems for personal computers have appeared: among the first systems to be sold extensively for use on personal computers were PC Translator (Linguistic Products, Texas) and Power Trans/ator (Globalink). In order to provide an illustration of the evolutions in the field of commercial machine translation systems and computer-aided translation support tools we should mention the fact that if, in its preliminary draft edition of 1999 the Compendium of Trans/ation Software, (a directory of commercial machine translation systems and computer-aided translation support tools compiled by J. Hutchins on behalf of the European Association for Machine Translation and the International Association for Machine Translation) had 69 pages; in its 13 ih edition of June 2007, it has MT on the Internet Since the mid-1990s, when the Internet became more and more available to ever increasing numbers of users, it has exercised a great influence upon MT development. First, MT software products specifically designed for offline translation of web pages and messages made their debut. Then,!nternet~based online translation services for on-demand translation have been offered. A precursor was the service offered in France by Systran on the Minitel network during the 1980s. Then, in 1995, the idea was revived by CompuServe. Then, Babe/fish was offered by AltaVista and various other online services offered online
7 PROFESSIONAL COMMUNICA TlON AND TRANSLA TlON STUDIES versions of their systems: Softissimo with Reverso, LogoMedia with LogoVista and PARS, etc. This is a developing story and it needs a separate survey. 8. Assessment of MT MT evaluation has become a major concem for all those involved in the activity and in the 1990s there were numerous workshops dedicated specifically to the problems of evaluating MT, e,g. Falkedal1994, Vasconcellos 1994, and the workshops attached to many MT conferences. Coherent methodologies have been developed by organizations such as, for instance, Japan Electronic Industry Development Association (JEIDA 1992) and Advanced Research Projects Agency in the US (ARPA 1994). At first, MT quality assessment was performed by humans and they focused upon comprehensibility, intelligibility, fluency, accuracy and appropriateness but this seemed to be time~ consuming and too laborious; especially since 2000, automatic or semi~automatic methods have been developed that rely on the application of statistical analysis to the automatic evaluation of MT systems. We can mention some initial successes: BLEU (IBM) and NIST (National Institute for Standards and Techniques); nevertheless, one must mention that there is no general agreement upon the reliability of such means of evaluation, and research upon devising infallible ones continues and intensifies. 9. Conclusion In its early days, MT's implicit goal was the automatic translation of documents at a quality equal to that of the output of best human translators, notwithstanding their nature and complexity. It soon became obvious that this objective was unfeasible in the predictable future. Revision by specialized humans of MT output was indispensable if the results were meant for publication. Nevertheless, MT was not doomed as it was found that the rough and ready MT output cquld be of use to those who wanted a shortcut to the gist of a text written in a language that was not accessible to them. This use of MT as a tool of assimilation, for information gathering and monitoring was ignored or disregarded. MT was unrealistically supposed to produce human-quality translations for dissemination and for publication. Fortunately though, for many years, the potential of HAMT has been put to use by multinational corporations and other multilingual institutions such as for instance the European Union, and has contributed to the solving of massive translation tasks, The first obvious strong point of MT is productivity. An MT system can process huge amounts of information at speeds unattainable by human translators (a decade or so ago, there have been reports of 700 pages translated per day, by one system). These performances, of course, are dependent upon how well the system has been conceived with respect to the translation task: the more specific, the better defined the task is, the better the results. This brings us to another an enormous advantage of MT, A huge amount of work has already been spent on codifying specialized vocabulary and entering it into the computer's dictionary. Thus, translations of specialized texts can reach high levels of accuracy due to it If the texts to be translated are not specialized, if they are literature or other types that require massive operations of disambiguation, the levels of accuracy will be deficient, the post-editing will be time~ consuming, There is another plus for MT: uniformity of vocabulary within the same translated text. Once the dictionary-building task has been correctly performed, the computer will constantly offer the same translation of the same structure, of the same term.
8 158 PROFESSIONAL COMMUNICATION AND TRANSLA non STUDIES 2008 MT is thus cost-effective for quantitative and/or rapid translation of specialized documentation and software localization materials, while the human translator has been and wlll be the unavoidable solution for non-repetitive linguistically complicated texts (literary, legal, etc.), and even for individual texts using highly specialized vocabularies. The solutions to MT problems that have been devised have turned out to be invaluable tools for human translators, e.g. computer-based translation support tools. All these tools are combined in translator workstations that frequently incorporate full MT systems as well. The coming of online translation on the Internet has given the kiss of life to MT and has triggered most significant changes, with potentially extensive implications for the future. lviassive availability of information in many languages has created a rapidly growing demand for real-time online availability of rough translations to quench instant communication and information needs. These services have already evolved from providing low-quality MT to providing add-on human translation services such as postediting or full translation. A close scrutiny of the evolutions in MT makes it all too obvious that the adoption of multiple approaches will lead to success in achieving good-quality automatic translation and also that MT continuously adapts to its users' needs and is stil/light years away from exhausting its potential. References 1. Gross, Alex, "Limitations of computers as translation tools " in Newton, John, (ed.) Computers in Translation: A Practical Appraisal, New York: Routledge, Hutchins, John "Compendium of Translation Software", various editions, Hutchins, W. John, "Machine translation: a concise history" WJHutchins, Loffler-Laurian, Anne-Marie. La traduction automatique. Villeneuve d'ascq: Presse Universitaire du Septentrion, Newton, John, (ed.) Computers in Translation: A Practical Appraisal, New York: Routledge, Newton, John" "The Perkins Experience", in Newton, John, (ed.) Computers in Translation: A Practical Appraisal, New York: Routledge, Robinson, Douglas, Performative Linguistics: Speaking and Translating as Doing Things with Words. London: Routledge, 2003.
The history of machine translation in a nutshell
The history of machine translation in a nutshell John Hutchins [Web: http://ourworld.compuserve.com/homepages/wjhutchins] [Latest revision: November 2005] 1. Before the computer 2. The pioneers, 1947-1954
The history of machine translation in a nutshell
1. Before the computer The history of machine translation in a nutshell 2. The pioneers, 1947-1954 John Hutchins [revised January 2014] It is possible to trace ideas about mechanizing translation processes
Machine translation over fifty years
[Published in: Histoire, Epistemologie, Langage, Tome XXII, fasc. 1 (2001), p.7-31] Machine translation over fifty years W.John Hutchins University of East Anglia, Norwich, UK. [Email: [email protected]]
Overview of MT techniques. Malek Boualem (FT)
Overview of MT techniques Malek Boualem (FT) This section presents an standard overview of general aspects related to machine translation with a description of different techniques: bilingual, transfer,
MACHINE TRANSLATION: A BRIEF HISTORY. W.John Hutchins
[From: Concise history of the language sciences: from the Sumerians to the cognitivists. Edited by E.F.K.Koerner and R.E.Asher. Oxford: Pergamon Press, 1995. Pages 431-445] MACHINE TRANSLATION: A BRIEF
Machine Translation Computer Aided Translation Machine Language Processing
Machine Translation Computer Aided Translation Machine Language Processing Martin Kappus ([email protected]) 1 Machine Translation Computer-Aided Translation Agenda Machine Translation Introduction History
Automation of Translation: Past, Presence, and Future Karl Heinz Freigang, Universität des Saarlandes, Saarbrücken
Automation of Translation: Past, Presence, and Future Karl Heinz Freigang, Universität des Saarlandes, Saarbrücken Introduction First attempts in "automating" the process of translation between natural
How To Overcome Language Barriers In Europe
MACHINE TRANSLATION AS VALUE-ADDED SERVICE by Prof. Harald H. Zimmermann, Saarbrücken (FRG) draft: April 1990 (T6IBM1) I. THE LANGUAGE BARRIER There is a magic number common to the European Community (E.C.):
How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.
Svetlana Sokolova President and CEO of PROMT, PhD. How the Computer Translates Machine translation is a special field of computer application where almost everyone believes that he/she is a specialist.
SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 SYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 CWMT2011 技 术 报 告
SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 Jin Yang and Satoshi Enoue SYSTRAN Software, Inc. 4444 Eastgate Mall, Suite 310 San Diego, CA 92121, USA E-mail:
Comprendium Translator System Overview
Comprendium System Overview May 2004 Table of Contents 1. INTRODUCTION...3 2. WHAT IS MACHINE TRANSLATION?...3 3. THE COMPRENDIUM MACHINE TRANSLATION TECHNOLOGY...4 3.1 THE BEST MT TECHNOLOGY IN THE MARKET...4
[From: International Journal of Translation 13, 1-2 Jan-Dec 2001, pp.5-20. Special theme issue on machine translation, edited by Michael S.
[From: International Journal of Translation 13, 1-2 Jan-Dec 2001, pp.5-20. Special theme issue on machine translation, edited by Michael S. Blekhman] Machine translation and human translation: in competition
Web-based automatic translation: the Yandex.Translate API
Maarten van Hees [email protected] Web-based automatic translation: the Yandex.Translate API Paulina Kozłowska [email protected] Nana Tian [email protected] ABSTRACT Yandex.Translate
4.1 Multilingual versus bilingual systems
This chapter is devoted to some fundamental questions on the basic strategies of MT systems. These concern decisions which designers of MT systems have to address before any construction can start, and
Glossary of translation tool types
Glossary of translation tool types Tool type Description French equivalent Active terminology recognition tools Bilingual concordancers Active terminology recognition (ATR) tools automatically analyze
Machine Translation at the European Commission
Directorate-General for Translation Machine Translation at the European Commission Konferenz 10 Jahre Verbmobil Saarbrücken, 16. November 2010 Andreas Eisele Project Manager Machine Translation, ICT Unit
SYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 统
SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems Jin Yang, Satoshi Enoue Jean Senellart, Tristan Croiset SYSTRAN Software, Inc. SYSTRAN SA 9333 Genesee Ave. Suite PL1 La Grande
REALIZATION SORTING ALGORITHM USING PARALLEL TECHNOLOGIES bachelor, Mikhelev Vladimir candidate of Science, prof., Sinyuk Vasily
2. В. Гергель: Современные языки и технологии параллельного программирования. М., Изд-во МГУ, 2012. 3. Синюк, В. Г. Алгоритмы и структуры данных. Белгород: Изд-во БГТУ им. В. Г. Шухова, 2013. REALIZATION
LANGUAGE TRANSLATION SOFTWARE EXECUTIVE SUMMARY Language Translation Software Market Shares and Market Forecasts Language Translation Software Market
LANGUAGE TRANSLATION SOFTWARE EXECUTIVE SUMMARY Language Translation Software Market Shares and Market Forecasts Language Translation Software Market Driving Forces Language Translation Software Cloud
Anubis - speeding up Computer-Aided Translation
Anubis - speeding up Computer-Aided Translation Rafał Jaworski Adam Mickiewicz University Poznań, Poland [email protected] Abstract. In this paper, the idea of Computer-Aided Translation is first introduced
Language and Computation
Language and Computation week 13, Thursday, April 24 Tamás Biró Yale University [email protected] http://www.birot.hu/courses/2014-lc/ Tamás Biró, Yale U., Language and Computation p. 1 Practical matters
THE SCOPE AND LIMITS OF MACHINE TRANSLATION
THE SCOPE AND LIMITS OF MACHINE TRANSLATION Magdalena Cieślak Faculdade de Letras da Universidade do Porto (Portugal) 1 - Translation Process and Theory Even though no one is capable of providing an exact
The XMU Phrase-Based Statistical Machine Translation System for IWSLT 2006
The XMU Phrase-Based Statistical Machine Translation System for IWSLT 2006 Yidong Chen, Xiaodong Shi Institute of Artificial Intelligence Xiamen University P. R. China November 28, 2006 - Kyoto 13:46 1
Current commercial machine translation systems and computer-based translation tools: system types and their uses
Current commercial machine translation systems and computer-based translation tools: system types and their uses John Hutchins [Email: [email protected]] [Website: http://ourworld.compuserve.com/homepages/wjhutchins]
AN INTERACTIVE ON-LINE MACHINE TRANSLATION SYSTEM (CHINESE INTO ENGLISH)
[From: Translating and the Computer, B.M. Snell (ed.), North-Holland Publishing Company, 1979] AN INTERACTIVE ON-LINE MACHINE TRANSLATION SYSTEM (CHINESE INTO ENGLISH) Shiu-Chang LOH and Luan KONG Hung
Automated Online English -Arabic Translator
Faculty of Engineering & Architecture Electrical & Computer Engineering Department Final Year Project - Spring 2006 Automated Online English -Arabic Translator Supervisor: Prof. Ali El Hajj Report Grader:
NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR
NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR Arati K. Deshpande 1 and Prakash. R. Devale 2 1 Student and 2 Professor & Head, Department of Information Technology, Bharati
Translation WARREN WEAVER A War Anecdote Language Invariants
[Written 15 July 1949. Published in: Machine translation of languages: fourteen essays, ed. by William N. Locke and A. Donald Booth (Technology Press of the Massachusetts Institute of Technology, Cambridge,
PROMT Technologies for Translation and Big Data
PROMT Technologies for Translation and Big Data Overview and Use Cases Julia Epiphantseva PROMT About PROMT EXPIRIENCED Founded in 1991. One of the world leading machine translation provider DIVERSIFIED
Customizing an English-Korean Machine Translation System for Patent Translation *
Customizing an English-Korean Machine Translation System for Patent Translation * Sung-Kwon Choi, Young-Gil Kim Natural Language Processing Team, Electronics and Telecommunications Research Institute,
M LTO Multilingual On-Line Translation
O non multa, sed multum M LTO Multilingual On-Line Translation MOLTO Consortium FP7-247914 Project summary MOLTO s goal is to develop a set of tools for translating texts between multiple languages in
COMPLEXITY RISING: FROM HUMAN BEINGS TO HUMAN CIVILIZATION, A COMPLEXITY PROFILE. Y. Bar-Yam New England Complex Systems Institute, Cambridge, MA, USA
COMPLEXITY RISING: FROM HUMAN BEINGS TO HUMAN CIVILIZATION, A COMPLEXITY PROFILE Y. BarYam New England Complex Systems Institute, Cambridge, MA, USA Keywords: complexity, scale, social systems, hierarchical
Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg
Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg March 1, 2007 The catalogue is organized into sections of (1) obligatory modules ( Basismodule ) that
ISA OR NOT ISA: THE INTERLINGUAL DILEMMA FOR MACHINE TRANSLATION
ISA OR NOT ISA: THE INTERLINGUAL DILEMMA FOR MACHINE TRANSLATION FLORENCE REEDER The MITRE Corporation 1 / George Mason University [email protected] ABSTRACT Developing a system that accurately produces
Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features
, pp.273-280 http://dx.doi.org/10.14257/ijdta.2015.8.4.27 Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features Lirong Qiu School of Information Engineering, MinzuUniversity of
Collecting Polish German Parallel Corpora in the Internet
Proceedings of the International Multiconference on ISSN 1896 7094 Computer Science and Information Technology, pp. 285 292 2007 PIPS Collecting Polish German Parallel Corpora in the Internet Monika Rosińska
Interlingua-based Machine Translation Systems: UNL versus Other Interlinguas
Interlingua-based Machine Translation Systems: UNL versus Other Interlinguas Sameh AlAnsary Department of Phonetics and Linguistics, Faculty of Arts, Alexandria University ElShatby, Alexandria, Egypt.
Why major in linguistics (and what does a linguist do)?
Why major in linguistics (and what does a linguist do)? Written by Monica Macaulay and Kristen Syrett What is linguistics? If you are considering a linguistics major, you probably already know at least
Getting Evidence into Practice. Final Report Strand II. Funded by the European Commission, project no. 2003123 (790841)
Funded by the European Commission, project no. 2003123 (790841) Getting Evidence into Practice Final Report Strand II The Getting Evidence into Practice Project, (Evidence Consortium, GEP, Grant agreement
The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge
The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge White Paper October 2002 I. Translation and Localization New Challenges Businesses are beginning to encounter
Automated Translation Quality Assurance and Quality Control. Andrew Bredenkamp Daniel Grasmick Julia V. Makoushina
Automated Translation Quality Assurance and Quality Control Andrew Bredenkamp Daniel Grasmick Julia V. Makoushina Andrew Bredenkamp Introductions (all) CEO acrolinx, Computational Linguist, QA Tool Vendor
Language Translation Software and Services - Global Strategic Business Report
Brochure More information from http://www.researchandmarkets.com/reports/2228040/ Language Translation Software and Services - Global Strategic Business Report Description: This report analyzes the worldwide
Processing: current projects and research at the IXA Group
Natural Language Processing: current projects and research at the IXA Group IXA Research Group on NLP University of the Basque Country Xabier Artola Zubillaga Motivation A language that seeks to survive
Online free translation services
[Translating and the Computer 24: proceedings of the International Conference 21-22 November 2002, London (Aslib, 2002)] Online free translation services Thei Zervaki [email protected] Introduction
D2.4: Two trained semantic decoders for the Appointment Scheduling task
D2.4: Two trained semantic decoders for the Appointment Scheduling task James Henderson, François Mairesse, Lonneke van der Plas, Paola Merlo Distribution: Public CLASSiC Computational Learning in Adaptive
Cross-Lingual Concern Analysis from Multilingual Weblog Articles
Cross-Lingual Concern Analysis from Multilingual Weblog Articles Tomohiro Fukuhara RACE (Research into Artifacts), The University of Tokyo 5-1-5 Kashiwanoha, Kashiwa, Chiba JAPAN http://www.race.u-tokyo.ac.jp/~fukuhara/
On Technological Turn of Translation Studies: Evidences and Influences
ISSN 1798-4769 Journal of Language Teaching and Research, Vol. 6, No. 2, pp. 429-434, March 2015 DOI: http://dx.doi.org/10.17507/jltr.0602.25 On Technological Turn of Translation Studies: Evidences and
Study Plan. Bachelor s in. Faculty of Foreign Languages University of Jordan
Study Plan Bachelor s in Spanish and English Faculty of Foreign Languages University of Jordan 2009/2010 Department of European Languages Faculty of Foreign Languages University of Jordan Degree: B.A.
LISA Best Practice Guide Implementing Machine Translation Localization Industry Standards Association
LISA Best Practice Guide Implementing Machine Translation Localization Industry Standards Association Localization Industry Standards Association Since 1990, the Localization Industry Standards Association
Comparative Evaluation of Online Machine Translation Systems with Legal Texts *
Comparative Evaluation of Online Machine Translation Systems with Legal Texts * Chunyu Kit ** and Tak Ming Wong *** The authors discuss both the proper use of available online machine translation (MT)
The Present Status of Automatic Translation of Languages
[From: Advances in Computers, vol. 1 (1960), p.91-163] The Present Status of Automatic Translation of Languages YEHOSHUA BAR-HILLEL Hebrew University, Jerusalem, Israel 1. Aims and Methods, Survey and
MACHINE TRANSLATION. I.1 Introduction to Statistical Machine Translation
Universitat Politècnica de València Master en Inteligencia Artificial, Renoconocimiento de Formas e Imagen Digital Master in Artificial Intelligence, Pattern Recognition and Digital Imaging 2015-2016 MACHINE
REGULATIONS FOR THE DEGREE OF MASTER OF EDUCATION (MEd)
215 REGULATIONS FOR THE DEGREE OF MASTER OF EDUCATION (MEd) (See also General Regulations) Any publication based on work approved for a higher degree should contain a reference to the effect that the work
CREATING LEARNING OUTCOMES
CREATING LEARNING OUTCOMES What Are Student Learning Outcomes? Learning outcomes are statements of the knowledge, skills and abilities individual students should possess and can demonstrate upon completion
Research into competency models in arts education
Research into competency models in arts education Paper presented at the BMBF Workshop International Perspectives of Research in Arts Education, Nov. 4 th and 5 th, 2013. Folkert Haanstra, Amsterdam School
A web-based multilingual help desk
LTC-Communicator: A web-based multilingual help desk Nigel Goffe The Language Technology Centre Ltd Kingston upon Thames Abstract Software vendors operating in international markets face two problems:
POST-EDITING SERVICE FOR MACHINE TRANSLATION USERS AT THE EUROPEAN COMMISSION
[Translating and the Computer 20. Proceedings from Aslib conference, 12 & 13 November 1998] POST-EDITING SERVICE FOR MACHINE TRANSLATION USERS AT THE EUROPEAN COMMISSION Dorothy Senez C-80 3/12 Post-editing
Exhibit F. VA-130620-CAI - Staff Aug Job Titles and Descriptions Effective 2015
Applications... 3 1. Programmer Analyst... 3 2. Programmer... 5 3. Software Test Analyst... 6 4. Technical Writer... 9 5. Business Analyst... 10 6. System Analyst... 12 7. Software Solutions Architect...
EUROPEAN. Geographic Trend Report for GMAT Examinees
2011 EUROPEAN Geographic Trend Report for GMAT Examinees EUROPEAN Geographic Trend Report for GMAT Examinees The European Geographic Trend Report for GMAT Examinees identifies mobility trends among GMAT
IBC PhD programme: Professional Intercultural Communication
March 2014 IBC PhD programme: Professional Intercultural Communication The PhD programme in Professional Intercultural Communication (PINC) is hosted by the Department of International Business Communication
Translation Solution for
Translation Solution for Case Study Contents PROMT Translation Solution for PayPal Case Study 1 Contents 1 Summary 1 Background for Using MT at PayPal 1 PayPal s Initial Requirements for MT Vendor 2 Business
How to Manage Costs and Expectations for Successful E-Discovery: Best Practices
How to Manage Costs and Expectations for Successful E-Discovery: Best Practices Mukesh Advani, Esq., Advisory Board Member, UBIC North America, Inc. UBIC North America, Inc. 3 Lagoon Dr., Ste. 180, Redwood
Master of Arts in Linguistics Syllabus
Master of Arts in Linguistics Syllabus Applicants shall hold a Bachelor s degree with Honours of this University or another qualification of equivalent standard from this University or from another university
IAI : Knowledge Representation
IAI : Knowledge Representation John A. Bullinaria, 2005 1. What is Knowledge? 2. What is a Knowledge Representation? 3. Requirements of a Knowledge Representation 4. Practical Aspects of Good Representations
MASTERS OF SCIENCE ADVANCED ENGLISH PROFESSIONAL STUDIES IN THE FIELD OF ELECTRICAL ENERGETICS AND ENGINEERING
MASTERS OF SCIENCE ADVANCED ENGLISH PROFESSIONAL STUDIES IN THE FIELD OF ELECTRICAL ENERGETICS AND ENGINEERING Vasilii V. TIUNOV The FSBEI of HPE «The Perm National Research Polytechnic University» (PNRPU)
LINGSTAT: AN INTERACTIVE, MACHINE-AIDED TRANSLATION SYSTEM*
LINGSTAT: AN INTERACTIVE, MACHINE-AIDED TRANSLATION SYSTEM* Jonathan Yamron, James Baker, Paul Bamberg, Haakon Chevalier, Taiko Dietzel, John Elder, Frank Kampmann, Mark Mandel, Linda Manganaro, Todd Margolis,
Introduction to 30th Anniversary Perspectives on Cognitive Science: Past, Present, and Future
Topics in Cognitive Science 2 (2010) 322 327 Copyright Ó 2010 Cognitive Science Society, Inc. All rights reserved. ISSN: 1756-8757 print / 1756-8765 online DOI: 10.1111/j.1756-8765.2010.01104.x Introduction
320 E. 46 th Street, 11G New York, NY 10017 Tel. 212-751-5150 Ext. 5
320 E. 46 th Street, 11G New York, NY 10017 Tel. 212-751-5150 Ext. 5 Ms. Lisa Fowlkes Deputy Chief, Public Safety and Homeland Security Bureau Federal Communications Commission 445 12 th Street, NW, Room
Computers and the Creative Process
Computers and the Creative Process Kostas Terzidis In this paper the role of the computer in the creative process is discussed. The main focus is the investigation of whether computers can be regarded
The First Online 3D Epigraphic Library: The University of Florida Digital Epigraphy and Archaeology Project
Seminar on Dec 19 th Abstracts & speaker information The First Online 3D Epigraphic Library: The University of Florida Digital Epigraphy and Archaeology Project Eleni Bozia (USA) Angelos Barmpoutis (USA)
Machine vs. Human Translation Scott Bass, Advanced Language Translation Inc.
Machine vs. Human Translation Scott Bass, Advanced Language Translation Inc. Using computers to translate text from one language to another (referred to as machine translation [MT]) no longer faces the
LSI TRANSLATION PLUG-IN FOR RELATIVITY. within
within LSI Translation Plug-in (LTP) for Relativity is a free plug-in that allows the Relativity user to access the STS system 201 Broadway, Cambridge, MA 02139 Contact: Mark Ettinger Tel: 800-654-5006
The Language Grid The Language Grid combines users language resources and machine translators to produce high-quality translation that is customized
The Language Grid The Language Grid combines users language resources and machine translators to produce high-quality translation that is customized to each field. The Language Grid, a software that provides
Brill s rule-based PoS tagger
Beáta Megyesi Department of Linguistics University of Stockholm Extract from D-level thesis (section 3) Brill s rule-based PoS tagger Beáta Megyesi Eric Brill introduced a PoS tagger in 1992 that was based
THE DEVELOPMENT OF AN INTERNATIONAL BUSINESS STRATEGY
THE DEVELOPMENT OF AN INTERNATIONAL BUSINESS STRATEGY DeRuiter Consultancy International Market & Investment Surveyors A STEP-BY-STEP APPROACH TO DEVELOPING A BUSINESS STRATEGY FOR INTERNATIONAL GROWTH
Master of Science in Artificial Intelligence
Master of Science in Artificial Intelligence Options: Engineering and Computer Science (ECS) Speech and Language Technology (SLT) Big Data Analytics (BDA) Faculty of Engineering Science Faculty of Science
Hybrid Strategies. for better products and shorter time-to-market
Hybrid Strategies for better products and shorter time-to-market Background Manufacturer of language technology software & services Spin-off of the research center of Germany/Heidelberg Founded in 1999,
Vendor Satisfaction. Survey
Vendor Satisfaction Survey GSDTI May 2004 [email protected] W E H E L P T H E B A N K C O M M U N I C A T E! The Vendor Satisfaction Survey and this report were prepared by Claudia Pinto, (Vendor
