Rapid assembly of a large-scale French-English MT system

Size: px
Start display at page:

Download "Rapid assembly of a large-scale French-English MT system"

Transcription

1 Rapid assembly of a large-scale French-English MT system Jessie Pinkham, Monica Corston-Oliver, Martine Smets, and Martine Pettenaro Microsoft Research One Microsoft Way Redmond, WA {jessiep, martines, martinep}@microsoft.com Butler Hill Group moco@butlerhill.com Abstract Past research has shown that the ideal MT system should be modular and devoid of language pair specific information in its design. We describe here the assembly of (Traduction Automatique Microsoft), the French-English research MT system under development at Microsoft, which was constructed from a combination of pre-existing rule-based components and automatically created components. At this stage, the system has not been adapted either computationally or linguistically to the French-English context and yet it performs only slightly below the French-English Systran system in independent blind human evaluations Keywords Hybrid MT, EBMT, automatic transfer learning, French-English Introduction (Traduction Automatique Microsoft) is a French-English translation system which uses a French broad coverage analyzer, a large multi-purpose French dictionary, a French-English bilingual lexicon, an application-independent English natural language generation component and a transfer component. The transfer component consists of high-quality transfer patterns automatically acquired from sentence-aligned bilingual corpora using an alignment grammar and algorithm described in detail in Menezes (2001) (see Figure 1). The system is best characterized as a data-driven hybrid system, with rule-based analysis and generation, examplebased transfer, and some statistically derived lexical input. The automatic alignment procedure used to create the example base relies on the same parser employed during analysis and also makes use of its own small set of rules for determining permissible alignments. A moderately sized French-English dictionary, containing only word pairs and their parts of speech, provides translation candidates for the alignment procedure and is also used as a backup source of translations during transfer. Statistical techniques supply additional translation pair candidates for alignment and identify certain multi-word terms for parsing and transfer. A complete description of the

2 architecture and technical details appears in Richardson et al. (2001). We will focus here on describing the assembly and early stages of development of the French-English MT system, from its inception in October 2000 to now (April 2001). We report on evaluations conducted so far (in November, February and March), which show significant progress without any hand-coded linguistic changes specific to this language pair. Review of previous work Frank (1999) describes an effort to leverage work on parallel grammar development toward Machine Translation. The ParGram project involves parallel development in four languages (English, French, German, Norwegian). The effort stresses the importance of LFG representations, particularly f-structures, which encode functional properties shared across multiple languages, in the construction of an MT prototype covering fewer than 100 sentences. Though we do not rely on the LFG theory as a formalism, we have in our Logical Form component the same advantage of functional/semantic properties shared across several languages. The transfer rules written for the prototype described by Frank are extremely complex, demonstrating that handcoded transfer rules would be unsuitable for large-scale MT system development. Frank states that automatic efforts for acquisition of transfer rules and lexical transfer are the direction to explore. takes advantage of parallel development in multiple languages (English and French for this project, but see also Richardson et al. 2001). It also leverages a common functional representation, and incorporates a fully automatic transfer component and bilingual word association learning techniques. Example-based transfer as well as automatically extracted transfer has been proposed in a number of systems (see Somers 1999 for an overview). Richardson et al. (2001) report using fully automated transfer based on bilingual corpora for Spanish-English and English-Spanish that results in commercial level quality. In the current study, using the French-English version of the same system, we focus on studying translation quality at various stages of early assembly, before any language pair specific enhancements. Menezes et al. 2001, Moore 2001, Pinkham et al. 2001, Aikawa et al. 2001), but here is a brief outline. The French dictionary started with a 12,000 word lexicon from Brigham Young University. We manually augmented it to 25,000 entries and added more detailed features on words regarding morphology and syntactic behavior. A Novell word list boosted the number of headwords to about 68,000, and also provided morphological and some syntactic information. The analysis and generation of word forms is enhanced by a morphology engine which comprises both inflectional and derivational components. The French-English lexicon was built automatically from these sources: Cambridge University Press English- French, Soft-Art English-French, and Langenscheidt French-English and English-French dictionaries. The English-French translation data was reversed to create French-English pairs in order to augment the size of the dictionary, with a final translation count of 47,000 entries and 75,000 translation pairs of words and phrases 1. We added translation pairs extracted from the actual domain, using statistical word/phrase assignment. The algorithm used is described in Moore (2001). This resulted in one file of automatically created French- English translation correspondences, or word associations (WA), and a second file of specialized multi-word translation correspondences which we term Title Associations (TA). These files and their interaction with the system are detailed in Pinkham et al. (2001). The French and English broad coverage parsers produce conventional phrase structure analyses augmented with grammatical relations. Syntactic analyses undergo further processing in order to derive logical forms (LFs), which are graph structures that describe labeled dependencies among content words in the original input. LFs normalize certain syntactic alternations (e.g. active/passive) and resolve both intrasentential anaphora and long-distance dependencies. See Figure 2 for an illustration of a Logical Form representation. Modules used to assemble Assembly of requires the following list of components: Aligned bilingual text French monolingual dictionary Analysis grammar (morphology, syntax, logical form) Bilingual FE dictionary Automatically derived transfer mappings Automatically derived word-association pairs English generation component The majority of the components have been described elsewhere (Pinkham 1996, Richardson et al. 2001, Figure 2: Sample LF The code that builds LFs from syntactic analyses is shared between French and English 2. This shared architecture greatly simplifies the task of aligning LF segments from different languages, since superficially distinct constructions in two languages frequently collapse onto similar or identical LF representations. 1 The work of conversion and inversion of the information was conducted by J. Pentheroudakis. 2 LF is shared by all languages under development : English, French, Spanish, German, Chinese, Japanese and Korean.

3 acquires transfer mappings by aligning pairs of LFs obtained from parsing sentence pairs in a bilingual corpus. The LF alignment algorithm first establishes tentative lexical correspondences between nodes in the source and target LFs using translation pairs from a bilingual lexicon and the statistical word alignment. After establishing possible correspondences, the algorithm uses a small set of alignment grammar rules to align LF nodes according to both lexical and structural considerations and to create LF transfer mappings. The final step is to filter the mappings based on the frequency of their source and target sides. Menezes & Richardson (2001) provides further details and an evaluation of the LF alignment algorithm. The target LF produced by the transfer component is mapped to the target string by the rule-based generation component. The generation component has no information about the source language for a given input LF, and the English generation component is thus not specific to. Our system is very robust: in case syntactic analysis fails, a fitted parse is the output of the parser. A fitted parse is a collection of partial parses which cannot be combined together in a single parse by the grammar, but can still be used in further processing. More specifically, it can be used in alignment and transfer, and thus in translation. In addition to being designed for robustness, most of the components of our system have been tested extensively and are included in commercial products (for example, the grammar checker of Microsoft Word). Evaluation Method For each version of the system to be tested, seven evaluators were asked to evaluate the same set of 250 blind test sentences. For each sentence, raters were presented with a reference sentence, the original English sentence from which the human French translation was derived. In order to maintain consistency among raters who may have different levels of fluency in the source language, raters were not shown the original French sentence. Raters were also shown two machine translations, one from the system with the component being tested (), and one from the comparison system (Systran 3 ). Because the order of the two machine translation sentences was randomized on each sentence, evaluators could not determine which sentence was from which system. The order of presentation of sentences was also randomized for each rater in order to eliminate any ordering effect. The raters were asked to make a three-way choice. For each sentence, the raters were to determine which of the two automatically translated sentences was the better translation of the (unseen) source sentence, assuming that the reference sentence was a perfect translation, with the option of choosing neither if the differences were negligible. Raters were instructed to use their best judgment about the relative importance of fluency/style and accuracy/content preservation. We chose to use this simple three-way scale in order to avoid making any a priori judgments about the relative importance of these parameters for subjective judgments of quality. The three-way scale also allowed sentences to be rated on the same scale, regardless of whether the differences between output from system 1 and system 2 were substantial or relatively small; and regardless of whether either version of the system produced an adequate translation. The scoring system was similarly simple; each judgment by a rater was represented as 1 (sentence from judged better), 0 (neither sentence judged better), or -1 (sentence from Systran judged better). The score for each version of the sytstem was the mean of the scores of all sentences for all raters. The significance of the scores was calculated in two ways. First, we determined the range around the mean which we could report with 95% confidence (i.e. a confidence interval at.95), taking into account both variations in the sentences and variations across the raters' judgments. In order to determine the effects of each stage of development on the overall quality of the system, we calculated the significance of the difference in the scores across the different versions of the system to determine whether the difference between them was statistically meaningful. We used a one-tailed t-test, since our a priori hypothesis was that the system with more development would show improvement (that is, a statistically meaningful change in quality with respect to Systran). Results Date of evaluation Description of system Score vs. Systran November Basic / components.1 February Improved FE / dictionary.1 March 2001 Wordassociations /-.11 Table 1: Evaluation results Sample size The November system contained only the basic components, while the February system included an improved French-English dictionary. The March system was improved with a dictionary of translation pairs extracted automatically from the domain. The results of the t-tests show that the February system is significantly better than the November system (t = -.480; p <.00001) at a threshold of.95, while the March to February system comparison is on the border of significance at the.95 level (t = ; p = ). It is worth noting, however, that this is in the nature of incremental improvements in system quality; even though each small change may not create a statistically significant improvement, the aggregate of all changes dramatically improves the system over time. We believe that the addition of the word-association components is an example of such a change. 3 Systran was chosen as reference system because it is listed in the IDC report (Flanagan, 2000) as the best commercial FE system.

4 Effectiveness of the transfer mappings On a set of test data of 500 sentences, using the latest system, used an average of 6.9 mappings per sentence, each spanning approximately 1.6 words. (The greater the span, the more complex the mappings learned.) Table 2 below helps clarify the role played during translation by patterns learned during training Transfer database FE dictionary Same as source Lemmas 91.9% 5.3% 2.3% Pronouns 8.3% 56.1% Prepositions 34.5% 65.1% 0.3% Other relations 38.3% 61.7% Table 2: Statistics on transfer derives its transfer information solely from the transfer mappings database and from the FE bilingual dictionary; when both of these fail, the translation is the same as the source 4. We see that 91.9% of non-function words have translations from transfer mappings, with only 5.3% of cases coming from the FE dictionary. Pronouns are excluded from the alignment-learning algorithm, and yet do appear in the transfer mappings at the rate of 8.3%, when they come in bigger mapping chunks. Prepositions represent key relations which necessarily vary from one language to another, and will be better if they come from the transfer database, where they have been learned from corpora. They are learned 34.5% of the time. Other relations from the LF (Deep Subject, Deep Object) are transferred as is, accounting for the large number of relations that are the same as source (61.7%). We successfully learn other relations 38.3% of the time. Translation examples In this section, we present a few examples of translation, and compare them with the translation derived using FE Systran, a commercial translation software which we have taken as benchmark in our evaluations. All settings appropriate for translation in the computer domain were set on in the Systran translations. Our system so far does not outperform Systran on average, although the latest evaluation showed that the systems are quite close in quality of translation. We look at some examples where the raters judged the translation to be better than Systran and give an example of bad translation. The translation examples are presented as follows: first the source sentence labeled, followed by the reference (human) translation (), then the sentence created by our system (), and finally the sentence produced by Systran s machine translation (). 4 The numbers do not add up to 100% because of another category of untranslatable words, such as numbers, etc. Dans un réseau basé sur un domaine, les ordinateurs Windows NT sont configurés en tant que membres d'un domaine précis. In a domain-based network, Windows NT computers are configured as members of a specified domain. In a network based on a domain, the Windows NT computers are configured as specific domain members. In a network based on a field, the computers Windows NT are configured as members of a precise field. Even though it uses a domain dictionary, Systran does not select an appropriate translation for the word domaine (domain in the and translations, field in the Systran translation). In addition, the Systran translation fails to produce a English compound NP for the French sequence NN. For example, les ordinateurs Windows NT is translated as the computers Windows NT by Systran, but as Windows NT computers in the human and translations. Note the weakness in the translation: it keeps the definite determiner in English, which is incorrect. This is exactly the type of problem that will be addressed in future linguistic work in the system. When French uses the NP de NP construction, English more often uses a compound NP. uses the automatically learned transfer mappings (no human coding here at all) to correctly generate specific domain members for membres d'un domaine précis; in the example below, certification authority for Autorité de certification; and certificates services for services de certificats. These translations contribute to make correct translation sound more native than the Systran translation, even when both appear to preserve meaning adequately. Vous sélectionnez la stratégie qu'une Autorité de certification utilisera lors de l'installation des services de certificats. You select the policy a CA will use when you install Certificate Services. You select the policy that a certification authority will use at the time of installing the certificates services. You select the strategy which an Authority of certification will use at the time of the installation of the services of certificates. But there are worse problems than these in Systran translations. In the example below, serveurs is inaccurately translated as host by Systran, and this leads to the wrong choice of relative pronoun (whom) used here to refer to an inanimate (server).

5 Si vous utilisez d'autres serveurs DNS sur votre réseau, vérifiez que l'implémentation de serveur DNS qu'ils utilisent prend en charge les mises à jour dynamiques. If you are using other DNS servers on your network, verify that they are running a DNS server implementation that supports dynamic updates. If you use other DNS servers on your network, verify that the server DNS implementation that they use also supports the dynamic updates. If you use other hosts DNS on your network, check that the implementation of host DNS whom they use deals with the dynamic updates. In the translations, on the other hand, the information automatically collected in the tranfer database and bilingual dictionary permits a correct choice of words and expressions in the target language: serveur is translated as server. translations fail mostly when the analysis (parse or LF) is inadequate, sometimes dramatically. In the following example, the clause introduced by Quand is analyzed as a free relative, and the comma after that clause as a coordination. The boundary of the Quandclause is after services, and the main verb does not have a subject. The generation component then inserts a dummy subject, it. There is another problem, not related to the bad analysis: en réduction is not translated, because it is an unanalyzed unit in French which is not in the bilingual dictionary. On the other hand, some phrases are learned and elegantly translated, such as le Gestionnaire de services SQL Server and le menu Démarrer (SQL Server Service Manager and Start menu). Systran s translation is better but consistently lacks native fluency (and also suffers from untranslated words). Quand vous sélectionnez le Gestionnaire de services SQL Server dans le menu Démarrer, l'icône du gestionnaire de services s'affiche en réduction dans la barre des tâches. When you select SQL Server Service Manager from the Start menu, the Service Manager icon appears minimized in the taskbar by default. It appears on the en réduction bar of the tasks when you select the SQL Server Service Manager in the Start menu, the icon of the Manager of services. When you select the Manager of services SQL Server in the Démarrer menu, the icon of the Manager of services is displayed in reduction in the bar of the tasks. Conclusion The resulting French-English system performs slightly less well than Systran, which is the best French-English commercial system according to the IDC report (Flanagan 2000). By studying the system at its inception, we hope to demonstrate the modularity of the Microsoft Research MT system architecture and the remarkably good results achieved with automatic transfer learning alone. We expect a jump in quality once we begin to address language-pair specific linguistic issues in detail. References Aikawa, T. & Melero, M. & Schwartz, L. & Wu, A. (2001). Multilingual Sentence Generation. In Proceedings of the workshop on Natural Language Generation, ACL Conference, June Flanagan, M and McClure, S. (2000) Machine Translation Engines: An Evaluation of Output Quality, IDC publication Frank, A. (1999). From Parallel Grammar Development towards Machine Translation A Project Overview. In Proceedings of the MT Summit VII. Menezes, A. & Richardson, S. (2001). A Best-First Alignment Algorithm for Automatic Extraction of Transfer Mappings from Bilingual Corpora. In Proceedings of the Workshop on Data-Driven Machine Translation, ACL Conference, June Moore, R.C. (2001). Towards a Simple and Accurate Statistical Approach to Learning Translation Relationships Between Words. In Proceedings of the Workshop on Data-Driven Machine Translation, ACL Conference, June Pinkham, J. (1996). Grammar Sharing in French and English. In Proceedings of the First International Conference on Industrial Applications of Natural Language Processing (IANLP). Pinkham, J. & Corston-Oliver, M. (2001). Adding Domain Specificity to an MT system. In Proceedings of the Workshop on Data-Driven Machine Translation, ACL Conference, June Richardson, S. & Dolan, W. & Menezes, A. & Corston- Oliver, M. (2001). Overcoming the Customisation Bottleneck Using Example-Based MT. In Proceedings of the Workshop on Data-Driven Machine Translation, ACL Conference, June Somers, H. (1999). Review Article: Example-Based Machine Translation. Machine Translation 14: Way, A. (1999). A hybrid architecture for robust MT using LFG-DOP. Journal of Experimental and Theoretical Artificial Intelligence 11(1999) We expect that with the constant feedback provided by working on MT, we will be able to address a large number of these errors and improve the analysis modules. We are optimistic that we will then achieve consistently accurate, native-sounding translation.

Statistical Machine Translation

Statistical Machine Translation Statistical Machine Translation Some of the content of this lecture is taken from previous lectures and presentations given by Philipp Koehn and Andy Way. Dr. Jennifer Foster National Centre for Language

More information

Learning Translation Rules from Bilingual English Filipino Corpus

Learning Translation Rules from Bilingual English Filipino Corpus Proceedings of PACLIC 19, the 19 th Asia-Pacific Conference on Language, Information and Computation. Learning Translation s from Bilingual English Filipino Corpus Michelle Wendy Tan, Raymond Joseph Ang,

More information

CINDOR Conceptual Interlingua Document Retrieval: TREC-8 Evaluation.

CINDOR Conceptual Interlingua Document Retrieval: TREC-8 Evaluation. CINDOR Conceptual Interlingua Document Retrieval: TREC-8 Evaluation. Miguel Ruiz, Anne Diekema, Páraic Sheridan MNIS-TextWise Labs Dey Centennial Plaza 401 South Salina Street Syracuse, NY 13202 Abstract:

More information

SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 SYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 CWMT2011 技 术 报 告

SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 SYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 CWMT2011 技 术 报 告 SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 Jin Yang and Satoshi Enoue SYSTRAN Software, Inc. 4444 Eastgate Mall, Suite 310 San Diego, CA 92121, USA E-mail:

More information

Overview of MT techniques. Malek Boualem (FT)

Overview of MT techniques. Malek Boualem (FT) Overview of MT techniques Malek Boualem (FT) This section presents an standard overview of general aspects related to machine translation with a description of different techniques: bilingual, transfer,

More information

Comprendium Translator System Overview

Comprendium Translator System Overview Comprendium System Overview May 2004 Table of Contents 1. INTRODUCTION...3 2. WHAT IS MACHINE TRANSLATION?...3 3. THE COMPRENDIUM MACHINE TRANSLATION TECHNOLOGY...4 3.1 THE BEST MT TECHNOLOGY IN THE MARKET...4

More information

Learning Translations of Named-Entity Phrases from Parallel Corpora

Learning Translations of Named-Entity Phrases from Parallel Corpora Learning Translations of Named-Entity Phrases from Parallel Corpora Robert C. Moore Microsoft Research Redmond, WA 98052, USA bobmoore@microsoft.com Abstract We develop a new approach to learning phrase

More information

Customizing an English-Korean Machine Translation System for Patent Translation *

Customizing an English-Korean Machine Translation System for Patent Translation * Customizing an English-Korean Machine Translation System for Patent Translation * Sung-Kwon Choi, Young-Gil Kim Natural Language Processing Team, Electronics and Telecommunications Research Institute,

More information

Collecting Polish German Parallel Corpora in the Internet

Collecting Polish German Parallel Corpora in the Internet Proceedings of the International Multiconference on ISSN 1896 7094 Computer Science and Information Technology, pp. 285 292 2007 PIPS Collecting Polish German Parallel Corpora in the Internet Monika Rosińska

More information

The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge

The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge White Paper October 2002 I. Translation and Localization New Challenges Businesses are beginning to encounter

More information

Glossary of translation tool types

Glossary of translation tool types Glossary of translation tool types Tool type Description French equivalent Active terminology recognition tools Bilingual concordancers Active terminology recognition (ATR) tools automatically analyze

More information

SYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 统

SYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 统 SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems Jin Yang, Satoshi Enoue Jean Senellart, Tristan Croiset SYSTRAN Software, Inc. SYSTRAN SA 9333 Genesee Ave. Suite PL1 La Grande

More information

Automated Multilingual Text Analysis in the Europe Media Monitor (EMM) Ralf Steinberger. European Commission Joint Research Centre (JRC)

Automated Multilingual Text Analysis in the Europe Media Monitor (EMM) Ralf Steinberger. European Commission Joint Research Centre (JRC) Automated Multilingual Text Analysis in the Europe Media Monitor (EMM) Ralf Steinberger European Commission Joint Research Centre (JRC) https://ec.europa.eu/jrc/en/research-topic/internet-surveillance-systems

More information

Turker-Assisted Paraphrasing for English-Arabic Machine Translation

Turker-Assisted Paraphrasing for English-Arabic Machine Translation Turker-Assisted Paraphrasing for English-Arabic Machine Translation Michael Denkowski and Hassan Al-Haj and Alon Lavie Language Technologies Institute School of Computer Science Carnegie Mellon University

More information

AP FRENCH LANGUAGE AND CULTURE EXAM 2015 SCORING GUIDELINES

AP FRENCH LANGUAGE AND CULTURE EXAM 2015 SCORING GUIDELINES AP FRENCH LANGUAGE AND CULTURE EXAM 2015 SCORING GUIDELINES Identical to Scoring Guidelines used for German, Italian, and Spanish Language and Culture Exams Interpersonal Writing: E-mail Reply 5: STRONG

More information

Comma checking in Danish Daniel Hardt Copenhagen Business School & Villanova University

Comma checking in Danish Daniel Hardt Copenhagen Business School & Villanova University Comma checking in Danish Daniel Hardt Copenhagen Business School & Villanova University 1. Introduction This paper describes research in using the Brill tagger (Brill 94,95) to learn to identify incorrect

More information

Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast

Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Hassan Sawaf Science Applications International Corporation (SAIC) 7990

More information

Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features

Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features , pp.273-280 http://dx.doi.org/10.14257/ijdta.2015.8.4.27 Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features Lirong Qiu School of Information Engineering, MinzuUniversity of

More information

Online free translation services

Online free translation services [Translating and the Computer 24: proceedings of the International Conference 21-22 November 2002, London (Aslib, 2002)] Online free translation services Thei Zervaki tzervaki@hotmail.com Introduction

More information

Paraphrasing controlled English texts

Paraphrasing controlled English texts Paraphrasing controlled English texts Kaarel Kaljurand Institute of Computational Linguistics, University of Zurich kaljurand@gmail.com Abstract. We discuss paraphrasing controlled English texts, by defining

More information

Natural Language Database Interface for the Community Based Monitoring System *

Natural Language Database Interface for the Community Based Monitoring System * Natural Language Database Interface for the Community Based Monitoring System * Krissanne Kaye Garcia, Ma. Angelica Lumain, Jose Antonio Wong, Jhovee Gerard Yap, Charibeth Cheng De La Salle University

More information

Recent developments in machine translation policy at the European Patent Office

Recent developments in machine translation policy at the European Patent Office Recent developments in machine translation policy at the European Patent Office Dr Georg Artelsmair Director European Co-operation European Patent Office Brussels, 17 November 2010 The European Patent

More information

Identifying Focus, Techniques and Domain of Scientific Papers

Identifying Focus, Techniques and Domain of Scientific Papers Identifying Focus, Techniques and Domain of Scientific Papers Sonal Gupta Department of Computer Science Stanford University Stanford, CA 94305 sonal@cs.stanford.edu Christopher D. Manning Department of

More information

An Approach to Handle Idioms and Phrasal Verbs in English-Tamil Machine Translation System

An Approach to Handle Idioms and Phrasal Verbs in English-Tamil Machine Translation System An Approach to Handle Idioms and Phrasal Verbs in English-Tamil Machine Translation System Thiruumeni P G, Anand Kumar M Computational Engineering & Networking, Amrita Vishwa Vidyapeetham, Coimbatore,

More information

Statistical Machine Translation: IBM Models 1 and 2

Statistical Machine Translation: IBM Models 1 and 2 Statistical Machine Translation: IBM Models 1 and 2 Michael Collins 1 Introduction The next few lectures of the course will be focused on machine translation, and in particular on statistical machine translation

More information

Hybrid Strategies. for better products and shorter time-to-market

Hybrid Strategies. for better products and shorter time-to-market Hybrid Strategies for better products and shorter time-to-market Background Manufacturer of language technology software & services Spin-off of the research center of Germany/Heidelberg Founded in 1999,

More information

Multi language e Discovery Three Critical Steps for Litigating in a Global Economy

Multi language e Discovery Three Critical Steps for Litigating in a Global Economy Multi language e Discovery Three Critical Steps for Litigating in a Global Economy 2 3 5 6 7 Introduction e Discovery has become a pressure point in many boardrooms. Companies with international operations

More information

How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.

How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD. Svetlana Sokolova President and CEO of PROMT, PhD. How the Computer Translates Machine translation is a special field of computer application where almost everyone believes that he/she is a specialist.

More information

Natural Language to Relational Query by Using Parsing Compiler

Natural Language to Relational Query by Using Parsing Compiler Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,

More information

Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information

Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information Satoshi Sekine Computer Science Department New York University sekine@cs.nyu.edu Kapil Dalwani Computer Science Department

More information

QAD Business Intelligence Release Notes

QAD Business Intelligence Release Notes QAD Business Intelligence Release Notes September 2008 These release notes include information about the latest QAD Business Intelligence (QAD BI) fixes and changes. These changes may affect the way you

More information

Open Domain Information Extraction. Günter Neumann, DFKI, 2012

Open Domain Information Extraction. Günter Neumann, DFKI, 2012 Open Domain Information Extraction Günter Neumann, DFKI, 2012 Improving TextRunner Wu and Weld (2010) Open Information Extraction using Wikipedia, ACL 2010 Fader et al. (2011) Identifying Relations for

More information

English Grammar Checker

English Grammar Checker International l Journal of Computer Sciences and Engineering Open Access Review Paper Volume-4, Issue-3 E-ISSN: 2347-2693 English Grammar Checker Pratik Ghosalkar 1*, Sarvesh Malagi 2, Vatsal Nagda 3,

More information

A web-based multilingual help desk

A web-based multilingual help desk LTC-Communicator: A web-based multilingual help desk Nigel Goffe The Language Technology Centre Ltd Kingston upon Thames Abstract Software vendors operating in international markets face two problems:

More information

PROMT Technologies for Translation and Big Data

PROMT Technologies for Translation and Big Data PROMT Technologies for Translation and Big Data Overview and Use Cases Julia Epiphantseva PROMT About PROMT EXPIRIENCED Founded in 1991. One of the world leading machine translation provider DIVERSIFIED

More information

Ten steps to better requirements management.

Ten steps to better requirements management. White paper June 2009 Ten steps to better requirements management. Dominic Tavassoli, IBM Actionable enterprise architecture management Page 2 Contents 2 Introduction 2 Defining a good requirement 3 Ten

More information

THUTR: A Translation Retrieval System

THUTR: A Translation Retrieval System THUTR: A Translation Retrieval System Chunyang Liu, Qi Liu, Yang Liu, and Maosong Sun Department of Computer Science and Technology State Key Lab on Intelligent Technology and Systems National Lab for

More information

Keywords academic writing phraseology dissertations online support international students

Keywords academic writing phraseology dissertations online support international students Phrasebank: a University-wide Online Writing Resource John Morley, Director of Academic Support Programmes, School of Languages, Linguistics and Cultures, The University of Manchester Summary A salient

More information

Report Writing: Editing the Writing in the Final Draft

Report Writing: Editing the Writing in the Final Draft Report Writing: Editing the Writing in the Final Draft 1. Organisation within each section of the report Check that you have used signposting to tell the reader how your text is structured At the beginning

More information

BILINGUAL TRANSLATION SYSTEM

BILINGUAL TRANSLATION SYSTEM BILINGUAL TRANSLATION SYSTEM (FOR ENGLISH AND TAMIL) Dr. S. Saraswathi Associate Professor M. Anusiya P. Kanivadhana S. Sathiya Abstract--- The project aims in developing Bilingual Translation System for

More information

Data Discovery on the Information Highway

Data Discovery on the Information Highway Data Discovery on the Information Highway Susan Gauch Introduction Information overload on the Web Many possible search engines Need intelligent help to select best information sources customize results

More information

Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives

Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives Ramona Enache and Adam Slaski Department of Computer Science and Engineering Chalmers University of Technology and

More information

D2.4: Two trained semantic decoders for the Appointment Scheduling task

D2.4: Two trained semantic decoders for the Appointment Scheduling task D2.4: Two trained semantic decoders for the Appointment Scheduling task James Henderson, François Mairesse, Lonneke van der Plas, Paola Merlo Distribution: Public CLASSiC Computational Learning in Adaptive

More information

WHITE PAPER. Machine Translation of Language for Safety Information Sharing Systems

WHITE PAPER. Machine Translation of Language for Safety Information Sharing Systems WHITE PAPER Machine Translation of Language for Safety Information Sharing Systems September 2004 Disclaimers; Non-Endorsement All data and information in this document are provided as is, without any

More information

Dublin City University at CLEF 2004: Experiments with the ImageCLEF St Andrew s Collection

Dublin City University at CLEF 2004: Experiments with the ImageCLEF St Andrew s Collection Dublin City University at CLEF 2004: Experiments with the ImageCLEF St Andrew s Collection Gareth J. F. Jones, Declan Groves, Anna Khasin, Adenike Lam-Adesina, Bart Mellebeek. Andy Way School of Computing,

More information

OPTIMIZING CONTENT FOR TRANSLATION ACROLINX AND VISTATEC

OPTIMIZING CONTENT FOR TRANSLATION ACROLINX AND VISTATEC OPTIMIZING CONTENT FOR TRANSLATION ACROLINX AND VISTATEC We ll look at these questions. Why does translation cost so much? Why is it hard to keep content consistent? Why is it hard for an organization

More information

Empirical Machine Translation and its Evaluation

Empirical Machine Translation and its Evaluation Empirical Machine Translation and its Evaluation EAMT Best Thesis Award 2008 Jesús Giménez (Advisor, Lluís Màrquez) Universitat Politècnica de Catalunya May 28, 2010 Empirical Machine Translation Empirical

More information

Introduction. Philipp Koehn. 28 January 2016

Introduction. Philipp Koehn. 28 January 2016 Introduction Philipp Koehn 28 January 2016 Administrativa 1 Class web site: http://www.mt-class.org/jhu/ Tuesdays and Thursdays, 1:30-2:45, Hodson 313 Instructor: Philipp Koehn (with help from Matt Post)

More information

An Interactive Hypertextual Environment for MT Training

An Interactive Hypertextual Environment for MT Training An Interactive Hypertextual Environment for MT Training Etienne Blanc GETA, CLIPS-IMAG BP 53, F-38041 Grenoble cedex 09 Etienne.Blanc@imag.fr Abstract An interactive hypertextual environment for MT training

More information

Language and Computation

Language and Computation Language and Computation week 13, Thursday, April 24 Tamás Biró Yale University tamas.biro@yale.edu http://www.birot.hu/courses/2014-lc/ Tamás Biró, Yale U., Language and Computation p. 1 Practical matters

More information

Parsing Technology and its role in Legacy Modernization. A Metaware White Paper

Parsing Technology and its role in Legacy Modernization. A Metaware White Paper Parsing Technology and its role in Legacy Modernization A Metaware White Paper 1 INTRODUCTION In the two last decades there has been an explosion of interest in software tools that can automate key tasks

More information

Translation Solution for

Translation Solution for Translation Solution for Case Study Contents PROMT Translation Solution for PayPal Case Study 1 Contents 1 Summary 1 Background for Using MT at PayPal 1 PayPal s Initial Requirements for MT Vendor 2 Business

More information

Visionet IT Modernization Empowering Change

Visionet IT Modernization Empowering Change Visionet IT Modernization A Visionet Systems White Paper September 2009 Visionet Systems Inc. 3 Cedar Brook Dr. Cranbury, NJ 08512 Tel: 609 360-0501 Table of Contents 1 Executive Summary... 4 2 Introduction...

More information

Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines

Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines , 22-24 October, 2014, San Francisco, USA Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines Baosheng Yin, Wei Wang, Ruixue Lu, Yang Yang Abstract With the increasing

More information

Handbook on Test Development: Helpful Tips for Creating Reliable and Valid Classroom Tests. Allan S. Cohen. and. James A. Wollack

Handbook on Test Development: Helpful Tips for Creating Reliable and Valid Classroom Tests. Allan S. Cohen. and. James A. Wollack Handbook on Test Development: Helpful Tips for Creating Reliable and Valid Classroom Tests Allan S. Cohen and James A. Wollack Testing & Evaluation Services University of Wisconsin-Madison 1. Terminology

More information

Multi-Engine Machine Translation by Recursive Sentence Decomposition

Multi-Engine Machine Translation by Recursive Sentence Decomposition Multi-Engine Machine Translation by Recursive Sentence Decomposition Bart Mellebeek Karolina Owczarzak Josef van Genabith Andy Way National Centre for Language Technology School of Computing Dublin City

More information

MOVING MACHINE TRANSLATION SYSTEM TO WEB

MOVING MACHINE TRANSLATION SYSTEM TO WEB MOVING MACHINE TRANSLATION SYSTEM TO WEB Abstract GURPREET SINGH JOSAN Dept of IT, RBIEBT, Mohali. Punjab ZIP140104,India josangurpreet@rediffmail.com The paper presents an overview of an online system

More information

POSBIOTM-NER: A Machine Learning Approach for. Bio-Named Entity Recognition

POSBIOTM-NER: A Machine Learning Approach for. Bio-Named Entity Recognition POSBIOTM-NER: A Machine Learning Approach for Bio-Named Entity Recognition Yu Song, Eunji Yi, Eunju Kim, Gary Geunbae Lee, Department of CSE, POSTECH, Pohang, Korea 790-784 Soo-Jun Park Bioinformatics

More information

Depth-of-Knowledge Levels for Four Content Areas Norman L. Webb March 28, 2002. Reading (based on Wixson, 1999)

Depth-of-Knowledge Levels for Four Content Areas Norman L. Webb March 28, 2002. Reading (based on Wixson, 1999) Depth-of-Knowledge Levels for Four Content Areas Norman L. Webb March 28, 2002 Language Arts Levels of Depth of Knowledge Interpreting and assigning depth-of-knowledge levels to both objectives within

More information

Structural and Semantic Indexing for Supporting Creation of Multilingual Web Pages

Structural and Semantic Indexing for Supporting Creation of Multilingual Web Pages Structural and Semantic Indexing for Supporting Creation of Multilingual Web Pages Hiroshi URAE, Taro TEZUKA, Fuminori KIMURA, and Akira MAEDA Abstract Translating webpages by machine translation is the

More information

ACCURAT Analysis and Evaluation of Comparable Corpora for Under Resourced Areas of Machine Translation www.accurat-project.eu Project no.

ACCURAT Analysis and Evaluation of Comparable Corpora for Under Resourced Areas of Machine Translation www.accurat-project.eu Project no. ACCURAT Analysis and Evaluation of Comparable Corpora for Under Resourced Areas of Machine Translation www.accurat-project.eu Project no. 248347 Deliverable D5.4 Report on requirements, implementation

More information

Chapter 8. Final Results on Dutch Senseval-2 Test Data

Chapter 8. Final Results on Dutch Senseval-2 Test Data Chapter 8 Final Results on Dutch Senseval-2 Test Data The general idea of testing is to assess how well a given model works and that can only be done properly on data that has not been seen before. Supervised

More information

What s in a Lexicon. The Lexicon. Lexicon vs. Dictionary. What kind of Information should a Lexicon contain?

What s in a Lexicon. The Lexicon. Lexicon vs. Dictionary. What kind of Information should a Lexicon contain? What s in a Lexicon What kind of Information should a Lexicon contain? The Lexicon Miriam Butt November 2002 Semantic: information about lexical meaning and relations (thematic roles, selectional restrictions,

More information

Adaptive Automated GUI Testing Producing Test Frameworks to Withstand Change

Adaptive Automated GUI Testing Producing Test Frameworks to Withstand Change Adaptive Automated GUI Testing Producing Test Frameworks to Withstand Change Abstract Although QA and Development managers may see many challenges in creating an automated GUI testing framework, it is

More information

Overview of the TACITUS Project

Overview of the TACITUS Project Overview of the TACITUS Project Jerry R. Hobbs Artificial Intelligence Center SRI International 1 Aims of the Project The specific aim of the TACITUS project is to develop interpretation processes for

More information

Special Topics in Computer Science

Special Topics in Computer Science Special Topics in Computer Science NLP in a Nutshell CS492B Spring Semester 2009 Jong C. Park Computer Science Department Korea Advanced Institute of Science and Technology INTRODUCTION Jong C. Park, CS

More information

Robustness of a Spoken Dialogue Interface for a Personal Assistant

Robustness of a Spoken Dialogue Interface for a Personal Assistant Robustness of a Spoken Dialogue Interface for a Personal Assistant Anna Wong, Anh Nguyen and Wayne Wobcke School of Computer Science and Engineering University of New South Wales Sydney NSW 22, Australia

More information

Building a Question Classifier for a TREC-Style Question Answering System

Building a Question Classifier for a TREC-Style Question Answering System Building a Question Classifier for a TREC-Style Question Answering System Richard May & Ari Steinberg Topic: Question Classification We define Question Classification (QC) here to be the task that, given

More information

Simple maths for keywords

Simple maths for keywords Simple maths for keywords Adam Kilgarriff Lexical Computing Ltd adam@lexmasterclass.com Abstract We present a simple method for identifying keywords of one corpus vs. another. There is no one-sizefits-all

More information

Convergence of Translation Memory and Statistical Machine Translation

Convergence of Translation Memory and Statistical Machine Translation Convergence of Translation Memory and Statistical Machine Translation Philipp Koehn and Jean Senellart 4 November 2010 Progress in Translation Automation 1 Translation Memory (TM) translators store past

More information

Hybrid Machine Translation Guided by a Rule Based System

Hybrid Machine Translation Guided by a Rule Based System Hybrid Machine Translation Guided by a Rule Based System Cristina España-Bonet, Gorka Labaka, Arantza Díaz de Ilarraza, Lluís Màrquez Kepa Sarasola Universitat Politècnica de Catalunya University of the

More information

The Reporting Console

The Reporting Console Chapter 1 The Reporting Console This chapter provides a tour of the WebTrends Reporting Console and describes how you can use it to view WebTrends reports. It also provides information about how to customize

More information

Master of Arts in Linguistics Syllabus

Master of Arts in Linguistics Syllabus Master of Arts in Linguistics Syllabus Applicants shall hold a Bachelor s degree with Honours of this University or another qualification of equivalent standard from this University or from another university

More information

SYSTRAN v6 Quick Start Guide

SYSTRAN v6 Quick Start Guide SYSTRAN v6 Quick Start Guide 2 SYSTRAN v6 Quick Start Guide SYSTRAN Business Translator SYSTRAN Premium Translator Congratulations on your SYSTRAN v6 purchase which uses the latest generation of language

More information

Computer Assisted Language Learning (CALL): Room for CompLing? Scott, Stella, Stacia

Computer Assisted Language Learning (CALL): Room for CompLing? Scott, Stella, Stacia Computer Assisted Language Learning (CALL): Room for CompLing? Scott, Stella, Stacia Outline I What is CALL? (scott) II Popular language learning sites (stella) Livemocha.com (stacia) III IV Specific sites

More information

LINGSTAT: AN INTERACTIVE, MACHINE-AIDED TRANSLATION SYSTEM*

LINGSTAT: AN INTERACTIVE, MACHINE-AIDED TRANSLATION SYSTEM* LINGSTAT: AN INTERACTIVE, MACHINE-AIDED TRANSLATION SYSTEM* Jonathan Yamron, James Baker, Paul Bamberg, Haakon Chevalier, Taiko Dietzel, John Elder, Frank Kampmann, Mark Mandel, Linda Manganaro, Todd Margolis,

More information

Extraction of Legal Definitions from a Japanese Statutory Corpus Toward Construction of a Legal Term Ontology

Extraction of Legal Definitions from a Japanese Statutory Corpus Toward Construction of a Legal Term Ontology Extraction of Legal Definitions from a Japanese Statutory Corpus Toward Construction of a Legal Term Ontology Makoto Nakamura, Yasuhiro Ogawa, Katsuhiko Toyama Japan Legal Information Institute, Graduate

More information

TS3: an Improved Version of the Bilingual Concordancer TransSearch

TS3: an Improved Version of the Bilingual Concordancer TransSearch TS3: an Improved Version of the Bilingual Concordancer TransSearch Stéphane HUET, Julien BOURDAILLET and Philippe LANGLAIS EAMT 2009 - Barcelona June 14, 2009 Computer assisted translation Preferred by

More information

The Role of Sentence Structure in Recognizing Textual Entailment

The Role of Sentence Structure in Recognizing Textual Entailment Blake,C. (In Press) The Role of Sentence Structure in Recognizing Textual Entailment. ACL-PASCAL Workshop on Textual Entailment and Paraphrasing, Prague, Czech Republic. The Role of Sentence Structure

More information

Technology and Writing Assessment: Lessons Learned from the US National Assessment of Educational Progress 1

Technology and Writing Assessment: Lessons Learned from the US National Assessment of Educational Progress 1 Technology and Writing Assessment: Lessons Learned from the US National Assessment of Educational Progress 1 Randy Elliot Bennett Educational Testing Service Princeton, NJ 08541 rbennett@ets.org 1 This

More information

CALICO Journal, Volume 9 Number 1 9

CALICO Journal, Volume 9 Number 1 9 PARSING, ERROR DIAGNOSTICS AND INSTRUCTION IN A FRENCH TUTOR GILLES LABRIE AND L.P.S. SINGH Abstract: This paper describes the strategy used in Miniprof, a program designed to provide "intelligent' instruction

More information

4.1 Multilingual versus bilingual systems

4.1 Multilingual versus bilingual systems This chapter is devoted to some fundamental questions on the basic strategies of MT systems. These concern decisions which designers of MT systems have to address before any construction can start, and

More information

Historical Linguistics. Diachronic Analysis. Two Approaches to the Study of Language. Kinds of Language Change. What is Historical Linguistics?

Historical Linguistics. Diachronic Analysis. Two Approaches to the Study of Language. Kinds of Language Change. What is Historical Linguistics? Historical Linguistics Diachronic Analysis What is Historical Linguistics? Historical linguistics is the study of how languages change over time and of their relationships with other languages. All languages

More information

AN INTERACTIVE ON-LINE MACHINE TRANSLATION SYSTEM (CHINESE INTO ENGLISH)

AN INTERACTIVE ON-LINE MACHINE TRANSLATION SYSTEM (CHINESE INTO ENGLISH) [From: Translating and the Computer, B.M. Snell (ed.), North-Holland Publishing Company, 1979] AN INTERACTIVE ON-LINE MACHINE TRANSLATION SYSTEM (CHINESE INTO ENGLISH) Shiu-Chang LOH and Luan KONG Hung

More information

Classification of Natural Language Interfaces to Databases based on the Architectures

Classification of Natural Language Interfaces to Databases based on the Architectures Volume 1, No. 11, ISSN 2278-1080 The International Journal of Computer Science & Applications (TIJCSA) RESEARCH PAPER Available Online at http://www.journalofcomputerscience.com/ Classification of Natural

More information

Suite. How to Use GrandMaster Suite. Exporting with ODBC

Suite. How to Use GrandMaster Suite. Exporting with ODBC Suite How to Use GrandMaster Suite Exporting with ODBC This page intentionally left blank ODBC Export 3 Table of Contents: HOW TO USE GRANDMASTER SUITE - EXPORTING WITH ODBC...4 OVERVIEW...4 WHAT IS ODBC?...

More information

Statistical Pattern-Based Machine Translation with Statistical French-English Machine Translation

Statistical Pattern-Based Machine Translation with Statistical French-English Machine Translation Statistical Pattern-Based Machine Translation with Statistical French-English Machine Translation Jin'ichi Murakami, Takuya Nishimura, Masato Tokuhisa Tottori University, Japan Problems of Phrase-Based

More information

Chapter 5. Phrase-based models. Statistical Machine Translation

Chapter 5. Phrase-based models. Statistical Machine Translation Chapter 5 Phrase-based models Statistical Machine Translation Motivation Word-Based Models translate words as atomic units Phrase-Based Models translate phrases as atomic units Advantages: many-to-many

More information

Search and Information Retrieval

Search and Information Retrieval Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search

More information

Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words

Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words , pp.290-295 http://dx.doi.org/10.14257/astl.2015.111.55 Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words Irfan

More information

Extracting translation relations for humanreadable dictionaries from bilingual text

Extracting translation relations for humanreadable dictionaries from bilingual text Extracting translation relations for humanreadable dictionaries from bilingual text Overview 1. Company 2. Translate pro 12.1 and AutoLearn 3. Translation workflow 4. Extraction method 5. Extended

More information

The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2

The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2 2nd International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2016) The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2 1 School of

More information

2-3 Automatic Construction Technology for Parallel Corpora

2-3 Automatic Construction Technology for Parallel Corpora 2-3 Automatic Construction Technology for Parallel Corpora We have aligned Japanese and English news articles and sentences, extracted from the Yomiuri and the Daily Yomiuri newspapers, to make a large

More information

MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts

MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts Julio Villena-Román 1,3, Sara Lana-Serrano 2,3 1 Universidad Carlos III de Madrid 2 Universidad Politécnica de Madrid 3 DAEDALUS

More information

Testing Data-Driven Learning Algorithms for PoS Tagging of Icelandic

Testing Data-Driven Learning Algorithms for PoS Tagging of Icelandic Testing Data-Driven Learning Algorithms for PoS Tagging of Icelandic by Sigrún Helgadóttir Abstract This paper gives the results of an experiment concerned with training three different taggers on tagged

More information

Petrel TIPS&TRICKS from SCM

Petrel TIPS&TRICKS from SCM Petrel TIPS&TRICKS from SCM Knowledge Worth Sharing Histograms and SGS Modeling Histograms are used daily for interpretation, quality control, and modeling in Petrel. This TIPS&TRICKS document briefly

More information

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518 International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 INTELLIGENT MULTIDIMENSIONAL DATABASE INTERFACE Mona Gharib Mohamed Reda Zahraa E. Mohamed Faculty of Science,

More information

CONSTRAINING THE GRAMMAR OF APS AND ADVPS. TIBOR LACZKÓ & GYÖRGY RÁKOSI http://ieas.unideb.hu/ rakosi, laczko http://hungram.unideb.

CONSTRAINING THE GRAMMAR OF APS AND ADVPS. TIBOR LACZKÓ & GYÖRGY RÁKOSI http://ieas.unideb.hu/ rakosi, laczko http://hungram.unideb. CONSTRAINING THE GRAMMAR OF APS AND ADVPS IN HUNGARIAN AND ENGLISH: CHALLENGES AND SOLUTIONS IN AN LFG-BASED COMPUTATIONAL PROJECT TIBOR LACZKÓ & GYÖRGY RÁKOSI http://ieas.unideb.hu/ rakosi, laczko http://hungram.unideb.hu

More information

RRSS - Rating Reviews Support System purpose built for movies recommendation

RRSS - Rating Reviews Support System purpose built for movies recommendation RRSS - Rating Reviews Support System purpose built for movies recommendation Grzegorz Dziczkowski 1,2 and Katarzyna Wegrzyn-Wolska 1 1 Ecole Superieur d Ingenieurs en Informatique et Genie des Telecommunicatiom

More information

31 Case Studies: Java Natural Language Tools Available on the Web

31 Case Studies: Java Natural Language Tools Available on the Web 31 Case Studies: Java Natural Language Tools Available on the Web Chapter Objectives Chapter Contents This chapter provides a number of sources for open source and free atural language understanding software

More information