3rd Cameleon Workshop, UFRGS. Written corpora for human beings. Achille Falaise, LIG-GETALP & LIDILEM

Motivations

For lexical studies, language learning, translation, etc., dictionaries and grammars are not enough: we need context!

- Textual context, e.g. "In this picture, you can see a lot of large rocks on the banks of the river."
- Syntactic context
- Structural context
- Thematic context: domain, topic, document type, etc.

Consequence: lots of information to store

We keep all of this information:

- Text
- Lemmas and POS
- Syntactic trees
- Structural information
- Document metadata (document type, topic, author, etc.)

Example of the stored annotation for one title (text, tokens with lemmas and POS, dependencies, enclosing structure):

<head>
  <SEQ id="f eti.xml ">
    <TXT>The role of apoptotic mimicry in host-parasite interplay: is death the only alternative for altruistic behavior? </TXT>
    <tokens>
      <t i="1" l="the" f="the" c="det" p="d"/>
      <t i="2" l="role" f="role" c="nom?s" p="n"/>
      <t i="3" l="of" f="of" c="prep" p="o"/>
      <t i="4" l="apoptotic" f="apoptotic" c="adj" p="a"/>
      <t i="5" l="mimicry" f="mimicry" c="nom?s" p="n"/>
      [...]
    </tokens>
    <dependances>
      <g r="det" s="2" c="1"/>
      <g r="prep" s="2" c="3"/>
      <g r="nomprep" s="3" c="5"/>
      <g r="nn" s="5" c="4"/>
      [...]
    </dependances>
  </SEQ>
</head>
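To make the annotation layers above concrete, here is a minimal sketch of how the token and dependency elements could be read back with Python's standard library. It is an illustration, not the tool's actual code. The attribute meanings are assumptions inferred from the sample: `i` as token index, `f` as surface form, `l` as lemma, `r` as relation name, `s` as governor index and `c` as dependent index. The `<SEQ>` wrapper is trimmed to the five tokens shown, and `read_annotations` is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

# Fragment copied from the slide sample; the elided parts ([...]) are left out.
SAMPLE = """
<SEQ>
  <tokens>
    <t i="1" l="the" f="the" c="det" p="d"/>
    <t i="2" l="role" f="role" c="nom?s" p="n"/>
    <t i="3" l="of" f="of" c="prep" p="o"/>
    <t i="4" l="apoptotic" f="apoptotic" c="adj" p="a"/>
    <t i="5" l="mimicry" f="mimicry" c="nom?s" p="n"/>
  </tokens>
  <dependances>
    <g r="det" s="2" c="1"/>
    <g r="prep" s="2" c="3"/>
    <g r="nomprep" s="3" c="5"/>
    <g r="nn" s="5" c="4"/>
  </dependances>
</SEQ>
"""


def read_annotations(xml_text):
    """Return (forms, deps) from one annotated sequence.

    forms maps a token index (string) to its surface form.
    deps is a list of (relation, governor form, dependent form) triples,
    assuming `s` is the governor and `c` the dependent.
    """
    root = ET.fromstring(xml_text.strip())
    forms = {t.get("i"): t.get("f") for t in root.iter("t")}
    deps = [
        (g.get("r"), forms[g.get("s")], forms[g.get("c")])
        for g in root.iter("g")
    ]
    return forms, deps


forms, deps = read_annotations(SAMPLE)
for relation, governor, dependent in deps:
    print(f"{relation}: {governor} -> {dependent}")
```

Resolving the indices to word forms this way shows why the layers are stored together: a dependency only becomes readable once it is joined with the token layer.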

But what about the users?

Linguists, language learners, translators...

- Not computer scientists
- Easily disturbed by complex interfaces
- Do not read manuals

ScienQuest: corpora for linguists and language learners

Spiral development model: advice from users, development, test with users.

Multiple corpora:

- English scientific texts (15M words)
- English learners' reports (1M words)
- French scientific texts (5M words)
- French press (200M words)
- German press (200M words)
- Spanish press (200M words)
- English press (200M words)
- Russian literature (500k words)

Demo

ScienQuest: results

Since June 2010: 9,021 requests (1,474 sessions).

[Chart: number of requests and number of sessions per month, July 2010 to April 2011]

- Assisted search: 75% (of which 47% with syntax)
- Semantic search: 23%
- Advanced search: 2%
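As a rough reading aid for these usage figures, the short sketch below derives a few secondary numbers from them. It assumes the percentages are shares of all requests and that "47% with syntax" is a share of assisted searches only. The slide does not state either point, so the derived counts are estimates, not reported results.

```python
# Figures reported on the slide.
requests = 9021
sessions = 1474
share = {"assisted": 0.75, "semantic": 0.23, "advanced": 0.02}
syntax_within_assisted = 0.47  # assumed to be a share of assisted searches

# Average number of requests per session.
requests_per_session = requests / sessions

# Estimated request counts per search mode (rounded, since only
# percentages are given on the slide).
estimated = {mode: round(requests * s) for mode, s in share.items()}

# Share of all requests that are assisted searches using syntax.
assisted_with_syntax_share = share["assisted"] * syntax_within_assisted

print(f"requests per session: {requests_per_session:.2f}")
for mode, count in estimated.items():
    print(f"{mode}: about {count} requests")
print(f"assisted with syntax: {assisted_with_syntax_share:.1%} of all requests")
```

Under these assumptions, users issued about six requests per session, and roughly a third of all requests combined the assisted interface with syntactic constraints.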

AXiMAG: corpora by and for translators

Interactive Multilingual Access Gateways (website translation). Two aspects:

- Collaborative (crowdsourcing) building of aligned corpora
- Multilingual access to websites

Demo

AXiMAG: future

- Real-world (and real-user) test and evaluation, e.g. the LICIA website
- Prototype operationalisation
- AXiMAG society

Thanks! Questions, comments, corpora?


More information

LabelTranslator - A Tool to Automatically Localize an Ontology

LabelTranslator - A Tool to Automatically Localize an Ontology LabelTranslator - A Tool to Automatically Localize an Ontology Mauricio Espinoza 1, Asunción Gómez Pérez 1, and Eduardo Mena 2 1 UPM, Laboratorio de Inteligencia Artificial, 28660 Boadilla del Monte, Spain

More information

CA4003 - Compiler Construction

CA4003 - Compiler Construction CA4003 - Compiler Construction David Sinclair Overview This module will cover the compilation process, reading and parsing a structured language, storing it in an appropriate data structure, analysing

More information

Teach English Like Never Before. Online Education by

Teach English Like Never Before. Online Education by Teach English Like Never Before Online Education by Learning Philosophy At Savivo we take pride in our educational programs and games, we believe that education is the foundation of the future. We develop

More information

MONTHLY BULLETIN - N 240 May 2010

MONTHLY BULLETIN - N 240 May 2010 MONTHLY BULLETIN - N 24 May 21 DEBT GENERAL DATA... 1 PRIMARY MARKET... 2 SECONDARY MARKET... 4 TRIBUNE.... 6 THE FRENCH ECONOMY... 7 INTERNATIONAL COMPARISONS... 8 FRENCH GOVERNMENT NEGOTIABLE DEBT OUTSTANDING...

More information

How to make Ontologies self-building from Wiki-Texts

How to make Ontologies self-building from Wiki-Texts How to make Ontologies self-building from Wiki-Texts Bastian HAARMANN, Frederike GOTTSMANN, and Ulrich SCHADE Fraunhofer Institute for Communication, Information Processing & Ergonomics Neuenahrer Str.

More information

Text: Rice, Anne-Christine. Cinema for French Conversation. 3rd Edition. Focus Publishing 2009.

Text: Rice, Anne-Christine. Cinema for French Conversation. 3rd Edition. Focus Publishing 2009. Course Syllabus: FRCH 2000 French Cinema and Conversation University of Central Oklahoma Department of Modern Languages, Literatures, and Cultural Studies Class meets MWF 11-11:50 in 234 LA CRN #28608

More information

Kybots, knowledge yielding robots German Rigau IXA group, UPV/EHU http://ixa.si.ehu.es

Kybots, knowledge yielding robots German Rigau IXA group, UPV/EHU http://ixa.si.ehu.es KYOTO () Intelligent Content and Semantics Knowledge Yielding Ontologies for Transition-Based Organization http://www.kyoto-project.eu/ Kybots, knowledge yielding robots German Rigau IXA group, UPV/EHU

More information

The PALAVRAS parser and its Linguateca applications - a mutually productive relationship

The PALAVRAS parser and its Linguateca applications - a mutually productive relationship The PALAVRAS parser and its Linguateca applications - a mutually productive relationship Eckhard Bick University of Southern Denmark eckhard.bick@mail.dk Outline Flow chart Linguateca Palavras History

More information

Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives

Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives Ramona Enache and Adam Slaski Department of Computer Science and Engineering Chalmers University of Technology and

More information

Workshop. Neil Barrett PhD, Jens Weber PhD, Vincent Thai MD. Engineering & Health Informa2on Science

Workshop. Neil Barrett PhD, Jens Weber PhD, Vincent Thai MD. Engineering & Health Informa2on Science Engineering & Health Informa2on Science Engineering NLP Solu/ons for Structured Informa/on from Clinical Text: Extrac'ng Sen'nel Events from Pallia've Care Consult Le8ers Canada-China Clean Energy Initiative

More information

Master of Arts in Linguistics Syllabus

Master of Arts in Linguistics Syllabus Master of Arts in Linguistics Syllabus Applicants shall hold a Bachelor s degree with Honours of this University or another qualification of equivalent standard from this University or from another university

More information

997 Functional Acknowledgment

997 Functional Acknowledgment 997 Functional Acknowledgment Version: 1.0 Draft Author: Margie Stewart Publication: 06/10/2013 Notes: Table of Contents 997 Functional Acknowledgment.......................................................................................

More information

English Grammar Checker

English Grammar Checker International l Journal of Computer Sciences and Engineering Open Access Review Paper Volume-4, Issue-3 E-ISSN: 2347-2693 English Grammar Checker Pratik Ghosalkar 1*, Sarvesh Malagi 2, Vatsal Nagda 3,

More information

Français 3 le 6/7 novembre 2013

Français 3 le 6/7 novembre 2013 Français 3 le 6/7 novembre 2013 Classwork : French prepositions with Geographical names (cities, countries and states) 1. PowerPoint on French prepositions with Geographical names (cities, countries and

More information

Organization of DSLE part. Overview of DSLE. Model driven software engineering. Engineering. Tooling. Topics:

Organization of DSLE part. Overview of DSLE. Model driven software engineering. Engineering. Tooling. Topics: Organization of DSLE part Domain Specific Language Engineering Tooling Eclipse plus EMF Xtext, Xtend, Xpand, QVTo and ATL Prof.dr. Mark van den Brand GLT 2010/11 Topics: Meta-modeling Model transformations

More information

CLIR-Based Collaborative Construction of a Multilingual Terminological Dictionary for Cultural Resources

CLIR-Based Collaborative Construction of a Multilingual Terminological Dictionary for Cultural Resources CLIR-Based Collaborative Construction of a Multilingual Terminological Dictionary for Cultural Resources Mohammad DAOUD, Mohammad.Daoud@imag.fr 1 Asanobu KITAMOTO, Kitamoto@nii.ac.jp 2 Christian BOITET,

More information

KHRESMOI. Medical Information Analysis and Retrieval

KHRESMOI. Medical Information Analysis and Retrieval KHRESMOI Medical Information Analysis and Retrieval Integrated Project Budget: EU Contribution: Partners: Duration: 10 Million Euro 8 Million Euro 12 Institutions 9 Countries 4 Years 1 Sep 2010-31 Aug

More information

IP5 MMT Project. June 2012. Korean Intellectual Property Office

IP5 MMT Project. June 2012. Korean Intellectual Property Office IP5 MMT Project June 2012 Korean Intellectual Property Office 1. Background 2. IP5 Efforts to Improve MMT 3. Issues to be Considered 1. Background Power Shift Convertibility of information based on automatic

More information

CINTIL-PropBank. CINTIL-PropBank Sub-corpus id Sentences Tokens Domain Sentences for regression atsts 779 5,654 Test

CINTIL-PropBank. CINTIL-PropBank Sub-corpus id Sentences Tokens Domain Sentences for regression atsts 779 5,654 Test CINTIL-PropBank I. Basic Information 1.1. Corpus information The CINTIL-PropBank (Branco et al., 2012) is a set of sentences annotated with their constituency structure and semantic role tags, composed

More information

CONSTRAINING THE GRAMMAR OF APS AND ADVPS. TIBOR LACZKÓ & GYÖRGY RÁKOSI http://ieas.unideb.hu/ rakosi, laczko http://hungram.unideb.

CONSTRAINING THE GRAMMAR OF APS AND ADVPS. TIBOR LACZKÓ & GYÖRGY RÁKOSI http://ieas.unideb.hu/ rakosi, laczko http://hungram.unideb. CONSTRAINING THE GRAMMAR OF APS AND ADVPS IN HUNGARIAN AND ENGLISH: CHALLENGES AND SOLUTIONS IN AN LFG-BASED COMPUTATIONAL PROJECT TIBOR LACZKÓ & GYÖRGY RÁKOSI http://ieas.unideb.hu/ rakosi, laczko http://hungram.unideb.hu

More information