Service model for semi-automatic generation of multilingual terminology resources

Size: px
Start display at page:

Download "Service model for semi-automatic generation of multilingual terminology resources"

Transcription

1 Service model for semi-automatic generation of multilingual terminology resources Mārcis Pinnis, Andrejs Vasiļjevs, Tatiana Gornostay Tilde TKE 2014 / Berlin/

2

3

4 Complexity of terminology work Term identification in the source text Consulting online databases and local files for translation equivalents Creating and maintaining terminology glossaries Sharing term glossaries and involving others in their polishing Structuring data in the industry standard formats Integrating term glossaries in CAT and other productivity tools Keeping terminology up to date etc.

5 SaaS Software as a Service

6 PaaS Platform as a Service

7 TaaS Terminology as a Service

8 Cloud-based service model The automation of individual tasks in terminology work is rendered as a set of interoperable cloud-based services integrated into workflows Collaborative involvement of users in the refinement and enrichment of raw terminological data Automated sharing and integration of the terminology in various use cases by language workers and language processing applications: computer-assisted translation tools machine translation systems terminology management and terminology lookup platforms etc.

9 Terminology as a Service cloud-based platform for acquiring, cleaning up, sharing, and reusing multilingual terminological data

10 TaaS Partners Tilde TAUS Kilgray University of Cologne University of Sheffield Latvia (Coordinator) Netherlands Hungary Germany UK

11 TaaS Industry & Research Collaboration project Supported by EU 7 th R&I Framework Programme Resulted in TaaS cloud-based services Accessible for free at the online portal termunity.com

12 TaaS User Needs Survey Results Professional profile Potential TaaS user groups: What professional groups we address? Technical writers Translators Interpreters Software localizers Domain experts Language teachers/learners Terminologists 7.6% 6.4% 5.1% 4.8% 3.3% 2.8% 1.0% 31.0% 38.0% Technical writer Translator Other Interpreter Terminologist Domain expert Language teacher Software localizer Language learner The prevailing working status of the potential users of TaaS are Employed with an industrial company with 51.1% (mainly technical editors/writers and terminologists) and Freelance with 34.6% (mainly translators and interpreters).

13 TaaS User Needs Survey Results: Importance of terminology work 14.8% 1.8% 43.5% Very important Quite important Less important Not important 39.9%

14 TaaS User Needs Survey Results Addressing the actual needs of potential users: Problems faced when doing online terminology search and optimization potential in this area. Problematic issues % Lack of resources/insufficient terminology management 24.5 Poor quality/up-to-dateness 14.5 Lack of information (contexts, definitions, local and companyspecific use etc.) 13.6 Lack of convincing verification/misleading information online 12.3 Inconsistent use of terminology 10.9 Technical issues 8.9 Lack of covered specific subject fields 6.6 Lack of freely accessible standardized data/ Unwillingness to share information 5.4 Ambiguity/Synonymy 4.5 Lack of covered languages and language pairs 4.1 Lack of equivalents 2.4 Other 11.2

15 TaaS User Needs Survey Results Technical specification of the TaaS platform: Mostly used programs and formats when doing terminology work. Mostly used formats DOC XML TXT RTF TMX CSV XLIFF TBX Other 9.6% 12.2% 17.6% 24.4% 26.8% 33.0% 45.1% 52.4% 84.0%

16 TaaS User Needs Survey: willingness to share 60.5% 39.5% Yes, provided that No, because 16,7% 24,9% 8.3% 4.6% 6,0% 16.5% 48.6% 7,6% 11,4% 19,2% 14,2% Joint contribution to the DB Access control Legal aspects External quality control Little effort Anonymity Other 22.0% Legal restrictions Poor quality/lack of time Own asset Risk of misunderstanding

17 TaaS Goals Simplify the process for language workers to prepare, store and share of task-specific multilingual term glossaries Provide instant access to term translation equivalents and translation candidates for professional translators through CAT tools Domain adaptation of statistical machine translation systems by dynamic integration with TaaS provided terminology data

18 Key services of TaaS Automatic extraction of monolingual term candidates from user uploaded documents Automatic retrieval of translation equivalents from different public and industry terminology databases Translation candidate acquisition from multilingual web data Facilities for cleaning-up by users automatically acquired terminological data; Data sharing and integration facilities through APIs and export tools

19 TaaS Services

20 Simple Search Search for terms in various sources

21 Term candidate identification Tagging the document using part-of-speech or morphosyntactic taggers Term candidate identification using: linguistic filtering with morpho-syntactic term phrase regular expressions statistical ranking methods with co-occurrence measures (e.g., log likelihood, modified mutual information etc.) for terms of two or more tokens and the TF*IDF measure for unigram terms Marking terms in the document

22 Evaluation of service for term candidate extraction Language Information Technology Mechanical Engineering Correct Total Precision Correct Total Precision English % % German % % Hungarian % % Latvian % % Comment: The average agreement rate of the Latvian annotators was only 63.3%.

23 Automatic retrieval of translation equivalents from terminology databases Normalization of terms transforming terms from their surface forms into corresponding canonical forms as used in dictionaries and term banks Lemmatization for single word terms Rule-based methods for multiword terms Automatic search and retrieval from terminology databases: EuroTermBank databank of federated multilingual terminology IATE inter-institutional termbank of European Union TAUS Data repository of multilingual translation memories

24 Service for Translation Equivalent Retrieval from the Web Extracting bilingual terminology from the parallel corpora using bilingual phrase alignment techniques Terminology extraction from focused comparable corpora Focussed Monolingual Crawler FMC (Mastropavlos and Papavassiliou, 2011) Bilingual terminology extraction from comparable RSS news corpora Bilingual terminology extraction from Wikipedia Application of the ACCURAT Toolkit Context independent term mapping - (Pinnis, 2013) Supervised term mapping - Aker et al. (2013)

25 Workflows for multilingual terminology extraction from comparable Web corpora

26 Visualised example of terminological data from the Statistical Data Base

27 Automated Lookup

28

29 Term identification and annotation

30 Identifying and marking terms New W3C standard for Internationalization Tag Set ITS 2.0 ITS 2.0 enriched content Showcase Web Page ITS 2.0 enriched content Terminology Annotation Web Service API Plaintext TaaS Terminology Services Human users (e.g., translators, terminologists) ITS2.0 term-annotated content export / visualisation ITS2.0 term-annotated content ITS 2.0 enriched content Machine users Term-annotated content ITS2.0 term-annotated content CAT Tools MT Systems

31 Term Annotation

32 HTML Term Annotation Term entries for terms identified in EuroTermBank are stored in TBX format in a <script> element that is placed in the HTML5 document.

33 Refine & Approve

34 Share & Collaborate

35 Integration Support for industry standard formats Integration into CAT and productivity tools API to integrate TaaS services into various software applications

36 http/https html TaaS Architecture Web Browsers CAT tools MT External TDBs Presentation Layer https REST https REST https REST Web Page UI Public API included included Application Logic Layer Terminology collection management User management Terminology collection search Terminology collection creation Data Storage Layer (Shared Term Repository) Term extraction workflows Full collection creation workflow Monolingual collection creation Translation candidate extraction Text tagging with terms... Shared Term Repository DB File Store Statistical DB High-performance Computing (HPC) Cluster HPC frontend CPU CPU CPU CPU CPU CPU CPU CPU SGE CPU CPU CPU CPU Modules Term extraction TXT extractor TWSC Kilgray Term Extractor Term normalizer Collection creator Statistical DB acquisition Parameter retriever Bilingual Term Extraction System Statistical DB feeding Translation lookup ETB & STR IATE TAUS API Statistical DB Collection merger Result processing Collection Importer Marked Text enrichment

37 koks timber How to instruct SMT to use the right terms?

38 Put TaaS in the service for MT

39

40 Boost in the quality of machine translation Narrow Domain Automotive MT English Latvian DATA 2 M unique parallel sentences 1.9 M monolingual sentences 0.2 M in-domain monolingual QUALITY 16% improvement from terminology integration

41 Try it yourself! termunity.com

42 Towards a Public EU Infrastructure for Automated Translation Deliver the strategic vision, functional, technical and operational specifications of the infrastructure for EU public service for automated translation and other multilingual services and resources Design a sustainable governance model for the multilingual infrastructure Foster multi-stakeholders alliances to ensure its commitment and support for the deployment of the MLi and its usage

43 Thank you! The research within the project TaaS leading to these results has received funding from the European Union Seventh Framework Programme (FP7/ ), Grant Agreements n and n

Question template for interviews

Question template for interviews Question template for interviews This interview template creates a framework for the interviews. The template should not be considered too restrictive. If an interview reveals information not covered by

More information

ACCURAT Analysis and Evaluation of Comparable Corpora for Under Resourced Areas of Machine Translation www.accurat-project.eu Project no.

ACCURAT Analysis and Evaluation of Comparable Corpora for Under Resourced Areas of Machine Translation www.accurat-project.eu Project no. ACCURAT Analysis and Evaluation of Comparable Corpora for Under Resourced Areas of Machine Translation www.accurat-project.eu Project no. 248347 Deliverable D5.4 Report on requirements, implementation

More information

Collecting Polish German Parallel Corpora in the Internet

Collecting Polish German Parallel Corpora in the Internet Proceedings of the International Multiconference on ISSN 1896 7094 Computer Science and Information Technology, pp. 285 292 2007 PIPS Collecting Polish German Parallel Corpora in the Internet Monika Rosińska

More information

XTM for Language Service Providers Explained

XTM for Language Service Providers Explained XTM for Language Service Providers Explained 1. Introduction There is a new generation of Computer Assisted Translation (CAT) tools available based on the latest Web 2.0 technology. These systems are more

More information

XTM Cloud Explained. XTM Cloud Explained. Better Translation Technology. Page 1

XTM Cloud Explained. XTM Cloud Explained. Better Translation Technology. Page 1 XTM Cloud Explained Better Translation Technology Page 1 Documentation for XTM Cloud Explained Published by XTM International Ltd. Copyright XTM International Ltd. All rights reserved. No part of this

More information

The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge

The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge White Paper October 2002 I. Translation and Localization New Challenges Businesses are beginning to encounter

More information

Translation and Localization Services

Translation and Localization Services Translation and Localization Services Company Overview InterSol, Inc., a California corporation founded in 1996, provides clients with international language solutions. InterSol delivers multilingual solutions

More information

LetsMT!: A Cloud-Based Platform for Do-It-Yourself Machine Translation

LetsMT!: A Cloud-Based Platform for Do-It-Yourself Machine Translation LetsMT!: A Cloud-Based Platform for Do-It-Yourself Machine Translation Andrejs Vasiļjevs Raivis Skadiņš Jörg Tiedemann TILDE TILDE Uppsala University Vienbas gatve 75a, Riga Vienbas gatve 75a, Riga Box

More information

Getting Off to a Good Start: Best Practices for Terminology

Getting Off to a Good Start: Best Practices for Terminology Getting Off to a Good Start: Best Practices for Terminology Technologies for term bases, term extraction and term checks Angelika Zerfass, zerfass@zaac.de Tools in the Terminology Life Cycle Extraction

More information

An Online Service for SUbtitling by MAchine Translation

An Online Service for SUbtitling by MAchine Translation SUMAT CIP-ICT-PSP-270919 An Online Service for SUbtitling by MAchine Translation Annual Public Report 2011 Editor(s): Contributor(s): Reviewer(s): Status-Version: Volha Petukhova, Arantza del Pozo Mirjam

More information

Intel s Localization BUS Initiative To XLIFF or not to XLIFF. Loïc Dufresne de Virel Localization Strategist

Intel s Localization BUS Initiative To XLIFF or not to XLIFF. Loïc Dufresne de Virel Localization Strategist Intel s Localization BUS Initiative To XLIFF or not to XLIFF Loïc Dufresne de Virel Localization Strategist Intel Information Technology DISCLAIMERS I m presenting here what we are doing Still a work in

More information

How the Multilingual Semantic Web can meet the Multilingual Web A Position Paper. Felix Sasaki DFKI / W3C Fellow fsasaki@w3.org

How the Multilingual Semantic Web can meet the Multilingual Web A Position Paper. Felix Sasaki DFKI / W3C Fellow fsasaki@w3.org How the Multilingual Semantic Web can meet the Multilingual Web A Position Paper Felix Sasaki DFKI / W3C Fellow fsasaki@w3.org The success of the Web is not based on technology. It is rather based on the

More information

LISE Legal Language Interoperability Services

LISE Legal Language Interoperability Services LISE Legal Language Interoperability Services Gerhard Budin University of Vienna Austrian Academy of Sciences 27 th of April, 2011 JIAMCATT meeting at ITC ILO - Turin Main objectives of the Project The

More information

Localizing dynamic websites created from open source content management systems

Localizing dynamic websites created from open source content management systems Localizing dynamic websites created from open source content management systems memoqfest 2012, May 10, 2012, Budapest Daniel Zielinski Martin Beuster Loctimize GmbH [daniel martin]@loctimize.com www.loctimize.com

More information

Survey Results: Requirements and Use Cases for Linguistic Linked Data

Survey Results: Requirements and Use Cases for Linguistic Linked Data Survey Results: Requirements and Use Cases for Linguistic Linked Data 1 Introduction This survey was conducted by the FP7 Project LIDER (http://www.lider-project.eu/) as input into the W3C Community Group

More information

Global Solutions. Table of Contents

Global Solutions. Table of Contents Global Solutions LINGOTEK PRODUCT BRIEF Table of Contents Current Lingotek Customers...2 The Challenge of Translation and Localization Today...3 Lingotek s Complete Offering...4 The Lingotek Solution...5

More information

Talend Metadata Manager. Reduce Risk and Friction in your Information Supply Chain

Talend Metadata Manager. Reduce Risk and Friction in your Information Supply Chain Talend Metadata Manager Reduce Risk and Friction in your Information Supply Chain Talend Metadata Manager Talend Metadata Manager provides a comprehensive set of capabilities for all facets of metadata

More information

Introduction to OpenTM2 An Open Source Solution for Translators

Introduction to OpenTM2 An Open Source Solution for Translators Introduction to An Open Source Solution for Translators Gerhard Fetz & Michael Schneider October 25, 2012 Version 1.0 Agenda General Overview Strategy & Vision Objectives & Benefits The Global Architecture

More information

Multilingual Translation Services

Multilingual Translation Services Multilingual Translation Services 25 El. Venizelou Str., Athens GR-17671, GREECE T: +30 210 9571777 F: +30 210 9571787 W: www.paspartu.gr @: info@paspartu.gr Multilingual Translation Services COMPANY PROFILE

More information

Introduction to the Translation Workspace

Introduction to the Translation Workspace Introduction to the Translation Workspace What is the Translation Workspace? The Lionbridge Translation Workspace is a next-generation cloud computing system that combines Live Assets TM and review packages

More information

K@ A collaborative platform for knowledge management

K@ A collaborative platform for knowledge management White Paper K@ A collaborative platform for knowledge management Quinary SpA www.quinary.com via Pietrasanta 14 20141 Milano Italia t +39 02 3090 1500 f +39 02 3090 1501 Copyright 2004 Quinary SpA Index

More information

Towards Application of User-Tailored Machine Translation in Localization

Towards Application of User-Tailored Machine Translation in Localization Towards Application of User-Tailored Machine Translation in Localization Andrejs Vasiļjevs Tilde SIA andrejs@tilde.com Raivis Skadiņš Tilde SIA raivis.skadins@ tilde.lv Inguna Skadiņa Tilde SIA inguna.skadina@

More information

Scientific and Technical Applications as a Service in the Cloud

Scientific and Technical Applications as a Service in the Cloud Scientific and Technical Applications as a Service in the Cloud University of Bern, 28.11.2011 adapted version Wibke Sudholt CloudBroker GmbH Technoparkstrasse 1, CH-8005 Zurich, Switzerland Phone: +41

More information

Prof. Dr. Klemens Waldhör Chief Architect

Prof. Dr. Klemens Waldhör Chief Architect Prof. Dr. Klemens Waldhör Chief Architect klemens.waldhoer@heartsome.de opentms; 23.02.2011; Dr. Klemens Waldhör; www.opentms.de 1 The Project Overview Open TMS Goals Architecture Implementation Current

More information

Extracting translation relations for humanreadable dictionaries from bilingual text

Extracting translation relations for humanreadable dictionaries from bilingual text Extracting translation relations for humanreadable dictionaries from bilingual text Overview 1. Company 2. Translate pro 12.1 and AutoLearn 3. Translation workflow 4. Extraction method 5. Extended

More information

PROMT-Adobe Case Study:

PROMT-Adobe Case Study: For Americas: 330 Townsend St., Suite 117, San Francisco, CA 94107 Tel: (415) 913-7586 Fax: (415) 913-7589 promtamericas@promt.com PROMT-Adobe Case Study: For other regions: 16A Dobrolubova av. ( Arena

More information

Appendix A: Inventory of enrichment efforts and tools initiated in the context of the Europeana Network

Appendix A: Inventory of enrichment efforts and tools initiated in the context of the Europeana Network 1/12 Task Force on Enrichment and Evaluation Appendix A: Inventory of enrichment efforts and tools initiated in the context of the Europeana 29/10/2015 Project Name Type of enrichments Tool for manual

More information

Glossary of translation tool types

Glossary of translation tool types Glossary of translation tool types Tool type Description French equivalent Active terminology recognition tools Bilingual concordancers Active terminology recognition (ATR) tools automatically analyze

More information

HOPS Project presentation

HOPS Project presentation HOPS Project presentation Enabling an Intelligent Natural Language Based Hub for the Deployment of Advanced Semantically Enriched Multi-channel Mass-scale Online Public Services IST-2002-507967 (HOPS)

More information

Automated Translation Quality Assurance and Quality Control. Andrew Bredenkamp Daniel Grasmick Julia V. Makoushina

Automated Translation Quality Assurance and Quality Control. Andrew Bredenkamp Daniel Grasmick Julia V. Makoushina Automated Translation Quality Assurance and Quality Control Andrew Bredenkamp Daniel Grasmick Julia V. Makoushina Andrew Bredenkamp Introductions (all) CEO acrolinx, Computational Linguist, QA Tool Vendor

More information

Joint efforts to further develop and incorporate Apertium into the document management flow at Universitat Oberta de Catalunya

Joint efforts to further develop and incorporate Apertium into the document management flow at Universitat Oberta de Catalunya Joint efforts to further develop and incorporate Apertium into the document management flow at Universitat Oberta de Catalunya Luis Villarejo*, Sergio Ortiz** and Mireia Ginestí** *Learning Technologies

More information

KantanMT.com. www.kantanmt.com. The world s #1 MT Platform. No Hardware. No Software. No Hassle MT.

KantanMT.com. www.kantanmt.com. The world s #1 MT Platform. No Hardware. No Software. No Hassle MT. KantanMT.com No Hardware. No Software. No Hassle MT. The world s #1 MT Platform Communicate globally, easily! Create customized language solutions in the cloud. www.kantanmt.com What is KantanMT.com? KantanMT

More information

Dutch Parallel Corpus

Dutch Parallel Corpus Dutch Parallel Corpus Lieve Macken lieve.macken@hogent.be LT 3, Language and Translation Technology Team Faculty of Applied Language Studies University College Ghent November 29th 2011 Lieve Macken (LT

More information

Hybrid Strategies. for better products and shorter time-to-market

Hybrid Strategies. for better products and shorter time-to-market Hybrid Strategies for better products and shorter time-to-market Background Manufacturer of language technology software & services Spin-off of the research center of Germany/Heidelberg Founded in 1999,

More information

Recent developments in machine translation policy at the European Patent Office

Recent developments in machine translation policy at the European Patent Office Recent developments in machine translation policy at the European Patent Office Dr Georg Artelsmair Director European Co-operation European Patent Office Brussels, 17 November 2010 The European Patent

More information

ICT PSP Call4-2010. Aleksandra Wesolowska. Theme 6: Multilingual Web. infso-e1@ec.europa.eu

ICT PSP Call4-2010. Aleksandra Wesolowska. Theme 6: Multilingual Web. infso-e1@ec.europa.eu ICT PSP Call4-2010 Theme 6: Multilingual Web Aleksandra Wesolowska infso-e1@ec.europa.eu http://cordis.europa.eu/fp7/ict/language-technologies/cip-psp_en.html 2009 Call Call 3 outcome: 8 pilot projects

More information

Machine Translation at the European Commission

Machine Translation at the European Commission Directorate-General for Translation Machine Translation at the European Commission Konferenz 10 Jahre Verbmobil Saarbrücken, 16. November 2010 Andreas Eisele Project Manager Machine Translation, ICT Unit

More information

Terminology Management in the Localization Industry. Results of the LISA Terminology Survey

Terminology Management in the Localization Industry. Results of the LISA Terminology Survey Terminology Management in the Localization Industry Results of the LISA Terminology Survey Kara Warburton, LISA Terminology Advisor October 16, 2001 Terminology Management in the Localization Industry...

More information

How To Make Sense Of Data With Altilia

How To Make Sense Of Data With Altilia HOW TO MAKE SENSE OF BIG DATA TO BETTER DRIVE BUSINESS PROCESSES, IMPROVE DECISION-MAKING, AND SUCCESSFULLY COMPETE IN TODAY S MARKETS. ALTILIA turns Big Data into Smart Data and enables businesses to

More information

ASTIN Goal and the Eastin System

ASTIN Goal and the Eastin System EASTIN goal Give people a tool to search for assistive technology products and related information in the European context. Rehadat (Germany) HMI (Denmark) DLF (United Kingdom) SIVA (Italy) Vlibank (Belgium)

More information

Challenges of Automation in Translation Quality Management

Challenges of Automation in Translation Quality Management Challenges of Automation in Translation Quality Management Berlin, 12.09.2009 Dr. François Massion D.O.G. Dokumentation ohne Grenzen GmbH francois.massion@dog-gmbh.de Overview Quality definition and quality

More information

Sisense. Product Highlights. www.sisense.com

Sisense. Product Highlights. www.sisense.com Sisense Product Highlights Introduction Sisense is a business intelligence solution that simplifies analytics for complex data by offering an end-to-end platform that lets users easily prepare and analyze

More information

Wikipedia and Web document based Query Translation and Expansion for Cross-language IR

Wikipedia and Web document based Query Translation and Expansion for Cross-language IR Wikipedia and Web document based Query Translation and Expansion for Cross-language IR Ling-Xiang Tang 1, Andrew Trotman 2, Shlomo Geva 1, Yue Xu 1 1Faculty of Science and Technology, Queensland University

More information

CONDIS. IT Service Management and CMDB

CONDIS. IT Service Management and CMDB CONDIS IT Service and CMDB 2/17 Table of contents 1. Executive Summary... 3 2. ITIL Overview... 4 2.1 How CONDIS supports ITIL processes... 5 2.1.1 Incident... 5 2.1.2 Problem... 5 2.1.3 Configuration...

More information

Bootstrapping Term Extractors for Multiple Languages

Bootstrapping Term Extractors for Multiple Languages Bootstrapping Term Extractors for Multiple Languages Ahmet Aker, Monica Lestari Paramita, Emma Barker, Robert Gaizauskas Department of Computer Science, University of Sheffield Sheffield, S1 4DP, United

More information

Vesna Lušicky & Tanja Wissik. Translating and the Computer Conference

Vesna Lušicky & Tanja Wissik. Translating and the Computer Conference Vesna Lušicky & Tanja Wissik University of Vienna Translating and the Computer Conference London, 29 & 30 November 2012 LISE project Workflow research Highlights needs analysis LISE Service Version Two

More information

Semantic annotation of requirements for automatic UML class diagram generation

Semantic annotation of requirements for automatic UML class diagram generation www.ijcsi.org 259 Semantic annotation of requirements for automatic UML class diagram generation Soumaya Amdouni 1, Wahiba Ben Abdessalem Karaa 2 and Sondes Bouabid 3 1 University of tunis High Institute

More information

Direct-to-Company Feedback Implementations

Direct-to-Company Feedback Implementations SEM Experience Analytics Direct-to-Company Feedback Implementations SEM Experience Analytics Listening System for Direct-to-Company Feedback Implementations SEM Experience Analytics delivers real sentiment,

More information

Cloudwords Drupal Module. Quick Start Guide

Cloudwords Drupal Module. Quick Start Guide Cloudwords Drupal Module Quick Start Guide 1 Contents INTRO... 3 HOW IT WORKS... 3 BEFORE YOU INSTALL... 4 In Cloudwords... 4 In Drupal... 4 INSTALLING THE CLOUDWORDS DRUPAL MODULE... 5 OPTION ONE: Install

More information

D3.6: REVISED L3DATA FEDERATION PLATFORM RELEASE

D3.6: REVISED L3DATA FEDERATION PLATFORM RELEASE D3.6: REVISED L3DATA FEDERATION PLATFORM RELEASE David Lewis, Alfredo Maldonado, Brian Walshe, Kaniz Fatema, Anton Gerdelan, Arturo Calvo Distribution: Public Report Federated Active Linguistic data CuratiON

More information

JISC. Technical Review of Using Cloud for Research. Guidance Notes to Cloud Infrastructure Service Providers. Introduction

JISC. Technical Review of Using Cloud for Research. Guidance Notes to Cloud Infrastructure Service Providers. Introduction JISC Technical Review of Using Cloud for Research Guidance Notes to Cloud Infrastructure Service Providers May, 2010 Introduction Provisioning and maintenance of research computing facilities is a core

More information

D3.3.1: Sematic tagging and open data publication tools

D3.3.1: Sematic tagging and open data publication tools COMPETITIVINESS AND INNOVATION FRAMEWORK PROGRAMME CIP-ICT-PSP-2013-7 Pilot Type B WP3 Service platform integration and deployment in cloud infrastructure D3.3.1: Sematic tagging and open data publication

More information

How To Help The European Single Market With Data And Information Technology

How To Help The European Single Market With Data And Information Technology Connecting Europe for New Horizon European activities in the area of Big Data Márta Nagy-Rothengass DG CONNECT, Head of Unit "Data Value Chain" META-Forum 2013, 19 September 2013, Berlin OUTLINE 1. Data

More information

Acquia Introduction December 9th, 2009

Acquia Introduction December 9th, 2009 Acquia Introduction December 9 th, 2009 Agenda 1. Content Management 2. Web Application Framework 3. Architecture principles 1. Modular 2. Event driven 3. Skinnable 4. Secure 5. Accessible 4. Enterprise

More information

Automated Multilingual Text Analysis in the Europe Media Monitor (EMM) Ralf Steinberger. European Commission Joint Research Centre (JRC)

Automated Multilingual Text Analysis in the Europe Media Monitor (EMM) Ralf Steinberger. European Commission Joint Research Centre (JRC) Automated Multilingual Text Analysis in the Europe Media Monitor (EMM) Ralf Steinberger European Commission Joint Research Centre (JRC) https://ec.europa.eu/jrc/en/research-topic/internet-surveillance-systems

More information

aloe-project.de White Paper ALOE White Paper - Martin Memmel

aloe-project.de White Paper ALOE White Paper - Martin Memmel aloe-project.de White Paper Contact: Dr. Martin Memmel German Research Center for Artificial Intelligence DFKI GmbH Trippstadter Straße 122 67663 Kaiserslautern fon fax mail web +49-631-20575-1210 +49-631-20575-1030

More information

Elixir Business Analytics Platform and Data API Server for Harnessing Data for Value Creation CFC Presented by:

Elixir Business Analytics Platform and Data API Server for Harnessing Data for Value Creation CFC Presented by: Elixir Business Analytics Platform and Data API Server for Harnessing Data for Value Creation CFC Presented by: Lau Shih Hor Chief Executive Officer Elixir Technology About Elixir Technology Company Founded

More information

GEOFLUENT TRANSLATION MANAGEMENT SYSTEM

GEOFLUENT TRANSLATION MANAGEMENT SYSTEM DATA SHEET GEOFLUENT TRANSLATION MANAGEMENT SYSTEM Moving Translation to the Cloud To compete effectively in global markets, enterprises face the daunting challenge of translating large volumes of content

More information

SYSTRAN v6 Quick Start Guide

SYSTRAN v6 Quick Start Guide SYSTRAN v6 Quick Start Guide 2 SYSTRAN v6 Quick Start Guide SYSTRAN Business Translator SYSTRAN Premium Translator Congratulations on your SYSTRAN v6 purchase which uses the latest generation of language

More information

D3.1: SYSTEM TEST SUITE

D3.1: SYSTEM TEST SUITE D3.1: SYSTEM TEST SUITE Leroy Finn, David Lewis, Kevin Koidl Distribution: Public Report Federated Active Linguistic data CuratiON (FALCON) FP7- ICT- 2013- SME- DCA Project no: 610879 1 Document Information

More information

A web-based multilingual help desk

A web-based multilingual help desk LTC-Communicator: A web-based multilingual help desk Nigel Goffe The Language Technology Centre Ltd Kingston upon Thames Abstract Software vendors operating in international markets face two problems:

More information

TRANSLATIONS FOR A WORKING WORLD. 2. Translate files in their source format. 1. Localize thoroughly

TRANSLATIONS FOR A WORKING WORLD. 2. Translate files in their source format. 1. Localize thoroughly 4 Website Localization Tip Sheet 1. Localize thoroughly For your international audience, the content you publish should be in keeping with both your global marketing strategy and your local campaigns.

More information

Translation Management System

Translation Management System Translation Management System LINGOTEK PRODUCT BRIEF Table of Contents Current Lingotek Customers...2 The Challenge of Translation and Localization Today...3 Lingotek s Complete Offering...4 The Lingotek

More information

THE CCLRC DATA PORTAL

THE CCLRC DATA PORTAL THE CCLRC DATA PORTAL Glen Drinkwater, Shoaib Sufi CCLRC Daresbury Laboratory, Daresbury, Warrington, Cheshire, WA4 4AD, UK. E-mail: g.j.drinkwater@dl.ac.uk, s.a.sufi@dl.ac.uk Abstract: The project aims

More information

Document on information points

Document on information points ESCO (2015) SEC 052 FINAL Creation Date: 01/06/2015 Last update: 05/06/2015 Document on information points 1. Purpose of the document With this document the ESCO Secretariat updates the ESCO Maintenance

More information

Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines

Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines , 22-24 October, 2014, San Francisco, USA Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines Baosheng Yin, Wei Wang, Ruixue Lu, Yang Yang Abstract With the increasing

More information

Assigning Terms to Domains by Document Classification

Assigning Terms to Domains by Document Classification Assigning Terms to Domains by Document Classification Robert Gaizauskas, Emma Barker, Monica Lestari Paramita and Ahmet Aker Department of Computer Science, University of Sheffield, United Kingdom {r.gaizauskas,e.barker,m.paramita,a.aker}@sheffield.ac.uk

More information

City Data Pipeline. A System for Making Open Data Useful for Cities. stefan.bischof@tuwien.ac.at

City Data Pipeline. A System for Making Open Data Useful for Cities. stefan.bischof@tuwien.ac.at City Data Pipeline A System for Making Open Data Useful for Cities Stefan Bischof 1,2, Axel Polleres 1, and Simon Sperl 1 1 Siemens AG Österreich, Siemensstraße 90, 1211 Vienna, Austria {bischof.stefan,axel.polleres,simon.sperl}@siemens.com

More information

Middleware- Driven Mobile Applications

Middleware- Driven Mobile Applications Middleware- Driven Mobile Applications A motwin White Paper When Launching New Mobile Services, Middleware Offers the Fastest, Most Flexible Development Path for Sophisticated Apps 1 Executive Summary

More information

TAUS 2015. Membership Program (Executive Overview) write to memberservices@taus.net to request the 35 pages detailed service overview. www.taus.

TAUS 2015. Membership Program (Executive Overview) write to memberservices@taus.net to request the 35 pages detailed service overview. www.taus. TAUS 2015 Membership Program (Executive Overview) write to memberservices@taus.net to request the 35 pages detailed service overview www.taus.net Five Reasons to be a TAUS Member 1. Access the collaborative

More information

CLIR-Based Collaborative Construction of a Multilingual Terminological Dictionary for Cultural Resources

CLIR-Based Collaborative Construction of a Multilingual Terminological Dictionary for Cultural Resources CLIR-Based Collaborative Construction of a Multilingual Terminological Dictionary for Cultural Resources Mohammad DAOUD, Mohammad.Daoud@imag.fr 1 Asanobu KITAMOTO, Kitamoto@nii.ac.jp 2 Christian BOITET,

More information

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2 Volume 6, Issue 3, March 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue

More information

Multilingual Term Extraction as a Service from Acrolinx. Ben Gottesman Michael Klemme Acrolinx CHAT2013

Multilingual Term Extraction as a Service from Acrolinx. Ben Gottesman Michael Klemme Acrolinx CHAT2013 Multilingual Term Extraction as a Service from Acrolinx Ben Gottesman Michael Klemme Acrolinx CHAT2013 Definitions term extraction: automatically identifying potential terms in a document (corpus) multilingual

More information

ADVANTAGES AND DISADVANTAGES OF TRANSLATION MEMORY: A COST/BENEFIT ANALYSIS by Lynn E. Webb BA, San Francisco State University, 1992 Submitted in

ADVANTAGES AND DISADVANTAGES OF TRANSLATION MEMORY: A COST/BENEFIT ANALYSIS by Lynn E. Webb BA, San Francisco State University, 1992 Submitted in : A COST/BENEFIT ANALYSIS by Lynn E. Webb BA, San Francisco State University, 1992 Submitted in partial satisfaction of the requirements for the Degree of MASTER OF ARTS in Translation of German Graduate

More information

Gaia Reply TM as a Service. Mobile Living Framework

Gaia Reply TM as a Service. Mobile Living Framework Gaia Reply TM as a Service Mobile Living Framework GAIA Reply TM : Main Features The flexible, scalable and economical solution to develop and deliver multichannel mobile enterprise services Devices management

More information

Language Technologies in Europe: trends and future perspectives

Language Technologies in Europe: trends and future perspectives Language Technologies in Europe: trends and future perspectives European Commission Márta Nagy-Rothengass, DG CONNECT Data Value Chain Unit Berlin, 24 January 2013 META - Creation of language resources

More information

IBM Watson Ecosystem. Getting Started Guide

IBM Watson Ecosystem. Getting Started Guide IBM Watson Ecosystem Getting Started Guide Version 1.1 July 2014 1 Table of Contents: I. Prefix Overview II. Getting Started A. Prerequisite Learning III. Watson Experience Manager A. Assign User Roles

More information

Research meets practice: t-survey 2005

Research meets practice: t-survey 2005 Page 1 of 27 Research meets practice: t-survey 25 An online survey on terminology extraction and terminology management Daniel Zielinski (d.zielinski@mx.uni-saarland.de) Yamile Ramírez Safar (ramirez.yamile@googlemail.com)

More information

A Framework for Data Management for the Online Volunteer Translators' Aid System QRLex

A Framework for Data Management for the Online Volunteer Translators' Aid System QRLex Proceedings of PACLIC 19, the 19 th Asia-Pacific Conference on Language, Information and Computation. A Framework for Data Management for the Online Volunteer Translators' Aid System QRLex Youcef Bey,

More information

Safe Harbor Statement

Safe Harbor Statement Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment

More information

Planning a DITA CMS Deployment. Small Business Edition

Planning a DITA CMS Deployment. Small Business Edition Planning a DITA CMS Deployment Small Business Edition WWW.IXIASOFT.COM / DITACMS v. 4.2 / Copyright 2016 IXIASOFT Technologies. All rights reserved. Last revised: June 09, 2016 Table of contents 3 Table

More information

Developing Microsoft SharePoint Server 2013 Advanced Solutions

Developing Microsoft SharePoint Server 2013 Advanced Solutions Course 20489B: Developing Microsoft SharePoint Server 2013 Advanced Solutions Course Details Course Outline Module 1: Creating Robust and Efficient Apps for SharePoint In this module, you will review key

More information

Enhancing Document Review Efficiency with OmniX

Enhancing Document Review Efficiency with OmniX Xerox Litigation Services OmniX Platform Review Technical Brief Enhancing Document Review Efficiency with OmniX Xerox Litigation Services delivers a flexible suite of end-to-end technology-driven services,

More information

Cordys Business Operations Platform

Cordys Business Operations Platform SERVICE DEFINITION Cordys Business Operations GCloud IV - PaaS Copyright 2012 Cordys B.V. All rights reserved. Table of Content Cordys Business Operations... 1 Table of Content... 2 Introduction... 4 Document

More information

Property-based testing for Web Services. University of Kent

Property-based testing for Web Services. University of Kent Property-based testing for Web Services Simon Thompson University of Kent Thomas Arts Quviq Introduction EU PROWESS project Aims to improve testing, particularly for web services, through uptake and use

More information

Extracting and Preparing Metadata to Make Video Files Searchable

Extracting and Preparing Metadata to Make Video Files Searchable Extracting and Preparing Metadata to Make Video Files Searchable Meeting the Unique File Format and Delivery Requirements of Content Aggregators and Distributors Table of Contents Executive Overview...

More information

Language Services. Table of Contents

Language Services. Table of Contents Language Services LINGOTEK PRODUCT BRIEF Table of Contents Current Lingotek Customers...2 The Challenge of Translation and Localization Today...3 Lingotek s Complete Offering...4 The Lingotek Solution...5

More information

TERMS OF REFERENCE. Revamping of GSS Website. GSS Information Technology Directorate Application and Database Section

TERMS OF REFERENCE. Revamping of GSS Website. GSS Information Technology Directorate Application and Database Section TERMS OF REFERENCE Revamping of GSS Website GSS Information Technology Directorate Application and Database Section Tel: Accra 0302 682656 Cables: GHANASTATS In case of reply the number and date of this

More information

PROMT Technologies for Translation and Big Data

PROMT Technologies for Translation and Big Data PROMT Technologies for Translation and Big Data Overview and Use Cases Julia Epiphantseva PROMT About PROMT EXPIRIENCED Founded in 1991. One of the world leading machine translation provider DIVERSIFIED

More information

STAR Deutschland GmbH

STAR Deutschland GmbH STAR Deutschland GmbH Your partner for customised technical solutions in translation and information management STAR Group Your single-source partner for corporate product communication STAR Deutschland

More information

PROGRESS Portal Access Whitepaper

PROGRESS Portal Access Whitepaper PROGRESS Portal Access Whitepaper Maciej Bogdanski, Michał Kosiedowski, Cezary Mazurek, Marzena Rabiega, Malgorzata Wolniewicz Poznan Supercomputing and Networking Center April 15, 2004 1 Introduction

More information

READY FOR GLOBAL MARKETING A BRIEF GUIDE TO WEB CONTENT LOCALIZATION

READY FOR GLOBAL MARKETING A BRIEF GUIDE TO WEB CONTENT LOCALIZATION READY FOR GLOBAL MARKETING A BRIEF GUIDE TO WEB CONTENT LOCALIZATION OVERVIEW Over the last 20 years and despite the sometimes challenging economic conditions, living standards have been steadily improving

More information

Search Engine Technology and Digital Libraries: Moving from Theory t...

Search Engine Technology and Digital Libraries: Moving from Theory t... 1 von 5 09.09.2010 15:09 Search Engine Technology and Digital Libraries Moving from Theory to Practice Friedrich Summann Bielefeld University Library, Germany Norbert Lossau

More information

Design and Implementation of an Automatic Semantic Annotation Service

Design and Implementation of an Automatic Semantic Annotation Service Diploma Thesis Alina Kopp Oberseminar str. 1 76131 Karlsruhe Alina.Kopp@iitb.fraunhofer.de 27.02.2007 Saarbrücken Risk and Crisis Management Issues Common terminology Interoperability of data, information

More information

Course 20489B: Developing Microsoft SharePoint Server 2013 Advanced Solutions OVERVIEW

Course 20489B: Developing Microsoft SharePoint Server 2013 Advanced Solutions OVERVIEW Course 20489B: Developing Microsoft SharePoint Server 2013 Advanced Solutions OVERVIEW About this Course This course provides SharePoint developers the information needed to implement SharePoint solutions

More information

Appraise: an Open-Source Toolkit for Manual Evaluation of MT Output

Appraise: an Open-Source Toolkit for Manual Evaluation of MT Output Appraise: an Open-Source Toolkit for Manual Evaluation of MT Output Christian Federmann Language Technology Lab, German Research Center for Artificial Intelligence, Stuhlsatzenhausweg 3, D-66123 Saarbrücken,

More information

SHAREPOINT CI. Spots Business Opportunities, Eliminates Risk COMPETITIVE INTELLIGENCE FOR SHAREPOINT

SHAREPOINT CI. Spots Business Opportunities, Eliminates Risk COMPETITIVE INTELLIGENCE FOR SHAREPOINT RESULTS Receive key intelligence on clients, prospects, competitors and threats Optimize business strategies Act on critical business opportunities you can t afford to miss. Leverage your knowledge resources

More information

HOW CAN I TRANSFORM AN ONLINE SERVICE TO TRULY MULTILINGUAL?

HOW CAN I TRANSFORM AN ONLINE SERVICE TO TRULY MULTILINGUAL? HOW CAN I TRANSFORM AN ONLINE SERVICE TO TRULY MULTILINGUAL? 10 ways to become truly multilingual The Best Practices were defined in the context of www.organic-lingua.eu project based on the experience

More information

Property-based testing for Web Services. The PROWESS consortium

Property-based testing for Web Services. The PROWESS consortium Property-based testing for Web Services The PROWESS consortium EU PROWESS project Aims to improve testing, particularly for web services, through uptake and use of property-based testing (PBT). The tool

More information