Getting Off to a Good Start: Best Practices for Terminology




Getting Off to a Good Start: Best Practices for Terminology
Technologies for term bases, term extraction and term checks
Angelika Zerfass, zerfass@zaac.de

Tools in the Terminology Life Cycle
- Extraction tools for the creation of monolingual or bilingual term lists
- Setup of terminology databases / term lists
- Terminology checks during document creation and translation

Extraction: Manual extraction
- Slowest but best way to extract terminology according to your goal
- Extraction of the most important terms, company-specific terms and new important terms (depends on the knowledge of the extractor)
- Terminology components of editors and translation tools allow the collection of terms and term pairs

Extraction: Concordance
- Creates a list of ALL terms (single words or word combinations) with frequencies and context
- Large term list / manual selection of terms required
- Monolingual
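A concordance of this kind can be sketched in a few lines of Python. This is a minimal illustration, not how any particular tool works: the function name `build_concordance`, the `window` parameter and the sample text are assumptions made for the example, and only single words are handled, not word combinations.

```python
import re
from collections import defaultdict

def build_concordance(text, window=3):
    """Map every word to the list of contexts in which it occurs."""
    tokens = re.findall(r"\w+", text.lower())
    concordance = defaultdict(list)
    for i, token in enumerate(tokens):
        left = tokens[max(0, i - window):i]
        right = tokens[i + 1:i + 1 + window]
        # Mark the keyword with brackets so it stands out in its context
        concordance[token].append(" ".join(left + [f"[{token}]"] + right))
    return dict(concordance)

# Hypothetical sample text
sample = "Press the selector key. The selector key opens the menu."
concordance = build_concordance(sample, window=2)

# List ALL words with their frequency and contexts, most frequent first
for word, contexts in sorted(concordance.items(), key=lambda kv: -len(kv[1])):
    print(f"{word} ({len(contexts)}): {contexts}")
```

Because every word is listed, including function words such as "the", the output still has to be reduced to real terms by hand.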

Extraction: Statistical extraction
- Monolingual or bilingual
- Suitable for every language / language combination (for example from a translation memory)
- The larger the collection of extraction material, the better the extracted lists
- Stop word lists
- Context sentences
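The idea behind statistical extraction can be illustrated with a simple frequency count over word sequences, filtered by a stop word list. This is only a sketch under assumptions: real tools use more refined statistics, and the stop word list, the `min_freq` threshold, the function name `extract_candidates` and the sample sentences are all made up for the example.

```python
import re
from collections import Counter

# Illustrative stop word list; a real list is much longer and language specific
STOP_WORDS = {"the", "a", "an", "of", "to", "and", "is", "in", "on"}

def extract_candidates(sentences, max_len=2, min_freq=2):
    """Count word sequences of up to max_len words and keep the frequent ones."""
    counts = Counter()
    for sentence in sentences:
        tokens = re.findall(r"\w+", sentence.lower())
        for n in range(1, max_len + 1):
            for i in range(len(tokens) - n + 1):
                gram = tokens[i:i + n]
                # Skip candidates that start or end with a stop word
                if gram[0] in STOP_WORDS or gram[-1] in STOP_WORDS:
                    continue
                counts[" ".join(gram)] += 1
    return [(term, freq) for term, freq in counts.most_common() if freq >= min_freq]

# Hypothetical extraction material
corpus = [
    "Press the selector key to open the menu.",
    "The selector key is on the monitor.",
    "Turn off the monitor.",
]
candidates = extract_candidates(corpus)
for term, freq in candidates:
    print(freq, term)
```

With only three sentences the counts are barely meaningful, which is why a larger collection of extraction material gives better lists.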

Extraction: Linguistic extraction
- Rules for analysis of source (and target) language
- Suitable for those languages that the system was programmed for
- Background dictionaries
- Monolingual / bilingual

Terminology Databases
- Database systems and term lists to be used with editors (document creation) and translation tools
- Online term bases for company-wide terminology lookup
- Contain company-specific and subject-matter-specific terms and additional information
- Definitions, notes, product names, customer names, forbidden synonyms

Term Base Entry
[Screenshot of a term base entry; visible labels: Category, Synonyms, Category, Text information]
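One way to picture such an entry is as a small data structure that groups the terms for one concept together with the additional information. The sketch below is an assumption for illustration only: the class names, the field names and the status values "preferred" and "forbidden" are hypothetical and do not reflect the data model of any specific term base. Marking "Auswahlschalter" as forbidden is likewise just an example.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Term:
    text: str
    language: str
    status: str = "preferred"  # hypothetical values: "preferred", "forbidden"

@dataclass
class TermEntry:
    definition: str = ""
    note: str = ""
    terms: List[Term] = field(default_factory=list)

    def translations(self, language, include_forbidden=False):
        """Return the terms of this entry for one language."""
        return [
            t.text
            for t in self.terms
            if t.language == language
            and (include_forbidden or t.status != "forbidden")
        ]

# Illustrative entry; the definition text is invented for the example
entry = TermEntry(
    definition="Key used to choose an option",
    terms=[
        Term("selector key", "en"),
        Term("Auswahltaste", "de"),
        Term("Auswahlschalter", "de", status="forbidden"),
    ],
)
print(entry.translations("de"))
```

Keeping forbidden synonyms inside the same entry is what later allows a check to flag them.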

Term Retrieval
- During translation, terms from the text are looked up automatically in the term base / term list
- The translation(s) for the term are offered to the translator
- Additional information (for example the status of a term) can be shown
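The automatic lookup can be sketched as a search for every term base entry inside the segment being translated. The dictionary layout, the status label "approved" and the function name `retrieve_terms` are assumptions for this example. The sketch matches whole words only and ignores case, so it would miss inflected forms.

```python
import re

# Hypothetical term base: source term -> list of (translation, status)
TERM_BASE = {
    "selector key": [("Auswahltaste", "approved")],
    "building": [("Gebäude", "approved")],
}

def retrieve_terms(segment, term_base):
    """Find term base entries in a segment, in order of appearance."""
    hits = []
    for source_term, translations in term_base.items():
        pattern = r"\b" + re.escape(source_term) + r"\b"
        for match in re.finditer(pattern, segment, flags=re.IGNORECASE):
            hits.append((match.start(), source_term, translations))
    return sorted(hits, key=lambda hit: hit[0])

hits = retrieve_terms("Press the Selector Key inside the building.", TERM_BASE)
for position, term, translations in hits:
    print(position, term, translations)
```

The translator still decides whether an offered translation fits the context.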

Term Retrieval

Terminology Checks: During document creation
- Consistent use of terminology
- Use of forbidden terms

Terminology Checks: During translation
- Use of forbidden terms
- Use of terms that are not in the term base / term list

Terminology Checks: After translation
- Check of bilingual files: translation memories, translation files
- Check for:
  - Terms that do not have the translation found in the term base
  - Terms that use a translation which is marked as forbidden in the term base
  - Terms that have several translations in the term base
  - Terms that are missing a translation in the term base
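The four checks can be sketched as a function that compares one source/target segment pair against the term base. Everything here is an assumption for illustration: the dictionary layout, the status labels, the issue messages and the function names are hypothetical, and "Bauwerk" is added only to show an entry with several translations.

```python
import re

# Hypothetical term base: source term -> {translation: status}
TERM_BASE = {
    "selector key": {"Auswahltaste": "approved", "Auswahlschalter": "forbidden"},
    "building": {"Gebäude": "approved", "Bauwerk": "approved"},
    "menu": {},  # entry without a translation
}

def contains(text, term):
    """Whole-word, case-insensitive search."""
    pattern = r"\b" + re.escape(term) + r"\b"
    return re.search(pattern, text, flags=re.IGNORECASE) is not None

def check_segment(source, target, term_base):
    """Return a list of (source term, issue) pairs for one segment pair."""
    issues = []
    for term, translations in term_base.items():
        if not contains(source, term):
            continue
        if not translations:
            issues.append((term, "missing translation in term base"))
            continue
        approved = [t for t, s in translations.items() if s == "approved"]
        forbidden = [t for t, s in translations.items() if s == "forbidden"]
        if len(approved) > 1:
            issues.append((term, "several translations in term base"))
        if any(contains(target, t) for t in forbidden):
            issues.append((term, "forbidden translation used"))
        if not any(contains(target, t) for t in approved):
            issues.append((term, "term base translation not found"))
    return issues

print(check_segment("Press the selector key.",
                    "Drücken Sie den Auswahlschalter.", TERM_BASE))
```

Because the comparison is purely string based, an inflected or compounded form of a correct translation would be reported as an issue, so the results need review.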

Term Check Results
- Wrong translation (building = Gebäude)
- Several translations
- Missing translation in term database

Term Check Results
- Forbidden term (Monitor)
- Wrong term (Auswahlschalter)
- No target (menu)

Term Check Results
- Wrong translation (selector key = Auswahltaste, not Auswahlschalter)

Technologies in Terminology
- The tools are NOT intelligent; they can only extract, retrieve or check according to rules. This can lead to:
  - Extraction of terms that are not important
  - Retrieval of translations that do not fit the context
  - Marking something as an error that is not really wrong
- There always needs to be a human to check what those tools do

Questions?
Angelika Zerfass, zerfass@zaac.de