Introduction to Unicode. By: Atif Gulzar Center for Research in Urdu Language Processing


1 Introduction to Unicode By: Atif Gulzar Center for Research in Urdu Language Processing

2 Introduction to Unicode
Why Unicode?
What is Unicode?
Unicode Architecture

3 Why Unicode?

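4 Pre-Unicode Standards and their Limitations
ASCII: a 7-bit code, first published in 1963 by the American Standards Association (the body later renamed ANSI)
ISO adopted ASCII in 1967 as ISO 646
ISO 2022: code-extension techniques for 7-bit and 8-bit codes
ISO 8859: a family of 15 published 8-bit standards (parts 1 to 16; part 12 was never published)
Code pages
And plenty of standards for East Asian languages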

5 ISO 8859 cont.
ISO 8859-1    Latin-1     Western European
ISO 8859-2    Latin-2     Eastern European
ISO 8859-3    Latin-3     Southern European
ISO 8859-4    Latin-4     Northern European
ISO 8859-5    Cyrillic    Russian, Bulgarian..
ISO 8859-6    Arabic      Arabic
ISO 8859-7    Greek       Greek
ISO 8859-8    Hebrew      Hebrew

6 ISO 8859
ISO 8859-9    Latin-5     Turkish
ISO 8859-10   Latin-6     Northern European
ISO 8859-11   Thai        Thai
ISO 8859-13   Latin-7     Baltic
ISO 8859-14   Latin-8     Celtic
ISO 8859-15   Latin-9     Western European
ISO 8859-16   Latin-10    Eastern European

7 ASCII
Upper case (A-Z)                      26
Digits (0-9)                          10
Space                                  1
Punctuation marks (.,+{)%)            32
Lower case (a-z)                      26
Control characters (tab, cr, lf)      33
======================================
Total                                128
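The tally on this slide can be re-derived by classifying all 128 code values. The sketch below is only an illustration: the function name ascii_census and the category labels are made up for this example, and "control" is taken to mean values 0 to 31 plus 127 (DEL).

```python
import string

def ascii_census() -> dict[str, int]:
    """Count the 128 ASCII code values by category (labels are illustrative)."""
    counts = {"upper": 0, "digits": 0, "space": 0,
              "punctuation": 0, "lower": 0, "control": 0}
    for value in range(128):
        ch = chr(value)
        if ch in string.ascii_uppercase:
            counts["upper"] += 1
        elif ch in string.digits:
            counts["digits"] += 1
        elif ch == " ":
            counts["space"] += 1
        elif ch in string.punctuation:
            counts["punctuation"] += 1
        elif ch in string.ascii_lowercase:
            counts["lower"] += 1
        else:
            # Everything left over is 0x00-0x1F or 0x7F (DEL).
            counts["control"] += 1
    return counts

if __name__ == "__main__":
    census = ascii_census()
    for label, count in census.items():
        print(f"{label:12} {count:3}")
    print(f"{'total':12} {sum(census.values()):3}")
```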

8 ASCII code page

9 ANSI 1252 code page

10 ANSI 1256 code page Arabic

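11 ANSI 1252 code page
[Figure: the single byte value C8 shown under different code pages. ANSI 1252 (Western Europe): È (U+00C8); Central Europe: Č; Arabic: ب; Greek: Θ; Cyrillic: И; Thai: ศ; Hebrew: glyph not recoverable]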
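The effect on this slide can be reproduced by decoding one byte under several code pages. This is a small sketch, not part of the original deck: it assumes Python's built-in codec names (cp1252, cp1250 and so on) correspond to the Windows code pages shown, and the helper name decode_everywhere is invented for the example.

```python
# One byte, several Windows code pages: the value 0xC8 from the slide.
RAW = b"\xc8"

# Keys are Python's built-in codec names; values are descriptive labels only.
CODE_PAGES = {
    "cp1252": "Western Europe",
    "cp1250": "Central Europe",
    "cp1251": "Cyrillic",
    "cp1253": "Greek",
    "cp1256": "Arabic",
    "cp874": "Thai",
}

def decode_everywhere(raw: bytes) -> dict[str, str]:
    """Decode the same bytes under each code page and return the results."""
    return {name: raw.decode(name) for name in CODE_PAGES}

if __name__ == "__main__":
    for name, text in decode_everywhere(RAW).items():
        print(f"{name:7} {CODE_PAGES[name]:15} U+{ord(text):04X} {text}")
```

The stored byte never changes; only the table used to interpret it does, which is why untagged data is so easy to corrupt in transit.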

12 Code page for Windows Characters not common to other codepages

13 The Code Page Problem cont.
Each character set is ASCII (common to all) plus one extension:
ANSI (Western Europe), Eastern Europe, Baltic, Greek, Cyrillic, Thai, Turkish, Arabic, Hebrew, etc.
ASCII + extended: extensions exist for many countries
Byte values of 128 and above change meaning from one code page to the next

14 The Code Page Problem cont.
Characters in most languages are traditionally represented by single-byte values
A single byte allows for 256 characters at most
The real limit for most encodings is 192 characters
This includes letters, digits, punctuation, symbols
When a system is used for a new language, the encoding has to be adapted to use that language's characters

15 The Code Page Problem Each language or group of languages gets its own encoding Different vendors or standards committees devise different encodings, so generally each language has several, often incompatible, encodings

16 Interoperability Problems
Can't easily mix languages in a document or system
Data is not tagged with its encoding, so loss can occur when transferring between systems
Most encodings are ASCII-based, so problems are often not seen with English-only data
Two possible solutions:
Systematic tagging of textual data with an encoding ID
A universal encoding standard with all languages' characters

17 What is Unicode?

18 Unicode or Universal Code
One universal code for every character, no matter what the platform, no matter what the program, no matter what the language.
Unicode is not just a bunch of code points
Initially it was a 2-byte code, which can support over 65,000 characters
The Unicode Standard, Version 4.0 provides codes for 96,447 characters
Adopted by ISO as ISO/IEC 10646

19 Principles of the Unicode Standard
Universality
Efficiency
Characters, not glyphs
Semantics
Plain text
Logical order
Unification
Convertibility (accurate)

20 Universality (Unicode Coverage)
European scripts: Latin, Greek, Cyrillic, Armenian, Georgian, IPA
Bidirectional (Middle Eastern) scripts: Hebrew, Arabic, Syriac, Thaana
Indic (Indian and Southeast Asian) scripts: Devanagari, Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada, Malayalam, Sinhala, Thai, Lao, Khmer, Myanmar, Tibetan, Philippine
East Asian scripts: Chinese (Han) characters, Japanese (Hiragana and Katakana), Korean (Hangul), Yi
Other modern scripts: Mongolian, Ethiopic, Cherokee, Canadian Aboriginal
Historical scripts: Runic, Ogham, Old Italic, Gothic, Deseret
Punctuation and symbols: numerals, math symbols, scientific symbols, arrows, blocks, geometric shapes, Braille, musical notation, etc.

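21 Characters and Glyphs
[Figure: the characters f + i displayed as the single ligature glyph ﬁ; the Arabic characters ل + ا displayed as the single ligature glyph لا]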

22 Plain Text
This is plain text
This is formatted text

23 Logical Data Ordering and Bidi Algorithm

24 Unification
The Unicode Standard avoids duplicate encoding of characters within a script across languages.
In Chinese, Japanese and Korean many ideographs are common, so they are encoded once.
The character code U+0059 (Y) is the same in English, German and French.

25 Character Semantics cont.
The Unicode Standard includes an extensive database that specifies a large number of character properties, including:
Name
Type (e.g., letter, digit, punctuation mark)
Decomposition
Case and case mappings (for cased letters)
Numeric value (for digits and numerals)
Combining class (for combining characters)
Directionality
Line-breaking behavior
Cursive joining behavior
For Chinese characters, mappings to various other standards
and many other properties

26 Character semantics
1781;KHMER LETTER KHA;Lo;0;L;;;;;N;;;;;
17BE;KHMER VOWEL SIGN OE;Mc;0;L;;;;;N;;;;;
17E5;KHMER DIGIT FIVE;Nd;0;L;;5;5;5;N;;;;;
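Records like the three on this slide are semicolon-separated lines with 15 fields each. The sketch below splits one record into named fields and cross-checks it against Python's standard unicodedata module. The field labels in FIELDS and the function name parse_record are informal names chosen for this example, not official property names.

```python
import unicodedata

# Informal labels for the 15 semicolon-separated fields of a record.
FIELDS = (
    "code", "name", "category", "combining_class", "bidi_class",
    "decomposition", "decimal", "digit", "numeric", "mirrored",
    "old_name", "comment", "uppercase", "lowercase", "titlecase",
)

def parse_record(line: str) -> dict[str, str]:
    """Split one character-database record into a field-name -> value dict."""
    values = line.strip().split(";")
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} fields, got {len(values)}")
    return dict(zip(FIELDS, values))

if __name__ == "__main__":
    record = parse_record("17E5;KHMER DIGIT FIVE;Nd;0;L;;5;5;5;N;;;;;")
    ch = chr(int(record["code"], 16))
    # The same properties, as exposed by the standard library.
    print(record["name"], "|", unicodedata.name(ch))
    print(record["category"], "|", unicodedata.category(ch))
    print(record["decimal"], "|", unicodedata.decimal(ch))
```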

27 Unicode Architecture

28 Unicode Architecture
Initially Unicode was designed for a 16-bit encoding space, consisting of 256 rows of 256 characters each
ISO/IEC 10646 was designed for a 32-bit encoding space of which 31 bits are used, so it has room for 2,147,483,648 characters
After Unicode came out, it became clear that a 16-bit encoding space is too small
By Unicode 3.0 the space had been widened to 21 bits, which allows for 1,114,112 characters

29 Encoding Space
Early versions of Unicode used 16 bits
Unicode now uses 21 bits
A 21-bit value divides into: plane number, row number, character number
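The plane / row / character split can be sketched with shifts and masks. This is an illustrative helper (the name split_code_point is invented here), assuming the usual layout of 17 planes, each with 256 rows of 256 characters.

```python
def split_code_point(cp: int) -> tuple[int, int, int]:
    """Return (plane, row, character number) for a Unicode code point."""
    if not 0 <= cp <= 0x10FFFF:
        raise ValueError("outside the Unicode encoding space")
    plane = cp >> 16          # 0x00 .. 0x10, i.e. 17 planes
    row = (cp >> 8) & 0xFF    # 256 rows per plane
    cell = cp & 0xFF          # 256 characters per row
    return plane, row, cell

if __name__ == "__main__":
    for cp in (0x0041, 0x1781, 0x1D11E, 0x10FFFF):
        plane, row, cell = split_code_point(cp)
        print(f"U+{cp:04X}: plane {plane:02X}, row {row:02X}, character {cell:02X}")
    # 17 planes x 256 rows x 256 characters gives the total quoted in the deck.
    print(17 * 256 * 256)
```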

30 The Unicode Encoding Space
[Figure: the 17 planes, numbered 0 to 10 (hex); plane 0 is the Basic Multilingual Plane]

31 The Unicode Encoding Space
[Figure: the same diagram; planes 1 to 10 (hex) are the Supplementary Planes]

32 The Unicode Encoding Space
[Figure: the same diagram with labels: Private Use Planes, Supplementary Special-Purpose Plane, Supplementary Ideographic Plane, Supplementary Multilingual Plane]

33 The Basic Multilingual Plane
[Figure: layout of the BMP by leading hex digit, 0 to F. Labelled areas: General Scripts Area, Symbols Area, CJK Punct., Han, Yi, Hangul, Surrogates Area, Private Use Area, Compatibility Area]

34 The General Scripts Area
[Figure: rows 00 to 1F of the BMP, in pairs (00/01, 02/03, ... 1E/1F). Labelled scripts: Latin, IPA, Diacriticals, Greek, Cyrillic, Armenian, Hebrew, Arabic, Syriac, Thaana, Devanagari, Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada, Malayalam, Sinhala, Thai, Lao, Tibetan, Myanmar, Georgian, Hangul, Ethiopic, Cherokee, Canadian Aboriginal Syllabics, Ogham, Runic, Philippine, Khmer, Mongolian, Latin, Greek]

35 Unicode Storage Formats (or Unicode Encodings)
UTF-32
UTF-16
UTF-8
UTF-7
CESU-8
UTF-EBCDIC
BOCU

36 Storage formats cont.
UTF-32: The 21-bit abstract Unicode value is simply zero-padded to 32 bits.
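Zero-padding can be shown in a couple of lines. The sketch below packs a code point as one big-endian 32-bit integer and compares it with Python's own utf-32-be codec; the function name utf32_be is made up for the example, and big-endian order without a byte-order mark is an assumption made to keep the output fixed.

```python
import struct

def utf32_be(cp: int) -> bytes:
    """Zero-pad a code point to 32 bits, most significant byte first."""
    if not 0 <= cp <= 0x10FFFF:
        raise ValueError("outside the Unicode encoding space")
    return struct.pack(">I", cp)

if __name__ == "__main__":
    for cp in (0x41, 0x1781, 0x1D11E):
        encoded = utf32_be(cp)
        # Should agree with the standard library codec for the same character.
        print(f"U+{cp:04X} -> {encoded.hex(' ')}",
              encoded == chr(cp).encode("utf-32-be"))
```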

37 Storage formats cont.
UTF-16:
For characters in the BMP, the 21-bit value is simply truncated to 16 bits.
For other characters, the 21-bit value is turned into a sequence of two 16-bit values called a surrogate pair.
A particular 16-bit value is either a BMP character, a high surrogate, or a low surrogate.
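The surrogate-pair rule can be sketched as follows. The function names utf16_units and classify_unit are illustrative; the constants 0x10000, 0xD800 and 0xDC00 are the standard UTF-16 ones.

```python
def utf16_units(cp: int) -> list[int]:
    """Return the 16-bit code unit(s) that represent a code point in UTF-16."""
    if not 0 <= cp <= 0x10FFFF or 0xD800 <= cp <= 0xDFFF:
        raise ValueError("not an encodable code point")
    if cp < 0x10000:
        return [cp]                      # BMP: the value fits in 16 bits as is
    offset = cp - 0x10000                # 20 bits remain
    high = 0xD800 | (offset >> 10)       # top 10 bits
    low = 0xDC00 | (offset & 0x3FF)      # bottom 10 bits
    return [high, low]

def classify_unit(unit: int) -> str:
    """Say which of the three disjoint ranges a 16-bit value falls in."""
    if 0xD800 <= unit <= 0xDBFF:
        return "high surrogate"
    if 0xDC00 <= unit <= 0xDFFF:
        return "low surrogate"
    return "BMP character"

if __name__ == "__main__":
    for cp in (0x1781, 0x1D11E):
        units = utf16_units(cp)
        print(f"U+{cp:04X}:", [f"{u:04X} ({classify_unit(u)})" for u in units])
```

Because the three ranges never overlap, a decoder that lands in the middle of a stream can tell at once whether it is looking at a whole character or half of a pair.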

38 Storage formats cont.
UTF-8:
For ASCII characters, the 21-bit value is truncated to 8 bits.
For other characters, the 21-bit value is turned into a sequence of two, three, or four 8-bit values.
Different numeric ranges are used for ASCII characters and for leading and trailing bytes.
Different ranges are used for leading bytes of different-length sequences.
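A hand-rolled encoder makes the byte ranges visible. This is a sketch for illustration only (utf8_bytes is an invented name); in practice the built-in str.encode("utf-8") does the same job, and the example checks itself against it.

```python
def utf8_bytes(cp: int) -> bytes:
    """Encode one code point as UTF-8, spelling out the bit layout."""
    if not 0 <= cp <= 0x10FFFF or 0xD800 <= cp <= 0xDFFF:
        raise ValueError("not an encodable code point")
    if cp < 0x80:                                   # ASCII: 0xxxxxxx
        return bytes([cp])
    if cp < 0x800:                                  # 110xxxxx 10xxxxxx
        return bytes([0xC0 | (cp >> 6),
                      0x80 | (cp & 0x3F)])
    if cp < 0x10000:                                # 1110xxxx 10xxxxxx 10xxxxxx
        return bytes([0xE0 | (cp >> 12),
                      0x80 | ((cp >> 6) & 0x3F),
                      0x80 | (cp & 0x3F)])
    return bytes([0xF0 | (cp >> 18),                # 11110xxx + three trailing bytes
                  0x80 | ((cp >> 12) & 0x3F),
                  0x80 | ((cp >> 6) & 0x3F),
                  0x80 | (cp & 0x3F)])

if __name__ == "__main__":
    for cp in (0x41, 0xC8, 0x1781, 0x1D11E):
        encoded = utf8_bytes(cp)
        print(f"U+{cp:04X} -> {encoded.hex(' ')}",
              encoded == chr(cp).encode("utf-8"))
```

Trailing bytes always fall in 0x80 to 0xBF, and the leading byte alone tells the decoder how many bytes follow.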

39 Detecting Unicode Storage Format
If the file starts with      The file contains
0xFE 0xFF                    UTF-16
0xFF 0xFE                    Byte-swapped UTF-16
0x00 0x00 0xFE 0xFF          UTF-32
0xFF 0xFE 0x00 0x00          Byte-swapped UTF-32
0xEF 0xBB 0xBF               UTF-8
0xDD 0x73 0x73 0x73          UTF-EBCDIC
0x0E 0xFE 0xFF               SCSU
Anything else                Non-Unicode or untagged Unicode
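The lookup on this slide can be sketched as a prefix test. The code below covers only the UTF rows of the table and uses an invented function name, sniff. One design point is an assumption worth stating: the 4-byte signatures are tested before the 2-byte ones, because FF FE 00 00 also begins with FF FE. A byte-swapped UTF-16 file whose first character is U+0000 would still be misread as UTF-32, so this is a heuristic rather than a guarantee.

```python
# Signatures ordered longest first so that FF FE 00 00 wins over FF FE.
SIGNATURES = [
    (b"\x00\x00\xfe\xff", "UTF-32"),
    (b"\xff\xfe\x00\x00", "byte-swapped UTF-32"),
    (b"\xdd\x73\x73\x73", "UTF-EBCDIC"),
    (b"\xef\xbb\xbf", "UTF-8"),
    (b"\xfe\xff", "UTF-16"),
    (b"\xff\xfe", "byte-swapped UTF-16"),
]

def sniff(data: bytes) -> str:
    """Guess the storage format from the leading bytes of a file."""
    for signature, label in SIGNATURES:
        if data.startswith(signature):
            return label
    return "non-Unicode or untagged Unicode"

if __name__ == "__main__":
    samples = {
        "utf-8-sig": "text".encode("utf-8-sig"),
        "utf-16 (BE, with mark)": b"\xfe\xff" + "text".encode("utf-16-be"),
        "utf-32 (LE, with mark)": b"\xff\xfe\x00\x00" + "text".encode("utf-32-le"),
        "plain ascii": b"text",
    }
    for name, data in samples.items():
        print(f"{name:24} -> {sniff(data)}")
```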

40 The Unicode standard
The Unicode standard consists of:
The standard text, published in book form (this includes a complete set of printed code charts)
The Unicode Character Database, a set of data files providing complete property information on every character
Various Web-published supplemental materials:
Unicode Standard Annexes (UAX): Amendments to the standard since the last book was published
Unicode Technical Standards (UTS): Allied standards maintained separately from Unicode itself
Unicode Technical Reports (UTR): Non-normative documents providing background info, implementation hints, or other useful information
Unicode Technical Notes (UTN): Other articles of

41 References
The Unicode Standard, Version 4.0, by the Unicode Consortium
Unicode Demystified, by Richard Gillam
Unicode Character Database
Unicode Charts

42 Questions?


A global leader in document translations Since 1993, Northwest Translations has been a global leader in providing exceptional high quality document translations with emphasis in the MEDICAL/LIFE SCIENCES, LEGAL, ENGINEERING, MARKETING/ADVERTISING

More information

The Virtual Tibetan Classroom

The Virtual Tibetan Classroom The Virtual Tibetan Classroom by William Magee, DDBC Thanks to a Generous Grant from the Taiwan National Science Council and the Hopkins MultimediaTibetan Research Archive Project http://haa.ddbc.edu.tw

More information

Points to Note. Chinese and English characters shall be coded in ISO/IEC 10646:2011, and the set of Chinese

Points to Note. Chinese and English characters shall be coded in ISO/IEC 10646:2011, and the set of Chinese General Format, Manner and Procedure for the Submission of Electronic Information under Law by virtue of the Electronic Transactions Ordinance (Chapter 553) Points to Note (This Note aims to set out the

More information

Yandex.Translate API Developer's guide

Yandex.Translate API Developer's guide 5.08.2015 .. Version 1.5 Document build date: 5.08.2015. This volume is a part of Yandex technical documentation. Yandex helpdesk site: http://help.yandex.ru 2008 2015 Yandex LLC. All rights reserved.

More information

Oracle Watchlist Screening

Oracle Watchlist Screening 1 Oracle Watchlist Screening Mike Matthews 3 rd party logo 2 Topics Screening trends & needs Increasing screening data accuracy Reducing false positives Screening international data

More information

Centricity Enterprise Web 3.0 DICOM Conformance Memo DOC0094970

Centricity Enterprise Web 3.0 DICOM Conformance Memo DOC0094970 DOC0094970 CONTENTS 1 Introduction... 3 1.1 Scope and Purpose... 3 1.2 Intended Audience... 3 1.3 Scope and Field of Application... 3 1.4 References... 4 1.5 Definitions... 4 1.6 Symbols and Abbreviations...

More information

GENERAL SERVICES ADMINISTRATION

GENERAL SERVICES ADMINISTRATION GENERAL SERVICES ADMINISTRATION ALLWORLD LANGUAGE CONSULTANTS AUTHORIZED FEDERAL SUPPLY SCHEDULE - 2012 1 SCHEDULE FOR TRANSLATION AND INTERPRETATION SERVICE FEDERAL SUPPLY GROUP 738 CLASS 7382 CONTRACT

More information

User Guide. Printing Unicode characters from SAP to SATO GT4xxe Printers. www.satoworldwide.com. Version 061030-02

User Guide. Printing Unicode characters from SAP to SATO GT4xxe Printers. www.satoworldwide.com. Version 061030-02 Printing Unicode characters from SAP to SATO GT4xxe Printers User Guide Version 061030-02 2006 SATO Corporation. All rights reserved. Table of Contents 1. Introduction... 3 2. Configuration at SAP environment...

More information

PHOTOSTORE 3 SERIES MANUAL TABLE OF CONTENTS

PHOTOSTORE 3 SERIES MANUAL TABLE OF CONTENTS PHOTOSTORE 3 SERIES MANUAL Manual Version 3.9.1 TABLE OF CONTENTS PHOTOSTORE 3 SERIES MANUAL TABLE OF CONTENTS INSTALLATION, SUPPORT, AND UPGRADES SECURITY USING THE STORE MANAGER HOME SETTINGS Backup

More information

Derby Translations. Translation for all languages. Active Knowledge. Team of Experts. Quality Is Our Priority. Competitive Prices.

Derby Translations. Translation for all languages. Active Knowledge. Team of Experts. Quality Is Our Priority. Competitive Prices. Active Knowledge Team of Experts Quality Is Our Priority Competitive Prices 100% Performance Fast Delivery Derby Translations is one of the UAE leading translation and language services providers, has

More information

Japanese Character Printers EPL2 Programming Manual Addendum

Japanese Character Printers EPL2 Programming Manual Addendum Japanese Character Printers EPL2 Programming Manual Addendum This addendum contains information unique to Zebra Technologies Japanese character bar code printers. The Japanese configuration printers support

More information

Product Internationalization of a Document Management System

Product Internationalization of a Document Management System Case Study Product Internationalization of a ì THE CUSTOMER A US-based provider of proprietary Legal s and Archiving solutions, with a customizable document management framework. The customer s DMS was

More information

Languages Supported. SpeechGear s products are being used to remove communications barriers throughout the world.

Languages Supported. SpeechGear s products are being used to remove communications barriers throughout the world. Languages Supported SpeechGear s products are being used to remove communications barriers throughout the world. Each of the following pages lists the languages that we currently support for that product.

More information

Bachelor of Science Degree Requirements for Students Fulfilling the REVISED General Education Curriculum

Bachelor of Science Degree Requirements for Students Fulfilling the REVISED General Education Curriculum Goal Bachelor of Science Degree Requirements for Students Fulfilling the REVISED General Education Curriculum The following lists semester courses. For quarter courses, use the approved quarter version;

More information

Your single-source partner for corporate product communication. Transit NXT Evolution. from Service Pack 0 to Service Pack 8

Your single-source partner for corporate product communication. Transit NXT Evolution. from Service Pack 0 to Service Pack 8 Transit NXT Evolution from Service Pack 0 to Service Pack 8 April 2009: Transit NXT Service Pack 0 (Version 4.0.0.671) Additional versions of DTP programs supported: InDesign CS3 and FrameMaker 9 Additional

More information

How to represent characters?

How to represent characters? Copyright Software Carpentry 2010 This work is licensed under the Creative Commons Attribution License See http://software-carpentry.org/license.html for more information. How to represent characters?

More information

Encoding script-specific writing rules based on the Unicode character set

Encoding script-specific writing rules based on the Unicode character set Encoding script-specific writing rules based on the Unicode character set Malek Boualem, Mark Leisher, Bill Ogden Computing Research Laboratory (CRL), New Mexico State University, Box 30001, Dept 3CRL,

More information

The use of binary codes to represent characters

The use of binary codes to represent characters The use of binary codes to represent characters Teacher s Notes Lesson Plan x Length 60 mins Specification Link 2.1.4/hi Character Learning objective (a) Explain the use of binary codes to represent characters

More information

Keyboards for inputting Japanese language -A study based on US patents

Keyboards for inputting Japanese language -A study based on US patents Keyboards for inputting Japanese language -A study based on US patents Umakant Mishra Bangalore, India umakant@trizsite.tk http://umakant.trizsite.tk (This paper was published in April 2005 issue of TRIZsite

More information

2014 HIGHER SCHOOL CERTIFICATE EXAMINATION TIMETABLE Monday 13 October to Wednesday 5 November

2014 HIGHER SCHOOL CERTIFICATE EXAMINATION TIMETABLE Monday 13 October to Wednesday 5 November 2014 HIGHER SCHOOL CERTIFICATE EXINATION TIMETABLE Monday 13 October to Wednesday 5 November Important information for students The following pages list information about all the written examinations for

More information

Fax Service QUICK START GUIDE

Fax Service QUICK START GUIDE Receiving Faxes To receive faxes in your e-mail, give out your Broadview Fax service fax number to all of your contacts. When people send a fax to your Broadview fax number, your faxes will arrive in your

More information

White Paper. Translation Quality - Understanding factors and standards. Global Language Translations and Consulting, Inc. Author: James W.

White Paper. Translation Quality - Understanding factors and standards. Global Language Translations and Consulting, Inc. Author: James W. White Paper Translation Quality - Understanding factors and standards Global Language Translations and Consulting, Inc. Author: James W. Mentele 1 Copyright 2008, All rights reserved. Executive Summary

More information

Supported Languages and Code Pages in Non-Unicode Systems. NW AS Internationalization

Supported Languages and Code Pages in Non-Unicode Systems. NW AS Internationalization Supported Languages and Code Pages in Non-Unicode Systems NW AS Internationalization Copyright Copyright 2005 SAP AG. All rights reserved. No part of this publication may be reproduced or transmitted in

More information

GLOBAL LEX Lexington Global Engagement Center

GLOBAL LEX Lexington Global Engagement Center GLOBAL LEX Lexington Global Engagement Center Isabel Gereda Taylor, JD, LLM Multicultural Affairs Coordinator Lexington Fayette Urban County Government itaylor@lexingtonky.gov US Census data shows only

More information

General Education Requirements for the Bachelor of Science Degree Semester Curriculum

General Education Requirements for the Bachelor of Science Degree Semester Curriculum General College Requirements General Education Requirements for the Bachelor of Science Degree Semester Curriculum Summer Term 2012 The following degree requirements apply if you began your career as a

More information

USER GUIDE: Trading Central Indicator for the MT4 platform

USER GUIDE: Trading Central Indicator for the MT4 platform USER GUIDE: Trading Central Indicator for the MT4 platform 1 Table of content Description...3 Installation process...4 Features and parameters...8 Upgrade process... 10 2 Description Trading Central Indicator

More information