IDN - what s up? Patrik Fältström

Similar documents
Request for Comments: 3491 Category: Standards Track Viagenie March 2003

PROMOTION OF THE ARABIC DOMAIN NAME SYSTEM

Internationalizing the Domain Name System. Šimon Hochla, Anisa Azis, Fara Nabilla

Internationalization of Domain Names

How To Write A Domain Name In Unix (Unicode) On A Pc Or Mac (Windows) On An Ipo (Windows 7) On Pc Or Ipo 8.5 (Windows 8) On Your Pc Or Pc (Windows

Internet Engineering Task Force (IETF) Request for Comments: 7790 Category: Informational. February 2016

Introduction to Internationalized Domain Names (IDN)

Internationalized Domain Names -

NAME. Internationalized Domain Names (IDNs) -.IN Domain Registry. Policy Framework. Implementation

MULTILINGUILIZATION STANDARD. Wael Nasr Director, I-DNS.Net

IDN Server Proxy Architecture for Internationalized Domain Name Resolution and Experiences with Providing Web Services

Internationalization of Domain Names: A history of technology development

IDN FREQUENTLY ASKED QUESTIONS

Published by ICANN 7 June For Information Only

Agenda. Network Services. Domain Names. Domain Name. Domain Names Domain Name System Internationalized Domain Names. Domain Names & DNS

.ASIA CJK (Chinese Japanese Korean) IDN Policies

Domain Name Registration Policy

Four ACEs. A Survey of ASCII Compatible Encodings. International Unicode Conference 22 September 2002

.امارات (dotemarat) Arabic Domain Name Policy

Technical Integration Guide

Unicode in Mobile Phones

Internationalization of the Domain Name System: The Next Big Step in a Multilingual Internet

Chapter 4: Computer Codes

Right-to-Left Language Support in EMu

EAI Status. ( Address Internationalization)

The Proposal for Internationalizing cctld Names

Report on the Meeting of the Project Country Partners with Policy Makers

Order of introduction of «.ҚАЗ» domain name

ICANN Synthesis on Single-Character Domain Names at the Second-Level

My Services Online Service Support. User Guide for DNS and NTP services

Introduction to Unicode. By: Atif Gulzar Center for Research in Urdu Language Processing

Māori Language Domain Names

Unraveling Unicode: A Bag of Tricks for Bug Hunting

.eu Insights. EURid-UNESCO World report on Internationalised Domain Names deployment 2012

PROFESSOR AND HEAD DR. SARMAD HUSSAIN Al- Khwarizmi Institute of Computer Sciences University of Engineering and Technology, Lahore

Implementation of Internet Domain Names in Sinhala

Pre Delegation Testing (PDT) Frequently Asked Questions (FAQ)

Arabic Domain Names. Dr. Abdulaziz H. Al-Zoman Director of SaudiNIC Chairman of Steering Committee - ADN Pilot Project zoman@isu.net.sa.

Procedure to Develop and Maintain the Label Generation Rules for the Root Zone in Respect of IDNA Labels

<.bloomberg> gtld Registration Policies

Get me off Your Fucking Mailing List

The use of binary codes to represent characters

Hebrew Numbers. Number Values For Hebrew Letters. Modern versus Traditional Number Forms in Hebrew Writing. Hebrew Letters And Their Number Values

API of DNS hosting. For DNS-master and Secondary services Table of contents

Rendering/Layout Engine for Complex script. Pema Geyleg

Taipei Enterprise Sunrise Period Policy

ENUM. Patrik Fältström

IDN and SSL certificates

Project 4: IP over DNS Due: 11:59 PM, Dec 14, 2015

NASK.PL Registry & Registrars. partner@dns.pl

Bangla Localization of OpenOffice.org. Asif Iqbal Sarkar Research Programmer BRAC University Bangladesh

Spear Phishing. October 12, 2015 TLP: WHITE.

MPI Frequently Asked Questions

BorderWare Firewall Server 7.1. Release Notes

The future of International SEO. The future of Search Engine Optimization (SEO) for International Business

IDN: Challenges and Opportunities A registry s view of the multilingual web. Rome, March 2013!

The Unicode Standard Version 8.0 Core Specification

Designing Global Applications: Requirements and Challenges

Critical Values for I18n Testing. Tex Texin Chief Globalization Architect XenCraft

Global Registry Services Registrar Frequently Asked Questions (FAQ) for TLDs using Afilias Technology

Product Internationalization of a Document Management System

Points to Note. Chinese and English characters shall be coded in ISO/IEC 10646:2011, and the set of Chinese

The Application Layer. CS158a Chris Pollett May 9, 2007.

A URN Namespace for Network Resources

Commission on E-Business, IT and Telecoms Task Force on the Internet and IT Services

September 2009 Report #33

NICC ND 1633 V1.1.2 ( )

Komumaldata Barcoded PDF Forms PDF417 Barcode Generator Reference XFA (LiveCycle) Version Generator Version: 1.12

Proposed Update Unicode Technical Standard #39

How to represent characters?

.hitachi Domain Name Registration Policies

Chapter 6 Electronic Mail Security

Active Directory Integration with Blue Coat

Draft WGIG Issue Paper on the Multilingualization of

RelaViz Graph Visualization of Learned Relations Between Entities. Joel Ferstay - joelaf@cs.ubc.ca

Table of Contents DNS. How to package DNS messages. Wire? DNS on the wire. Some advanced topics. Encoding of domain names.

Best Practices in Domain Name Registry Solutions Understanding the Technical Requirements of ICANN's Applicant Guidebook

Unicode Security. Software Vulnerability Testing Guide. July 2009 Casaba Security, LLC

dra$-...- domain- names-

XML Character Encoding and Decoding

Taiwan Network Information Center Registration Policies and Rules for. 台 灣 Top-Level Domain Names in the Chinese Characters

Encoding script-specific writing rules based on the Unicode character set

HOMEWORK # 2 SOLUTIO

Internationalised Domain Names (IDNs) necessary or waste of money? IP and the Internet_ a KOLSTER Seminar_ Helsinki 31 May 2012.

Request For Comments: 1350 STD: 33 July 1992 Obsoletes: RFC 783

. 世 界 Registration Policy

Understanding SSL for Apps

Pemrograman Dasar. Basic Elements Of Java

.taipei Domain Registration Policy

Guidelines for exchange of danmarc2 records in MarcXchange

Motivation. Domain Name System (DNS) Flat Namespace. Hierarchical Namespace

Internationalizing JavaScript Applications Norbert Lindenberg. Norbert Lindenberg All rights reserved.

Network Working Group. Category: Standards Track Novell November 1997

NiceLabel Automation Version 1.5 Release Notes. Rev-1602

For the purpose of service extensions to SMTP, SMTP relays a mail object containing an envelope and a content.

Quectel Cellular Engine

Translating QueueMetrics into a new language

.AXA Domain Policy. As of March 3, 2014

Authority file comparison rules Introduction

CS 356 Lecture 27 Internet Security Protocols. Spring 2013

Transcription:

IDN - what s up? Patrik Fältström paf@cisco.com

In the beginning 3454 Preparation of Internationalized Strings ("stringprep"). P. Hoffman, M. Blanchet. December 2002. (Format: TXT=138684 bytes) (Status: PROPOSED STANDARD) 3490 Internationalizing Domain Names in Applications (IDNA). P. Faltstrom, P. Hoffman, A. Costello. March 2003. (Format: TXT=51943 bytes) (Status: PROPOSED STANDARD) 3491 Nameprep: A Stringprep Profile for Internationalized Domain Names (IDN). P. Hoffman, M. Blanchet. March 2003. (Format: TXT=10316 bytes) (Status: PROPOSED STANDARD) 3492 Punycode: A Bootstring encoding of Unicode for Internationalized Domain Names in Applications (IDNA). A. Costello. March 2003. (Format: TXT=67439 bytes) (Status: PROPOSED STANDARD)

What is this? 3454 Specifies overall algorithm - stringprep 3490 Specifies IDN algorithm - IDNA 3491 Specifies Nameprep 3492 Specifies Punycode

stringprep With profiles, any Unicode based string can be converted to another Unicode string so that they can be compared Include illegal codepoints Include mapping table Give ability to create profiles Used for IDN, LDAP and other protocols

idna Algorithm for how to convert a domain name with Unicode codepoints to ascii How to use the stringprep profile and unicode Includes specification on how to handle unallocated codepoints core to IDN standard

nameprep Specific stringprep profile for unicode based domain names Convert a domain name with unicode codepoints to one of Illegal domain name Domain name with Unicode codepoints

punycode Converts a label with unicode codepoints to a domain name in ascii Example: fältström xn--fltstrm-5wa1o

What happened? 4690 Review and Recommendations for Internationalized Domain Names (IDNs). J. Klensin, P. Faltstrom, C. Karp, IAB. September 2006. (Format: TXT=100929 bytes) (Status: INFORMATIONAL)

In short... Explains the problems in the earlier standards Bidirectional scripts Non-spacing codepoints Explains the problems with scripts not yet created when IDNA was written Explains problem with versioning of Unicode Old standard based on Unicode 3.2

Example If a label include a character that has right to left directionality, both first and last character of the string has to have right to left directionality Creates problem if for example the string ends with a codepoint with no directionality

יי ווא U+05D9 HEBREW LETTER YOD (R) U+05D9 HEBREW LETTER YOD (R) U+05B4 HEBREW POINT HIRIQ (NSM) U+05D5 HEBREW LETTER VAV (R) U+05D5 HEBREW LETTER VAV (R) U+05D0 HEBREW LETTER ALEF (R) U+05B8 HEBREW POINT QAMATS (NSM) Note that last codepoint has no directionality (Non Spacing Mark)

יי ווא U+05D9 HEBREW LETTER YOD (R) U+05D9 HEBREW LETTER YOD (R) U+05B4 HEBREW POINT HIRIQ (NSM) U+05D5 HEBREW LETTER VAV (R) U+05D5 HEBREW LETTER VAV (R) U+05D0 HEBREW LETTER ALEF (R) U+05B8 HEBREW POINT QAMATS (NSM) Note that last codepoint has no directionality (Non Spacing Mark)

New IDN standard Will consist of a few documents Will not change punycode Backward compatible Some incompatibilities exists

New documents draft-ietf-idnabis-rationale draft-ietf-idnabis-protocol draft-ietf-idna-bidi draft-ietf-idnabis-tables draft-ietf-idnabis-mappings

draft-ietf-idnabis-rationale In fact named Rationale and issues... Addresses the concerns in the IAB document RFC 4690 Explain how the issues are resolved

draft-ietf-idnabis-protocol Replaces the IDNA specification Core specification of new IDN standard

draft-ietf-idna-bidi Gives specifics for bidirectional scripts

draft-ietf-idnabis-tables Defines algorithm to use to calculate whether a codepoint in Unicode is in one of the categories PVALID (Protocol Valid) CONTEXTO / CONTEXTJ DISALLOWED UNASSIGNED

But IDNA2003 had mappings Mappings are not part of IDNA200x Labels MUST be stable under NFC Codepoints in label MUST pass bidi requirements Codepoints MUST be ok according to algorithm specified in tables document (which might include contextual rules) We MIGHT see a separate document on mapping, recommended behaviour for different applications etc

Why is this needed? IDNA standard must be independent of Unicode version IDNA standard must handle bidirectional scripts...plus other things mentioned in RFC 4690

When will it be ready? Within 6 months Actually, it is approved by IESG Mailing list: idna-update@alvestrand.no

Patrik Fältström paf@cisco.com