Keyboard for inputting Chinese language -A study based on US patents


Umakant Mishra
Bangalore, India
umakant@trizsite.tk
http://umakant.trizsite.tk

(This paper was published in the April 2005 issue of TRIZsite Journal, http://trizsite.tk/trizsite/articles/default.asp?month=apr&year=2005. Also available at SSRN, http://papers.ssrn.com/abstract=932271, and http://dx.doi.org/10.2139/ssrn.932271)

Contents
1. Introduction
1.1 Technique of inputting Chinese character
1.2 Problems relating to inputting Chinese characters
2. Inventions of inputting Chinese characters
2.1 Method and apparatus for inputting radical-encoded Chinese characters (Patent 5119296)
2.2 Keyboard for processing Chinese language text (Patent 5893133)
3. Summary and conclusion
4. Reference

1. Introduction

1.1 Technique of inputting Chinese character

Because the structure of Chinese characters is very different from the relatively simple alphabetic systems of Western languages, it is difficult to input Chinese characters into a computer quickly and conveniently. Existing systems include those based on the "PinYin" (phonetic) system, a combination of the PinYin system and character form techniques, whole character encoding, stroke input encoding, and stroke form encoding.

The PinYin (Phonetic) system

The PinYin encoding system presents several advantages over other systems. First, it is already part of the curriculum of China's primary schools, so many people are familiar with it. Second, PinYin uses English letters as its encoding characters, so Chinese characters can be encoded directly on a standard English computer keyboard. However, since the output of the PinYin encoding system consists of Chinese characters, and not the PinYin forms of those characters, there are too many degenerate cases (that is, a single PinYin code that corresponds to several characters). Generally some software processing is used to compensate for this effect. In addition, for a language such as Chinese, with its many dialects and slang forms, there are simply too many Chinese characters for the user to remember or pronounce correctly.

The whole character system

The whole character system is based on a special hardware keyboard rather than the standard English language keyboard. The input speed of this keyboard is comparatively low, and it is not suitable for wide use.

Stroke input encoding system

The stroke input encoding system is easy to learn, but it presents too many degenerate cases and is not suitable for entering large amounts of data.

Stroke form encoding system

The stroke form encoding system is based on a process of first determining the radicals of the characters, then encoding them according to an established set of rules. The stroke form systems developed to date, however, have suffered from the lack of a suitably developed theoretical structure. As a result, they still suffer from a large number of synonym codes and slow input speed, and they are difficult to use.

1.2 Problems relating to inputting Chinese characters

Chinese character strokes and symbols are so varied and so complicated that they can be stored and grouped in a wide variety of ways. The conventional keyboard does not support the pictorial characters of the Chinese language. There are 3000 to 6000 commonly used pictorial Chinese characters (Hanzi). Chinese has many dialects and slang forms, which makes the language more difficult to remember and pronounce correctly. In a PinYin type of system, the user needs to know two aspects of each Chinese character, namely its pronunciation and its written form. This is inconvenient, as the user has to perform two operations.

The PinYin system of encoding has too many degenerate cases. Besides, in Taiwan, Chinese characters are denoted with different phonetic symbols, not English letters. Therefore, PinYin faces a special barrier in Taiwan and other Asian countries. Unlike English, Chinese text has no boundaries between words. For example, a sentence can be a contiguous string of ideograms, where one or more ideograms may form a word, without spaces between words. Some encoding systems, while high in speed, are poorly described, thereby leading to confusion. Some encoding systems fail to use the standard 26-letter English keyboard. For example, the whole character system uses a special hardware keyboard.

2. Inventions of inputting Chinese characters

2.1 Method and apparatus for inputting radical-encoded Chinese characters (Patent 5119296)

Background problem

Inputting Chinese characters into computers remains a problem because of the complicated structure of Chinese characters. The different systems of inputting Chinese characters, such as the PinYin system, the stroke input encoding system, and the stroke form encoding system, each have their own limitations.

Solution provided by the patent

Zheng et al. provided a computer encoding system for Chinese characters (patent 5119296, issued June 1992). The system analyzes different Chinese characters and identifies six basic strokes. In addition, the 26 most frequently used radicals are selected and classified into these six categories. Each of these radicals is assigned to an English letter key position on a standard computer keyboard. The invention uses the English letters that denote these single radicals as codes for Chinese characters, such that each Chinese character is encoded by four or fewer English letters. The codes input through the keyboard are converted to Chinese by a converter.

TRIZ based analysis

The invention tries to use the regular English keyboard for inputting Chinese characters (Principle-3: Local quality).
The Chinese characters in the regular script are classified into six basic strokes (Principle-1: Segmentation).
The 26 most frequently used radicals are mapped onto the 26 English letter key positions on the keyboard (Principle-28: Mechanics substitution).

2.2 Keyboard for processing Chinese language text (Patent 5893133)

Background problem

It is necessary to produce the characters of the Chinese language with a conventional keyboard. However, the conventional keyboard does not support pictorial characters such as those of Chinese. There are 3000 to 6000 commonly used Chinese characters (Hanzi) and many more that are rarely used. It is therefore difficult to enter Chinese characters through the keyboard for text processing.

Solution provided by the invention

Chenjun Julian Chen discloses a method of accurately and efficiently entering Chinese language text through the keyboard (patent 5893133, assigned to IBM, April 1999). The system has a keyboard with diacritic keys that permit the user to annotate each entered phonetic syllable with a diacritic that indicates the tone of the syllable. The user enters phonetic Chinese (Pinyin and BPMF) into a computer system, and the phonetic input is compared to a list of acceptable phonetic syllables and abbreviations. If the entered syllable is found on the list, it is displayed on a phonetic portion of the graphical display. When the user presses a delimiting character, the phonetic syllable is converted to the proper hanzi character and displayed on the screen.
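The input flow described for patent 5893133 can be illustrated with a minimal Python sketch. It is not the patented implementation: the syllable list, the hanzi table, the use of digit keys 1 to 4 in place of the diacritic tone keys, and the space as the delimiting character are all assumptions made for illustration, and the syllable is checked against the list only at conversion time rather than as it is typed.

```python
# Hypothetical toy tables; a real system would need a full syllable list
# and a much larger dictionary.
ACCEPTABLE_SYLLABLES = {"ni", "hao", "ma"}
HANZI_BY_SYLLABLE_AND_TONE = {
    ("ni", 3): "你",
    ("hao", 3): "好",
    ("ma", 1): "妈",
    ("ma", 3): "马",
}
TONE_KEYS = {"1": 1, "2": 2, "3": 3, "4": 4}  # assumed stand-ins for diacritic keys
DELIMITER = " "  # assumed delimiting character


def convert_syllable(syllable, tone):
    """Return the hanzi for an accepted syllable and tone, else the text as typed."""
    if syllable not in ACCEPTABLE_SYLLABLES:
        return syllable
    return HANZI_BY_SYLLABLE_AND_TONE.get((syllable, tone), syllable)


def convert_keystrokes(keys):
    """Process keystrokes one at a time, converting on each delimiter."""
    output = []
    syllable, tone = "", None
    for key in keys:
        if key in TONE_KEYS:
            tone = TONE_KEYS[key]          # tone annotation for the current syllable
        elif key == DELIMITER:
            if syllable:
                output.append(convert_syllable(syllable, tone))
            syllable, tone = "", None      # start the next syllable
        else:
            syllable += key                # ordinary phonetic letter
    if syllable:
        # No delimiter was pressed, so the pending text stays phonetic.
        output.append(syllable + (str(tone) if tone else ""))
    return "".join(output)


print(convert_keystrokes("ni3 hao3 "))
```

The tone annotation is what separates "ma1" from "ma3" in this toy table, which is how the sketch reduces the number of candidate characters per syllable.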

TRIZ based analysis

A standard keyboard should support easy input of all hanzi characters (Ideal Final Result). But there are more than 10,000 hanzi, which cannot all be generated from such a small keyboard (Contradiction).
The invention uses some special keys and key combinations to generate more characters out of the limited number of keys (Principle-40: Composite).
The invention uses a different intermediate form, phonetic Chinese, which has fewer characters (about 1300) than hanzi (Principle-24: Intermediary).

3. Summary and conclusion

The complexity of the Chinese language and of Chinese characters has made it a struggle to input Chinese characters into a computer. Entering Chinese characters and processing the Chinese language will remain a topic of continuing research and development in the future.

4. Reference:

1. US Patent 5119296, Method and apparatus for inputting radical-encoded Chinese characters, invented by Zheng, et al., assignee: nil, issued June 1992.
2. US Patent 5893133, Keyboard for processing Chinese language text, invented by Chenjun Julian Chen, assigned to IBM, issued April 1999.
3. US Patent and Trademark Office (USPTO) site, http://www.uspto.gov/